Preface


We  began  this  project  in  1968  to  fill  a  gap  we  each  felt  acutely  after  several 
years  of  teaching  introductory  solid  state  physics  to  Cornell  students  of  physics, 
chemistry,  engineering,  and  materials  science.  In  both  undergraduate  and  graduate 
courses  we  had  to  resort  to  a  patchwork  array  of  reading  assignments,  assembled 
from  some  half  dozen  texts  and  treatises.  This  was  only  partly  because  of  the  great 
diversity  of  the  subject;  the  main  problem  lay  in  its  dual  nature.  On  the  one  hand  an 
introduction  to  solid  state  physics  must  describe  in  some  detail  the  vast  range  of  real 
solids,  with  an  emphasis  on  representative  data  and  illustrative  examples.  On  the 
other  hand  there  is  now  a  well-established  basic  theory  of  solids,  with  which  any 
seriously  interested  student  must  become  familiar. 

Rather to our surprise, it has taken us seven years to produce what we needed: a
single introductory text presenting both aspects of the subject, descriptive and analytical.
Our aim has been to explore the variety of phenomena associated with the
major forms of crystalline matter, while laying the foundation for a working understanding
of solids through clear, detailed, and elementary treatments of fundamental
theoretical concepts.

Our  book  is  designed  for  introductory  courses  at  either  the  undergraduate  or 
graduate  level.1  Statistical  mechanics  and  the  quantum  theory  lie  at  the  heart  of  solid 
state  physics.  Although  these  subjects  are  used  as  needed,  we  have  tried,  especially  in 
the more elementary chapters, to recognize that many readers, particularly undergraduates,
will not yet have acquired expertise. When it is natural to do so, we have
clearly  separated  topics  based  entirely  on  classical  methods  from  those  demanding  a 
quantum  treatment.  In  the  latter  case,  and  in  applications  of  statistical  mechanics,  we 
have  proceeded  carefully  from  explicitly  stated  first  principles.  The  book  is  therefore 
suitable  for  an  introductory  course  taken  concurrently  with  first  courses  in  quantum 
theory  and  statistical  mechanics.  Only  in  the  more  advanced  chapters  and  appendices 
do  we  assume  a  more  experienced  readership. 

The  problems  that  follow  each  chapter  are  tied  rather  closely  to  the  text,  and  are  of 
three  general  kinds:  (a)  routine  steps  in  analytical  development  are  sometimes 
relegated to problems, partly to avoid burdening the text with formulas of no intrinsic
interest,  but,  more  importantly,  because  such  steps  are  better  understood  if  completed 
by  the  reader  with  the  aid  of  hints  and  suggestions;  (b)  extensions  of  the  chapter 
(which  the  spectre  of  a  two  volume  work  prevented  us  from  including)  are  presented 
as problems when they lend themselves to this type of exposition; (c) further numerical
and analytical applications are given as problems, either to communicate additional
information or to exercise newly acquired skills. Readers should therefore examine
the problems, even if they do not intend to attempt their solution.

1 Suggestions for how to use the text in courses of varying length and level are given on pp. xviii-xxi.

Although  we  have  respected  the  adage  that  one  picture  is  worth  a  thousand  words, 
we  are  also  aware  that  an  uninformative  illustration,  though  decorative,  takes  up  the 
space  that  could  usefully  be  filled  by  several  hundred.  The  reader  will  thus  encounter 
stretches  of  expository  prose  unrelieved  by  figures,  when  none  are  necessary,  as  well  as 
sections  that  can  profitably  be  perused  entirely  by  looking  at  the  figures  and  their 
captions. 

We  anticipate  use  of  the  book  at  different  levels  with  different  areas  of  major 
emphasis.  A  particular  course  is  unlikely  to  follow  the  chapters  (or  even  selected 
chapters)  in  the  order  in  which  they  are  presented  here,  and  we  have  written  them  in  a 
way  that  permits  easy  selection  and  rearrangement.2  Our  particular  choice  of  sequence 
follows  certain  major  strands  of  the  subject  from  their  first  elementary  exposition  to 
their  more  advanced  aspects,  with  a  minimum  of  digression. 

We  begin  the  book3  with  the  elementary  classical  [1]  and  quantum  [2]  aspects  of 
the  free  electron  theory  of  metals  because  this  requires  a  minimum  of  background  and 
immediately  introduces,  through  a  particular  class  of  examples,  almost  all  of  the 
phenomena  with  which  theories  of  insulators,  semiconductors,  and  metals  must  come 
to  grips.  The  reader  is  thereby  spared  the  impression  that  nothing  can  be  understood 
until  a  host  of  arcane  definitions  (relating  to  periodic  structures)  and  elaborate 
quantum  mechanical  explorations  (of  periodic  systems)  have  been  mastered. 

Periodic structures are introduced only after a survey [3] of those metallic properties
that can and cannot be understood without investigating the consequences
of periodicity. We have tried to alleviate the tedium induced by a first exposure
to the language of periodic systems by (a) separating the very important consequences
of purely translational symmetry [4, 5] from the remaining but rather less
essential rotational aspects [7], (b) separating the description in ordinary space [4]
from that in the less familiar reciprocal space [5], and (c) separating the abstract
and descriptive treatment of periodicity from its elementary application to X-ray
diffraction [6].

Armed  with  the  terminology  of  periodic  systems,  readers  can  pursue  to  whatever 
point  seems  appropriate  the  resolution  of  the  difficulties  in  the  free  electron  model  of 
metals or, alternatively, can embark directly upon the investigation of lattice vibrations.
The book follows the first line. Bloch's theorem is described and its implications
examined  [8]  in  general  terms,  to  emphasize  that  its  consequences  transcend  the 
illustrative  and  very  important  practical  cases  of  nearly  free  electrons  [9]  and  tight 
binding  [10].  Much  of  the  content  of  these  two  chapters  is  suitable  for  a  more  advanced 
course,  as  is  the  following  survey  of  methods  used  to  compute  real  band  structures 
[11].  The  remarkable  subject  of  semiclassical  mechanics  is  introduced  and  given 
elementary applications [12] before being incorporated into the more elaborate semiclassical
theory of transport [13]. The description of methods by which Fermi surfaces
are measured [14] may be more suitable for advanced readers, but much of the survey
of the band structures of actual metals [15] is readily incorporated into an elementary
course.

2 The Table on pp. xix-xxi lists the prerequisites for each chapter, to aid those interested primarily
in one aspect of the subject, or those preferring a different order of presentation.

3 References to chapter numbers are given in brackets.

Except  for  the  discussion  of  screening,  an  elementary  course  might  also  bypass  the 
essays  on  what  is  overlooked  by  the  relaxation-time  approximation  [16]  and  by  the 
neglect of electron-electron interactions [17].

Work  functions  and  other  surface  properties  [18]  can  be  taken  up  at  any  time 
after  the  discussion  of  translational  symmetry  in  real  space.  Our  description  of  the 
conventional  classification  of  solids  [19]  has  been  separated  from  the  analysis  of 
cohesive energies [20]. Both have been placed after the introduction to band structure,
because  it  is  in  terms  of  electronic  structure  that  the  categories  are  most  clearly 
distinguished. 

To  motivate  the  study  of  lattice  vibrations  (at  whatever  point  after  Chapter  5 
readers  choose  to  begin  the  subject)  a  summary  [21]  lists  those  solid  properties  that 
cannot  be  understood  without  their  consideration.  Lattice  dynamics  is  given  an 
elementary  introduction,  with  the  classical  [22]  and  quantum  [23]  aspects  of  the 
harmonic  crystal  treated  separately.  The  ways  in  which  phonon  spectra  are  measured 
[24],  the  consequences  of  anharmonicity  [25],  and  the  special  problems  associated 
with  phonons  in  metals  [26]  and  ionic  crystals  [27]  are  surveyed  at  an  elementary 
level,  though  some  parts  of  these  last  four  chapters  might  well  be  reserved  for  a  more 
advanced  course.  None  of  the  chapters  on  lattice  vibrations  rely  on  the  use  of  normal 
mode  raising  and  lowering  operators;  these  are  described  in  several  appendices  for 
readers  wanting  a  more  advanced  treatment. 

Homogeneous  [28]  and  inhomogeneous  [29]  semiconductors  can  be  examined  at 
any  point  after  the  introduction  of  Bloch’s  theorem  and  the  elementary  discussion  of 
semiclassical  mechanics.  Crystalline  defects  [30]  can  be  studied  as  soon  as  crystals 
themselves  have  been  introduced,  though  parts  of  earlier  chapters  are  occasionally 
referred  to. 

Following  a  review  of  atomic  magnetism,  we  examine  how  it  is  modified  in  a  solid 
environment  [31],  explore  exchange  and  other  magnetic  interactions  [32],  and  apply 
the resulting models to magnetic ordering [33]. This brief introduction to magnetism
and  the  concluding  essay  on  superconductivity  [34]  are  largely  self-contained.  They 
are  placed  at  the  end  of  the  book  so  the  phenomena  can  be  viewed,  not  in  terms  of 
abstract  models  but  as  striking  properties  of  real  solids. 

To  our  dismay,  we  discovered  that  it  is  impossible  at  the  end  of  a  seven  year  project, 
labored  upon  not  only  at  Cornell,  but  also  during  extended  stays  in  Cambridge, 
London, Rome, Wellington, and Jülich, to recall all the occasions when students,
postdoctoral  fellows,  visitors,  and  colleagues  gave  us  invaluable  criticism,  advice, 
and  instruction.  Among  others  we  are  indebted  to  V.  Ambegaokar,  B.  W.  Batterman, 
D. Beaglehole, R. Bowers, A. B. Bringer, C. di Castro, R. G. Chambers, G. V.
Chester, R. M. Cotts, R. A. Cowley, G. Eilenberger, D. B. Fitchen, C. Friedli,
V. Heine, R. L. Henderson, D. F. Holcomb, R. O. Jones, B. D. Josephson, J. A.
Krumhansl, C. A. Kukkonen, D. C. Langreth, W. L. McLean, H. Mahr, B. W.
Maxfield, R. Monnier, L. G. Parratt, O. Penrose, R. O. Pohl, J. J. Quinn, J. J.
Rehr, M. V. Romerio, A. L. Ruoff, G. Russakoff, H. S. Sack, W. L. Schaich, J. R.
Schrieffer, J. W. Serene, A. J. Sievers, J. Silcox, R. H. Silsbee, J. P. Straley, D. M.
Straus, D. Stroud, K. Sturm, and J. W. Wilkins.

One person, however, has influenced almost every chapter. Michael E. Fisher,
Horace White Professor of Chemistry, Physics, and Mathematics, friend and neighbor,
gadfly and troubadour, began to read the manuscript six years ago and has
followed  ever  since,  hard  upon  our  tracks,  through  chapter,  and,  on  occasion, 
through  revision  and  re-revision,  pouncing  on  obscurities,  condemning  dishonesties, 
decrying  omissions,  labeling  axes,  correcting  misspellings,  redrawing  figures,  and 
often  making  our  lives  very  much  more  difficult  by  his  unrelenting  insistence  that  we 
could  be  more  literate,  accurate,  intelligible,  and  thorough.  We  hope  he  will  be 
pleased  at  how  many  of  his  illegible  red  marginalia  have  found  their  way  into  our  text, 
and  expect  to  be  hearing  from  him  about  those  that  have  not. 

One  of  us  (NDM)  is  most  grateful  to  the  Alfred  P.  Sloan  Foundation  and  the 
John  Simon  Guggenheim  Foundation  for  their  generous  support  at  critical  stages  of 
this  project,  and  to  friends  at  Imperial  College  London  and  the  Istituto  di  Fisica 
“G.  Marconi,”  where  parts  of  the  book  were  written.  He  is  also  deeply  indebted  to 
R.  E.  Peierls,  whose  lectures  converted  him  to  the  view  that  solid  state  physics  is  a 
discipline  of  beauty,  clarity,  and  coherence.  The  other  (NWA),  having  learnt  the 
subject  from  J.  M.  Ziman  and  A.  B.  Pippard,  has  never  been  in  need  of  conversion. 
He  also  wishes  to  acknowledge  with  gratitude  the  support  and  hospitality  of  the 
Kernforschungsanlage Jülich, the Victoria University of Wellington, and the
Cavendish Laboratory and Clare Hall, Cambridge.


Ithaca 
June  1975 


N.  W.  Ashcroft 
N.  D.  Mermin 
Suggestions for Using the Book

It  is  in  the  nature  of  books  that  chapters  must  be  linearly  ordered.  We 
have tried to select a sequence that least obscures several interwoven lines of development.
The accompanying Table (pp. xix-xxi) is designed to help readers with
special  interests  (in  lattice  vibrations,  semiconductors,  or  metals,  for  example)  or 
teachers  of  courses  with  particular  constraints  on  time  or  level. 

The prerequisites for each chapter are given in the Table according to the following
conventions: (a) If “M” is listed after Chapter N, then the contents of Chapter M (as
well, of course, as its essential prerequisites) are essential for understanding much or
all of Chapter N; (b) If “(M)” appears after Chapter N, then Chapter M is not an
essential prerequisite: either a small part of Chapter N is based on Chapter M, or a
few parts of Chapter M may be of some help in reading Chapter N; (c) The absence of
“M” or “(M)” after Chapter N does not mean that no reference back to Chapter M
is made; however, such references as may occur are primarily because N illuminates
the subject of M, rather than because M aids in the development of N.1

The rest of the Table indicates how the book might be used in a one-semester
(40  to  50  lectures)  or  two-semester  (80  to  100  lectures)  introductory  undergraduate 
course.  Chapters  (or  selections  from  a  chapter)  are  listed  for  reading2  if  they  are 
almost  entirely  descriptive,  or,  alternatively,  if  we  felt  that  an  introductory  course 
should  at  least  make  students  aware  of  a  topic,  even  if  time  was  not  available  for  its 
careful  exploration.  The  order  of  presentation  is,  of  course,  flexible.  For  example, 
although  a  two-semester  course  could  follow  the  order  of  the  book,  one  might  well 
prefer  to  follow  the  pattern  of  the  one-semester  course  for  the  first  term,  filling  in  the 
more  advanced  topics  in  the  second. 

An introductory course at the graduate level, or a graduate course following a
one-semester undergraduate survey, would probably make use of sections omitted in the
two-semester  undergraduate  course,  as  well  as  many  of  the  sixteen  appendices. 


1 Thus to proceed to Chapter 12 with a minimum of digression it is necessary to read Chapters 8, 5, 4,
2, and 1.

2 Students would presumably be asked to read the chapters bearing on the lectures as well.

Chapter 1 The Drude Theory of Metals


Metals occupy a rather special position in the study of solids, sharing a variety of
striking  properties  that  other  solids  (such  as  quartz,  sulfur,  or  common  salt)  lack. 
They  are  excellent  conductors  of  heat  and  electricity,  are  ductile  and  malleable,  and 
display  a  striking  luster  on  freshly  exposed  surfaces.  The  challenge  of  accounting  for 
these  metallic  features  gave  the  starting  impetus  to  the  modern  theory  of  solids. 

Although the majority of commonly encountered solids are nonmetallic, metals
have continued to play a prominent role in the theory of solids from the late nineteenth
century  to  the  present  day.  Indeed,  the  metallic  state  has  proved  to  be  one  of  the  great 
fundamental  states  of  matter.  The  elements,  for  example,  definitely  favor  the  metallic 
state:  over  two  thirds  are  metals.  Even  to  understand  nonmetals  one  must  also 
understand  metals,  for  in  explaining  why  copper  conducts  so  well,  one  begins  to 
learn  why  common  salt  does  not. 

During the last hundred years physicists have tried to construct simple models of
the  metallic  state  that  account  in  a  qualitative,  and  even  quantitative,  way  for  the 
characteristic  metallic  properties.  In  the  course  of  this  search  brilliant  successes  have 
appeared  hand  in  hand  with  apparently  hopeless  failures,  time  and  again.  Even  the 
earliest  models,  though  strikingly  wrong  in  some  respects,  remain,  when  properly 
used,  of  immense  value  to  solid  state  physicists  today. 

In  this  chapter  we  shall  examine  the  theory  of  metallic  conduction  put  forth  by 
P. Drude1 at the turn of the century. The successes of the Drude model were considerable,
and it is still used today as a quick practical way to form simple pictures
and  rough  estimates  of  properties  whose  more  precise  comprehension  may  require 
analysis  of  considerable  complexity.  The  failures  of  the  Drude  model  to  account  for 
some  experiments,  and  the  conceptual  puzzles  it  raised,  defined  the  problems  with 
which the theory of metals was to grapple over the next quarter century. These found
their  resolution  only  in  the  rich  and  subtle  structure  of  the  quantum  theory  of  solids. 

BASIC  ASSUMPTIONS  OF  THE  DRUDE  MODEL 

J.  J.  Thomson's  discovery  of  the  electron  in  1897  had  a  vast  and  immediate  impact 
on theories of the structure of matter, and suggested an obvious mechanism for conduction
in metals. Three years after Thomson's discovery Drude constructed his
theory  of  electrical  and  thermal  conduction  by  applying  the  highly  successful  kinetic 
theory  of  gases  to  a  metal,  considered  as  a  gas  of  electrons. 

In  its  simplest  form  kinetic  theory  treats  the  molecules  of  a  gas  as  identical  solid 
spheres, which move in straight lines until they collide with one another.2 The time
taken up by a single collision is assumed to be negligible, and, except for the forces
coming  momentarily  into  play  during  each  collision,  no  other  forces  are  assumed  to 
act  between  the  particles. 

1 Annalen der Physik 1, 566 and 3, 369 (1900).

2 Or with the walls of the vessel containing them, a possibility generally ignored in discussing metals
unless one is interested in very fine wires, thin sheets, or effects at the surface.

Figure 1.1
(a) Schematic picture of an isolated atom (not to scale). (b) In a metal the nucleus and ion
core retain their configuration in the free atom, but the valence electrons leave the atom to
form the electron gas. [Legend, panel (a): nucleus, core electrons, valence electrons;
panel (b): nucleus, core, conduction electrons.]

Although there is only one kind of particle present in the simplest gases, in a metal
there must be at least two, for the electrons are negatively charged, yet the metal is
electrically neutral. Drude assumed that the compensating positive charge was attached
to much heavier particles, which he considered to be immobile. At his time,
however,  there  was  no  precise  notion  of  the  origin  of  the  light,  mobile  electrons  and 
the  heavier,  immobile,  positively  charged  particles.  The  solution  to  this  problem  is 
one  of  the  fundamental  achievements  of  the  modern  quantum  theory  of  solids.  In 
this  discussion  of  the  Drude  model,  however,  we  shall  simply  assume  (and  in  many 
metals  this  assumption  can  be  justified)  that  when  atoms  of  a  metallic  element  are 
brought  together  to  form  a  metal,  the  valence  electrons  become  detached  and  wander 
freely  through  the  metal,  while  the  metallic  ions  remain  intact  and  play  the  role  of  the 
immobile  positive  particles  in  Drude's  theory.  This  model  is  indicated  schematically 
in Figure 1.1. A single isolated atom of the metallic element has a nucleus of charge
$eZ_a$, where $Z_a$ is the atomic number and $e$ is the magnitude of the electronic charge3:
$e = 4.80 \times 10^{-10}$ electrostatic units (esu) $= 1.60 \times 10^{-19}$ coulombs. Surrounding
the nucleus are $Z_a$ electrons of total charge $-eZ_a$. A few of these, $Z$, are the relatively
weakly bound valence electrons. The remaining $Z_a - Z$ electrons are relatively tightly
bound to the nucleus, play much less of a role in chemical reactions, and are known
as the core electrons. When these isolated atoms condense to form a metal, the core
electrons remain bound to the nucleus to form the metallic ion, but the valence
electrons are allowed to wander far away from their parent atoms. In the metallic
context they are called conduction electrons.4


3 We shall always take e to be a positive number.

4 When, as in the Drude model, the core electrons play a passive role and the ion acts as an indivisible
inert entity, one often refers to the conduction electrons simply as “the electrons,” saving the full term for
times when the distinction between conduction and core electrons is to be emphasized.

Drude applied kinetic theory to this “gas” of conduction electrons of mass m, which
(in  contrast  to  the  molecules  of  an  ordinary  gas)  move  against  a  background  of  heavy 
immobile  ions.  The  density  of  the  electron  gas  can  be  calculated  as  follows: 

A metallic element contains $0.6022 \times 10^{24}$ atoms per mole (Avogadro’s number)
and $\rho_m/A$ moles per cm$^3$, where $\rho_m$ is the mass density (in grams per cubic centimeter)
and $A$ is the atomic mass of the element. Since each atom contributes $Z$ electrons,
the number of electrons per cubic centimeter, $n = N/V$, is

$$n = 0.6022 \times 10^{24}\,\frac{Z\rho_m}{A}. \tag{1.1}$$
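As a quick numerical check of the density formula (1.1), in which n is Avogadro's number times Z times the mass density divided by the atomic mass, the following Python sketch evaluates it for copper. The function name `electron_density` is a hypothetical helper, and the inputs for copper (mass density about 8.96 g/cm^3, atomic mass about 63.55, Z = 1) are assumed handbook values rather than figures given in the text.

```python
AVOGADRO = 0.6022e24  # atoms per mole, as quoted in the text


def electron_density(z, rho_m, a):
    """Conduction electrons per cm^3 from Eq. (1.1).

    z     -- number of conduction electrons contributed per atom
    rho_m -- mass density in g/cm^3
    a     -- atomic mass in g/mol
    """
    return AVOGADRO * z * rho_m / a


# Assumed handbook inputs for copper (not taken from the text).
n_cu = electron_density(z=1, rho_m=8.96, a=63.55)
print(f"n(Cu) = {n_cu / 1e22:.2f} x 10^22 per cm^3")
```

With these assumed inputs the result falls within about one percent of the value tabulated for Cu in Table 1.1; the small difference reflects the choice of density data.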

Table 1.1 shows the conduction electron densities for some selected metals.
They are typically of order $10^{22}$ conduction electrons per cubic centimeter, varying
from $0.91 \times 10^{22}$ for cesium up to $24.7 \times 10^{22}$ for beryllium.5 Also listed in
Table 1.1 is a widely used measure of the electronic density, $r_s$, defined as the
radius of a sphere whose volume is equal to the volume per conduction electron.
Thus

$$\frac{V}{N} = \frac{1}{n} = \frac{4\pi r_s^3}{3}; \qquad r_s = \left(\frac{3}{4\pi n}\right)^{1/3}. \tag{1.2}$$

Table 1.1 lists $r_s$ both in angstroms ($10^{-8}$ cm) and in units of the Bohr radius $a_0 =
\hbar^2/me^2 = 0.529 \times 10^{-8}$ cm; the latter length, being a measure of the radius of a
hydrogen atom in its ground state, is often used as a scale for measuring atomic
distances. Note that $r_s/a_0$ is between 2 and 3 in most cases, although it ranges between
3 and 6 in the alkali metals (and can be as large as 10 in some metallic compounds).
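The definition of r_s as the radius of the sphere whose volume equals the volume per conduction electron, 1/n, can be checked against the tabulated values. The sketch below inverts 1/n = 4πr_s^3/3 for sodium, using n = 2.65 × 10^22 per cm^3 from Table 1.1 and the Bohr radius quoted in the text. The helper name `rs_from_density` is an assumption made for illustration.

```python
import math

BOHR_RADIUS_CM = 0.529e-8  # a_0 as quoted in the text
ANGSTROM_CM = 1.0e-8


def rs_from_density(n):
    """Radius (cm) of the sphere whose volume equals 1/n, with n in cm^-3."""
    return (3.0 / (4.0 * math.pi * n)) ** (1.0 / 3.0)


n_na = 2.65e22  # sodium at 5 K, from Table 1.1
rs_na = rs_from_density(n_na)
rs_na_angstrom = rs_na / ANGSTROM_CM
rs_na_bohr = rs_na / BOHR_RADIUS_CM
print(f"r_s = {rs_na_angstrom:.2f} angstrom = {rs_na_bohr:.2f} a_0")
```

The two numbers agree with the sodium row of Table 1.1 (2.08 Å and 3.93) to the precision quoted there.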

These  densities  are  typically  a  thousand  times  greater  than  those  of  a  classical  gas 
at  normal  temperatures  and  pressures.  In  spite  of  this  and  in  spite  of  the  strong 
electron-electron  and  electron-ion  electromagnetic  interactions,  the  Drude  model 
boldly  treats  the  dense  metallic  electron  gas  by  the  methods  of  the  kinetic  theory  of 
a  neutral  dilute  gas,  with  only  slight  modifications.  The  basic  assumptions  are  these: 

1. Between collisions the interaction of a given electron, both with the others
and with the ions, is neglected. Thus in the absence of externally applied electromagnetic
fields each electron is taken to move uniformly in a straight line. In the
presence of externally applied fields each electron is taken to move as determined
by Newton’s laws of motion in the presence of those external fields, but neglecting
the additional complicated fields produced by the other electrons and ions.6 The
neglect of electron-electron interactions between collisions is known as the independent
electron approximation. The corresponding neglect of electron-ion interactions
is known as the free electron approximation. We shall find in subsequent chapters that
although the independent electron approximation is in many contexts surprisingly
good, the free electron approximation must be abandoned if one is to arrive at even
a qualitative understanding of much of metallic behavior.

5 This is the range for metallic elements under normal conditions. Higher densities can be attained
by application of pressure (which tends to favor the metallic state). Lower densities are found in compounds.

6 Strictly speaking, the electron-ion interaction is not entirely ignored, for the Drude model implicitly
assumes that the electrons are confined to the interior of the metal. Evidently this confinement is
brought about by their attraction to the positively charged ions. Gross effects of the electron-ion and
electron-electron interaction like this are often taken into account by adding to the external fields a suitably
defined internal field representing the average effect of the electron-electron and electron-ion interactions.

Table 1.1
FREE ELECTRON DENSITIES OF SELECTED METALLIC ELEMENTS (a)

ELEMENT      Z    n (10^22/cm^3)    r_s (Å)    r_s/a_0
Li (78 K)    1     4.70             1.72       3.25
Na (5 K)     1     2.65             2.08       3.93
K (5 K)      1     1.40             2.57       4.86
Rb (5 K)     1     1.15             2.75       5.20
Cs (5 K)     1     0.91             2.98       5.62
Cu           1     8.47             1.41       2.67
Ag           1     5.86             1.60       3.02
Au           1     5.90             1.59       3.01
Be           2    24.7              0.99       1.87
Mg           2     8.61             1.41       2.66
Ca           2     4.61             1.73       3.27
Sr           2     3.55             1.89       3.57
Ba           2     3.15             1.96       3.71
Nb           1     5.56             1.63       3.07
Fe           2    17.0              1.12       2.12
Mn (α)       2    16.5              1.13       2.14
Zn           2    13.2              1.22       2.30
Cd           2     9.27             1.37       2.59
Hg (78 K)    2     8.65             1.40       2.65
Al           3    18.1              1.10       2.07
Ga           3    15.4              1.16       2.19
In           3    11.5              1.27       2.41
Tl           3    10.5              1.31       2.48
Sn           4    14.8              1.17       2.22
Pb           4    13.2              1.22       2.30
Bi           5    14.1              1.19       2.25
Sb           5    16.5              1.13       2.14

(a) At room temperature (about 300 K) and atmospheric pressure, unless
otherwise noted. The radius r_s of the free electron sphere is defined in Eq. (1.2).
We have arbitrarily selected one value of Z for those elements that display
more than one chemical valence. The Drude model gives no theoretical
basis for the choice. Values of n are based on data from R. W. G. Wyckoff,
Crystal Structures, 2nd ed., Interscience, New York, 1963.

2.  Collisions  in  the  Drude  model,  as  in  kinetic  theory,  are  instantaneous  events 
that  abruptly  alter  the  velocity  of  an  electron.  Drude  attributed  them  to  the  electrons 
bouncing  off  the  impenetrable  ion  cores  (rather  than  to  electron-electron  collisions, 
the  analogue  of  the  predominant  collision  mechanism  in  an  ordinary  gas).  We  shall 
find  later  that  electron-electron  scattering  is  indeed  one  of  the  least  important  of  the 
several scattering mechanisms in a metal, except under unusual conditions. However,
the simple mechanical picture (Figure 1.2) of an electron bumping along from ion to
ion is very far off the mark.7 Fortunately, this does not matter for many purposes:
a qualitative (and often a quantitative) understanding of metallic conduction can
be achieved by simply assuming that there is some scattering mechanism, without
inquiring too closely into just what that mechanism might be. By appealing, in our
analysis, to only a few general effects of the collision process, we can avoid committing
ourselves to any specific picture of how electron scattering actually takes place. These
broad features are described in the following two assumptions.

Figure 1.2
Trajectory of a conduction electron scattering off the
ions, according to the naive picture of Drude.

3. We shall assume that an electron experiences a collision (i.e., suffers an abrupt
change in its velocity) with a probability per unit time $1/\tau$. We mean by this that the
probability of an electron undergoing a collision in any infinitesimal time interval of
length $dt$ is just $dt/\tau$. The time $\tau$ is variously known as the relaxation time, the collision
time, or the mean free time, and it plays a fundamental role in the theory of metallic
conduction. It follows from this assumption that an electron picked at random at a
given moment will, on the average, travel for a time $\tau$ before its next collision, and
will, on the average, have been traveling for a time $\tau$ since its last collision.8 In the
simplest applications of the Drude model the collision time $\tau$ is taken to be independent
of an electron’s position and velocity. We shall see later that this turns out
to be a surprisingly good assumption for many (but by no means all) applications.

4. Electrons are assumed to achieve thermal equilibrium with their surroundings
only through collisions.9 These collisions are assumed to maintain local thermodynamic
equilibrium in a particularly simple way: immediately after each collision
an  electron  is  taken  to  emerge  with  a  velocity  that  is  not  related  to  its  velocity 
just  before  the  collision,  but  randomly  directed  and  with  a  speed  appropriate  to  the 
temperature  prevailing  at  the  place  where  the  collision  occurred.  Thus  the  hotter  the 
region  in  which  a  collision  occurs,  the  faster  a  typical  electron  will  emerge  from  the 
collision. 
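The collision statistics stated in assumption 3 can be checked numerically. The following is a minimal sketch, not part of the original argument: it assumes that the collisions of a single electron form a Poisson process with rate 1/τ (so that the gaps between collisions are exponentially distributed), and the function name and its parameters are illustrative choices.

```python
import bisect
import random


def collision_time_statistics(tau=1.0, n_collisions=100_000, n_probes=20_000, seed=0):
    """Estimate the mean time since the last and until the next collision.

    Assumption: collisions occur with probability dt/tau in any interval dt,
    so successive collision times are separated by exponential waiting times.
    """
    rng = random.Random(seed)
    times = []
    t = 0.0
    for _ in range(n_collisions):
        t += rng.expovariate(1.0 / tau)  # waiting time to the next collision
        times.append(t)

    since_total = 0.0
    until_total = 0.0
    for _ in range(n_probes):
        probe = rng.uniform(times[0], times[-1])  # a moment picked at random
        i = bisect.bisect_right(times, probe)     # index of the next collision
        i = min(i, len(times) - 1)                # guard the probe == times[-1] edge case
        since_total += probe - times[i - 1]
        until_total += times[i] - probe
    return since_total / n_probes, until_total / n_probes


since_last, until_next = collision_time_statistics(tau=1.0)
print(f"mean time since last collision: {since_last:.3f} tau")
print(f"mean time until next collision: {until_next:.3f} tau")
```

Both averages should come out close to τ, in line with the statement above and with Problem 1.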

In  the  rest  of  this  chapter  we  shall  illustrate  these  notions  through  their  most 
important  applications,  noting  the  extent  to  which  they  succeed  or  fail  to  describe 
the  observed  phenomena. 

DC  ELECTRICAL  CONDUCTIVITY  OF  A  METAL 

According to Ohm's law, the current I flowing in a wire is proportional to the potential
drop V along the wire: V = IR, where R, the resistance of the wire, depends on its
dimensions, but is independent of the size of the current or potential drop. The Drude
model accounts for this behavior and provides an estimate of the size of the resistance.

7 For some time people were led into difficult but irrelevant problems connected with the proper
aiming of an electron at an ion in each collision. So literal an interpretation of Figure 1.2 is strenuously
to be avoided.

8 See Problem 1.

9 Given the free and independent electron approximation, this is the only possible mechanism left.

One generally eliminates the dependence of R on the shape of the wire by introducing
a quantity characteristic only of the metal of which the wire is composed. The
resistivity ρ is defined to be the proportionality constant between the electric field
E at a point in the metal and the current density j that it induces10:

$$\mathbf{E} = \rho\,\mathbf{j}. \tag{1.3}$$

The current density j is a vector, parallel to the flow of charge, whose magnitude is
the amount of charge per unit time crossing a unit area perpendicular to the flow.
Thus if a uniform current I flows through a wire of length L and cross-sectional area
A, the current density will be j = I/A. Since the potential drop along the wire will be
V = EL, Eq. (1.3) gives V = IρL/A, and hence R = ρL/A.
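As a quick illustration of R = ρL/A, the sketch below evaluates the resistance of a copper wire. The wire dimensions (1 m long, 1 mm² in cross section) are arbitrary choices made for the example; the resistivity is the 273 K copper entry of Table 1.2, converted to SI units.

```python
def wire_resistance(resistivity_ohm_m, length_m, area_m2):
    """Resistance R = rho * L / A of a uniform wire, in SI units."""
    return resistivity_ohm_m * length_m / area_m2


# Copper at 273 K: 1.56 microhm-cm (Table 1.2), converted to ohm-m.
rho_cu = 1.56e-6 * 1e-2

# Illustrative geometry (an assumption of this example): 1 m long, 1 mm^2 cross section.
resistance = wire_resistance(rho_cu, length_m=1.0, area_m2=1.0e-6)
print(f"R = {resistance:.4f} ohm")
```

The result is a few hundredths of an ohm, and doubling the length or halving the area doubles it, while ρ itself stays a property of the metal alone.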

If n electrons per unit volume all move with velocity v, then the current density
they give rise to will be parallel to v. Furthermore, in a time dt the electrons will
advance by a distance v dt in the direction of v, so that n(v dt)A electrons will cross
an area A perpendicular to the direction of flow. Since each electron carries a charge
-e, the charge crossing A in the time dt will be -nevA dt, and hence the current
density is

$$\mathbf{j} = -ne\mathbf{v}. \tag{1.4}$$

At  any  point  in  a  metal,  electrons  are  always  moving  in  a  variety  of  directions 
with  a  variety  of  thermal  energies.  The  net  current  density  is  thus  given  by  (1.4), 
where  v  is  the  average  electronic  velocity.  In  the  absence  of  an  electric  field,  electrons 
are  as  likely  to  be  moving  in  any  one  direction  as  in  any  other,  v  averages  to  zero, 
and,  as  expected,  there  is  no  net  electric  current  density.  In  the  presence  of  a  field  E, 
however,  there  will  be  a  mean  electronic  velocity  directed  opposite  to  the  field  (the 
electronic  charge  being  negative),  which  we  can  compute  as  follows: 

Consider a typical electron at time zero. Let t be the time elapsed since its last
collision. Its velocity at time zero will be its velocity v_0 immediately after that collision
plus the additional velocity -eEt/m it has subsequently acquired. Since we assume
that an electron emerges from a collision in a random direction, there will be no
contribution from v_0 to the average electronic velocity, which must therefore be given
entirely by the average of -eEt/m. However, the average of t is the relaxation time
τ. Therefore

$$\mathbf{v}_{\text{avg}} = -\frac{e\mathbf{E}\tau}{m}; \qquad \mathbf{j} = \left(\frac{ne^2\tau}{m}\right)\mathbf{E}. \tag{1.5}$$

This result is usually stated in terms of the inverse of the resistivity, the conductivity
σ = 1/ρ:

$$\mathbf{j} = \sigma\mathbf{E}; \qquad \sigma = \frac{ne^2\tau}{m}. \tag{1.6}$$


This establishes the linear dependence of j on E and gives an estimate of the
conductivity σ in terms of quantities that are all known except for the relaxation
time τ. We may therefore use (1.6) and the observed resistivities to estimate the size
of the relaxation time:

$$\tau = \frac{m}{\rho n e^2}. \tag{1.7}$$

10 In general, E and j need not be parallel. One then defines a resistivity tensor. See Chapters 12 and 13.
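Solving σ = ne²τ/m for τ gives τ = m/(ρne²), which is easy to evaluate in SI units. The sketch below does this for copper at 273 K. The conduction electron density n = 8.47 × 10²² cm⁻³ is the copper value quoted from Table 1.1 and is an input assumed here; the constants are rounded.

```python
M_E = 9.109e-31       # electron mass, kg
E_CHARGE = 1.602e-19  # elementary charge, C


def relaxation_time(resistivity_ohm_m, n_per_m3):
    """Drude relaxation time tau = m / (rho * n * e^2), in seconds (SI inputs)."""
    return M_E / (resistivity_ohm_m * n_per_m3 * E_CHARGE**2)


n_cu = 8.47e28         # electrons per m^3 (assumed: 8.47e22 cm^-3 from Table 1.1)
rho_cu_273 = 1.56e-8   # ohm-m, i.e. 1.56 microhm-cm (Table 1.2)

tau_cu = relaxation_time(rho_cu_273, n_cu)
print(f"tau(Cu, 273 K) = {tau_cu:.2e} s")
```

The result is about 2.7 × 10⁻¹⁴ sec, in agreement with the copper entry of Table 1.3.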


Table 1.2 gives the resistivities of several representative metals at several temperatures.
Note the strong temperature dependence. At room temperature the resistivity
is roughly linear in T, but it falls away much more steeply as low temperatures are
reached.

Table 1.2

ELECTRICAL RESISTIVITIES OF SELECTED ELEMENTSᵃ

ELEMENT    77 K     273 K     373 K     (ρ/T) 373 K / (ρ/T) 273 K
Li         1.04     8.55      12.4      1.06
Na         0.8      4.2       Melted    -
K          1.38     6.1       Melted    -
Rb         2.2      11.0      Melted    -
Cs         4.5      18.8      Melted    -
Cu         0.2      1.56      2.24      1.05
Ag         0.3      1.51      2.13      1.03
Au         0.5      2.04      2.84      1.02
Be         -        2.8       5.3       1.39
Mg         0.62     3.9       5.6       1.05
Ca         -        3.43      5.0       1.07
Sr         7        23        -         -
Ba         17       60        -         -
Nb         3.0      15.2      19.2      0.92
Fe         0.66     8.9       14.7      1.21
Zn         1.1      5.5       7.8       1.04
Cd         1.6      6.8       -         -
Hg         5.8      Melted    Melted    -
Al         0.3      2.45      3.55      1.06
Ga         2.75     13.6      Melted    -
In         1.8      8.0       12.1      1.11
Tl         3.7      15        22.8      1.11
Sn         2.1      10.6      15.8      1.09
Pb         4.7      19.0      27.0      1.04
Bi         35       107       156       1.07
Sb         8        39        59        1.11

ᵃ Resistivities in microhm centimeters are given at 77 K (the boiling point of liquid
nitrogen at atmospheric pressure), 273 K, and 373 K. The last column gives the
ratio of ρ/T at 373 K and 273 K to display the approximate linear temperature
dependence of the resistivity near room temperature.

Source: G. W. C. Kaye and T. H. Laby, Table of Physical and Chemical Constants,
Longmans Green, London, 1966.

Room temperature resistivities are typically of the order of microhm centimeters
(μohm-cm) or, in atomic units, of order 10⁻¹⁸ statohm-cm.11 If ρ_μ is the
resistivity in microhm centimeters, then a convenient way of expressing the relaxation
time implied by (1.7) is

$$\tau = \left(\frac{0.22}{\rho_\mu}\right)\left(\frac{r_s}{a_0}\right)^3 \times 10^{-14}\ \text{sec}. \tag{1.8}$$

Relaxation times calculated from (1.8) and the resistivities in Table 1.2 are displayed
in Table 1.3. Note that at room temperatures τ is typically 10⁻¹⁴ to 10⁻¹⁵ sec. In
considering whether this is a reasonable number, it is more instructive to contemplate
the mean free path, ℓ = v_0 τ, where v_0 is the average electronic speed. The length ℓ
measures the average distance an electron travels between collisions. In Drude’s time
it was natural to estimate v_0 from classical equipartition of energy: ½mv_0² = (3/2)k_B T.
Using the known electronic mass, we find a v_0 of order 10⁷ cm/sec at room temperature,
and hence a mean free path of 1 to 10 Å. Since this distance is comparable
to the interatomic spacing, the result is quite consistent with Drude’s original view
that collisions are due to the electron bumping into the large heavy ions.
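The classical estimate of the mean free path can be reproduced in a few lines. This is only a sketch of the arithmetic: it takes room temperature to be 300 K (an assumption of the example), uses rounded SI constants, and spans the quoted range of relaxation times, 10⁻¹⁵ to 10⁻¹⁴ sec.

```python
import math

K_B = 1.381e-23   # Boltzmann constant, J/K
M_E = 9.109e-31   # electron mass, kg


def thermal_speed(temperature_k):
    """Classical speed from equipartition, (1/2) m v0^2 = (3/2) k_B T, in m/s."""
    return math.sqrt(3.0 * K_B * temperature_k / M_E)


def mean_free_path(speed_m_s, tau_s):
    """Mean free path l = v0 * tau, in metres."""
    return speed_m_s * tau_s


v0 = thermal_speed(300.0)  # room temperature taken as 300 K
print(f"v0 = {v0 * 100:.2e} cm/sec")
for tau in (1.0e-15, 1.0e-14):
    ell_angstrom = mean_free_path(v0, tau) * 1.0e10
    print(f"tau = {tau:.0e} sec  ->  mean free path = {ell_angstrom:.1f} angstroms")
```

The speed comes out near 10⁷ cm/sec and the mean free path between roughly 1 and 10 Å, as stated above.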

However, we shall see in Chapter 2 that this classical estimate of v_0 is an order
of magnitude too small at room temperatures. Furthermore, at the lowest temperatures
in Table 1.3, τ is an order of magnitude larger than at room temperature, while
(as we shall see in Chapter 2) v_0 is actually temperature-independent. This can raise
the low-temperature mean free path to 10³ or more angstroms, about a thousand
times the spacing between ions. Today, by working at sufficiently low temperatures
with carefully prepared samples, mean free paths of the order of centimeters (i.e., 10⁸
interatomic spacings) can be achieved. This is strong evidence that the electrons do
not simply bump off the ions, as Drude supposed.

Fortunately, however, we may continue to calculate with the Drude model without
any precise understanding of the cause of collisions. In the absence of a theory of the
collision time it becomes important to find predictions of the Drude model that are
independent of the value of the relaxation time τ. As it happens, there are several
such τ-independent quantities, which even today remain of fundamental interest, for
in many respects the precise quantitative treatment of the relaxation time remains
the weakest link in modern treatments of metallic conductivity. As a result, τ-independent
quantities are highly valued, for they often yield considerably more reliable
information.

11 To convert resistivities from microhm centimeters to statohm centimeters note that a resistivity of
1 μohm-cm yields an electric field of 10⁻⁶ volt/cm in the presence of a current of 1 amp/cm². Since 1 amp
is 3 × 10⁹ esu/sec, and 1 volt is 1/300 statvolt, a resistivity of 1 μohm-cm yields a field of 1 statvolt/cm when
the current density is 300 × 10⁶ × 3 × 10⁹ esu-cm⁻²-sec⁻¹. The statohm-centimeter is the electrostatic
unit of resistivity, and therefore gives 1 statvolt/cm with a current density of only 1 esu-cm⁻²-sec⁻¹. Thus
1 μohm-cm is equivalent to (1/9) × 10⁻¹⁷ statohm-cm. To avoid using the statohm-centimeter, one may
evaluate (1.7) taking ρ in ohm meters, m in kilograms, n in electrons per cubic meter, and e in coulombs.
(Note: The most important formulas, constants, and conversion factors from Chapters 1 and 3 are summarized
in Appendix A.)

Table 1.3

DRUDE RELAXATION TIMES IN UNITS OF 10⁻¹⁴ SECONDᵃ

ELEMENT    77 K      273 K     373 K
Li         7.3       0.88      0.61
Na         17        3.2       -
K          18        4.1       -
Rb         14        2.8       -
Cs         8.6       2.1       -
Cu         21        2.7       1.9
Ag         20        4.0       2.8
Au         12        3.0       2.1
Be         -         0.51      0.27
Mg         6.7       1.1       0.74
Ca         -         2.2       1.5
Sr         1.4       0.44      -
Ba         0.66      0.19      -
Nb         2.1       0.42      0.33
Fe         3.2       0.24      0.14
Zn         2.4       0.49      0.34
Cd         2.4       0.56      -
Hg         0.71      -         -
Al         6.5       0.80      0.55
Ga         0.84      0.17      -
In         1.7       0.38      0.25
Tl         0.91      0.22      0.15
Sn         1.1       0.23      0.15
Pb         0.57      0.14      0.099
Bi         0.072     0.023     0.016
Sb         0.27      0.055     0.036

ᵃ Relaxation times are calculated from the data in Tables 1.1 and 1.2,
and Eq. (1.8). The slight temperature dependence of n is ignored.

Two cases of particular interest are the calculation of the electrical conductivity
when a spatially uniform static magnetic field is present, and when the electric field
is spatially uniform but time-dependent. Both of these cases are most simply dealt
with by the following observation:

At any time t the average electronic velocity v is just p(t)/m, where p is the total
momentum per electron. Hence the current density is

$$\mathbf{j} = -\frac{ne\,\mathbf{p}(t)}{m}. \tag{1.9}$$


Given that the momentum per electron is p(t) at time t, let us calculate the momentum
per electron p(t + dt) an infinitesimal time dt later. An electron taken at random at
time t will have a collision before time t + dt, with probability dt/τ, and will therefore
survive to time t + dt without suffering a collision with probability 1 - dt/τ. If it
experiences no collision, however, it simply evolves under the influence of the force
f(t) (due to the spatially uniform electric and/or magnetic fields) and will therefore
acquire an additional momentum12 f(t)dt + O(dt)². The contribution of all those
electrons that do not collide between t and t + dt to the momentum per electron at
time t + dt is the fraction (1 - dt/τ) they constitute of all electrons, times their average
momentum per electron, p(t) + f(t)dt + O(dt)².

Thus neglecting for the moment the contribution to p(t + dt) from those electrons
that do undergo a collision in the time between t and t + dt, we have13

$$\mathbf{p}(t+dt) = \left(1 - \frac{dt}{\tau}\right)\left[\mathbf{p}(t) + \mathbf{f}(t)\,dt + O(dt)^2\right] = \mathbf{p}(t) - \left(\frac{dt}{\tau}\right)\mathbf{p}(t) + \mathbf{f}(t)\,dt + O(dt)^2. \tag{1.10}$$

The correction to (1.10) due to those electrons that have had a collision in the
interval t to t + dt is only of the order of (dt)². To see this, first note that such electrons
constitute a fraction dt/τ of the total number of electrons. Furthermore, since the
electronic velocity (and momentum) is randomly directed immediately after a collision,
each such electron will contribute to the average momentum p(t + dt) only
to the extent that it has acquired momentum from the force f since its last collision.
Such momentum is acquired over a time no longer than dt, and is therefore of order
f(t)dt. Thus the correction to (1.10) is of order (dt/τ)f(t)dt, and does not affect the
terms of linear order in dt. We may therefore write:

$$\mathbf{p}(t+dt) - \mathbf{p}(t) = -\left(\frac{dt}{\tau}\right)\mathbf{p}(t) + \mathbf{f}(t)\,dt + O(dt)^2, \tag{1.11}$$

where the contribution of all electrons to p(t + dt) is accounted for. Dividing this
by dt and taking the limit as dt → 0, we find

$$\frac{d\mathbf{p}(t)}{dt} = -\frac{\mathbf{p}(t)}{\tau} + \mathbf{f}(t). \tag{1.12}$$


This  simply  states  that  the  effect  of  individual  electron  collisions  is  to  introduce  a 
frictional  damping  term  into  the  equation  of  motion  for  the  momentum  per  electron. 
We  now  apply  (1.12)  to  several  cases  of  interest. 
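The content of the damped equation of motion, dp/dt = -p/τ + f, can be seen by stepping it forward in time for a constant force. The sketch below is an illustration only, in arbitrary units and for a single Cartesian component; it applies the update p(t + dt) = p(t) - (dt/τ)p(t) + f dt, dropping the O(dt)² terms, and the function name is a hypothetical choice.

```python
import math


def evolve_momentum(p0, force, tau, dt, n_steps):
    """Step p(t + dt) = p(t) - (dt / tau) * p(t) + force * dt for one component.

    Returns the list of momenta at t = 0, dt, 2 dt, ..., n_steps * dt.
    """
    p = p0
    history = [p]
    for _ in range(n_steps):
        p = p - (dt / tau) * p + force * dt
        history.append(p)
    return history


tau = 1.0     # relaxation time (arbitrary units)
force = -2.0  # constant force, e.g. -eE for a static field (arbitrary units)
history = evolve_momentum(p0=0.0, force=force, tau=tau, dt=1.0e-3, n_steps=20_000)

print(f"p after one relaxation time: {history[1000]:.4f}")
print(f"exact value f*tau*(1 - 1/e): {force * tau * (1.0 - math.exp(-1.0)):.4f}")
print(f"p after twenty relaxation times: {history[-1]:.6f}  (f*tau = {force * tau})")
```

Starting from rest, the momentum relaxes exponentially toward the steady value fτ. With f = -eE this steady value is p = -eEτ, which through j = -nep/m reproduces the DC result j = (ne²τ/m)E.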


HALL  EFFECT  AND  MAGNETORESISTANCE 

In 1879 E. H. Hall tried to determine whether the force experienced by a current-carrying
wire in a magnetic field was exerted on the whole wire or only upon (what
we would now call) the moving electrons in the wire. He suspected it was the latter,
and his experiment was based on the argument that “if the current of electricity in a
fixed conductor is itself attracted by a magnet, the current should be drawn to one
side of the wire, and therefore the resistance experienced should be increased.”14 His
efforts to detect this extra resistance were unsuccessful,15 but Hall did not regard this
as conclusive: “The magnet may tend to deflect the current without being able to do
so. It is evident that in this case there would exist a state of stress in the conductor,
the electricity pressing, as it were, toward one side of the wire.” This state of stress
should appear as a transverse voltage (known today as the Hall voltage), which Hall
was able to observe.

12 By O(dt)² we mean a term of the order of (dt)².

13 If the force on the electrons is not the same for every electron, (1.10) will remain valid provided
that we interpret f as the average force per electron.

14 Am. J. Math. 2, 287 (1879).

Hall’s experiment is depicted in Figure 1.3. An electric field E_x is applied to a wire
extending in the x-direction and a current density j_x flows in the wire. In addition, a
magnetic field H points in the positive z-direction. As a result the Lorentz force16

$$-\frac{e}{c}\,\mathbf{v} \times \mathbf{H} \tag{1.13}$$

acts to deflect electrons in the negative y-direction (an electron’s drift velocity is
opposite to the current flow). However, the electrons cannot move very far in the
y-direction before running up against the sides of the wire. As they accumulate there,
an electric field builds up in the y-direction that opposes their motion and their
further accumulation. In equilibrium this transverse field (or Hall field) E_y will balance
the Lorentz force, and current will flow only in the x-direction.


Figure  1.3 

Schematic  view  of  Hall’s  experiment. 


There are two quantities of interest. One is the ratio of the field along the wire
E_x to the current density j_x,

$$\rho(H) = \frac{E_x}{j_x}. \tag{1.14}$$

This is the magnetoresistance,17 which Hall found to be field-independent. The other
is the size of the transverse field E_y. Since it balances the Lorentz force, one might
expect it to be proportional both to the applied field H and to the current along the
wire j_x. One therefore defines a quantity known as the Hall coefficient by

$$R_H = \frac{E_y}{j_x H}. \tag{1.15}$$

15 The increase in resistance (known as the magnetoresistance) does occur, as we shall see in Chapters
12 and 13. The Drude model, however, predicts Hall’s null result.

16 When dealing with nonmagnetic (or weakly magnetic) materials, we shall always call the field H,
the difference between B and H being extremely small.

17 More precisely, it is the transverse magnetoresistance. There is also a longitudinal magnetoresistance,
measured with the magnetic field parallel to the current.

Note that since the Hall field is in the negative y-direction (Figure 1.3), R_H should
be negative. If, on the other hand, the charge carriers were positive, then the sign of
their x-velocity would be reversed, and the Lorentz force would therefore be unchanged.
As a consequence the Hall field would be opposite to the direction it has for
negatively charged carriers. This is of great importance, for it means that a measurement
of the Hall field determines the sign of the charge carriers. Hall’s original data
agreed with the sign of the electronic charge later determined by Thomson. One of
the remarkable aspects of the Hall effect, however, is that in some metals the Hall
coefficient is positive, suggesting that the carriers have a charge opposite to that of
the electron. This is another mystery whose solution had to await the full quantum
theory of solids. In this chapter we shall consider only the simple Drude model
analysis, which though incapable of accounting for positive Hall coefficients, is often
in fairly good agreement with experiment.

To calculate the Hall coefficient and magnetoresistance we first find the current
densities j_x and j_y in the presence of an electric field with arbitrary components E_x
and E_y, and in the presence of a magnetic field H along the z-axis. The (position
independent) force acting on each electron is f = -e(E + v × H/c), and therefore
Eq. (1.12) for the momentum per electron becomes18

$$\frac{d\mathbf{p}}{dt} = -e\left(\mathbf{E} + \frac{\mathbf{p}}{mc} \times \mathbf{H}\right) - \frac{\mathbf{p}}{\tau}. \tag{1.16}$$


In the steady state the current is independent of time, and therefore p_x and p_y will
satisfy

$$0 = -eE_x - \omega_c p_y - \frac{p_x}{\tau},$$
$$0 = -eE_y + \omega_c p_x - \frac{p_y}{\tau}, \tag{1.17}$$

where

$$\omega_c = \frac{eH}{mc}. \tag{1.18}$$

We multiply these equations by -neτ/m and introduce the current density components
through (1.4) to find

$$\sigma_0 E_x = \omega_c\tau\, j_y + j_x,$$
$$\sigma_0 E_y = -\omega_c\tau\, j_x + j_y, \tag{1.19}$$

where σ_0 is just the Drude model DC conductivity in the absence of a magnetic
field, given by (1.6).


18 Note that the Lorentz force is not the same for each electron since it depends on the electronic
velocity v. Therefore the force f in (1.12) is to be taken as the average force per electron (see Footnote 13).
Because, however, the force depends on the electron on which it acts only through a term linear in the
electron’s velocity, the average force is obtained simply by replacing that velocity by the average velocity.

The Hall field E_y is determined by the requirement that there be no transverse
current j_y. Setting j_y to zero in the second equation of (1.19) we find that

$$E_y = -\left(\frac{\omega_c\tau}{\sigma_0}\right) j_x = -\left(\frac{H}{nec}\right) j_x. \tag{1.20}$$

Therefore the Hall coefficient (1.15) is

$$R_H = -\frac{1}{nec}. \tag{1.21}$$

This is a very striking result, for it asserts that the Hall coefficient depends on no
parameters of the metal except the density of carriers. Since we have already calculated
n assuming that the atomic valence electrons become the metallic conduction
electrons, a measurement of the Hall constant provides a direct test of the validity
of this assumption.
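To get a feeling for the size of the Drude Hall coefficient, the sketch below evaluates it for sodium. Two assumptions of the example should be noted: it works in SI units, where the Drude result reads R_H = -1/(ne) (the factor of c belongs to the Gaussian form used in the text), and it takes n = 2.65 × 10²² cm⁻³ for sodium, the value quoted from Table 1.1. The function names are illustrative.

```python
E_CHARGE = 1.602e-19  # elementary charge, C


def drude_hall_coefficient_si(n_per_m3):
    """Drude Hall coefficient in SI units, R_H = -1 / (n e), in m^3/C."""
    return -1.0 / (n_per_m3 * E_CHARGE)


def carrier_density_from_hall_si(r_hall_m3_per_c):
    """Density n0 for which the Drude form reproduces a given R_H (SI units)."""
    return -1.0 / (r_hall_m3_per_c * E_CHARGE)


n_na = 2.65e28  # electrons per m^3 (assumed: 2.65e22 cm^-3 from Table 1.1)
r_hall = drude_hall_coefficient_si(n_na)
print(f"Drude R_H for Na: {r_hall:.3e} m^3/C")
print(f"n0 / n recovered from it: {carrier_density_from_hall_si(r_hall) / n_na:.3f}")
```

Inverting a measured Hall coefficient in the same way gives the density n_0 that the Drude form would require; the ratio n_0/n is unity only to the extent that the model holds.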

In trying to extract the electron density n from measured Hall coefficients one is
faced with the problem that, contrary to the prediction of (1.21), they generally do
depend on magnetic field. Furthermore, they depend on temperature and on the care
with which the sample has been prepared. This result is somewhat unexpected, since
the relaxation time τ, which can depend strongly on temperature and the condition
of the sample, does not appear in (1.21). However, at very low temperatures in very
pure, carefully prepared samples at very high fields, the measured Hall constants do
appear to approach a limiting value. The more elaborate theory of Chapters 12 and 13
predicts that for many (but not all) metals this limiting value is precisely the simple
Drude result (1.21).

Some Hall coefficients at high and moderate fields are listed in Table 1.4. Note
the occurrence of cases in which R_H is actually positive, apparently corresponding
to carriers with a positive charge. A striking example of observed field dependence
totally unexplained by Drude theory is shown in Figure 1.4.

The Drude result confirms Hall’s observation that the resistance does not depend
on field, for when j_y = 0 (as is the case in the steady state when the Hall field has been
established), the first equation of (1.19) reduces to j_x = σ_0 E_x, the expected result for
the conductivity in zero magnetic field. However, more careful experiments on a
variety of metals have revealed that there is a magnetic field dependence to the resistance,
which can be quite dramatic in some cases. Here again the quantum theory
of solids is needed to explain why the Drude result applies in some metals and to
account for some truly extraordinary deviations from it in others.

Table 1.4

HALL COEFFICIENTS OF SELECTED ELEMENTS
IN MODERATE TO HIGH FIELDSᵃ

METAL    VALENCE    -1/R_H nec
Li       1          0.8
Na       1          1.2
K        1          1.1
Rb       1          1.0
Cs       1          0.9
Cu       1          1.5
Ag       1          1.3
Au       1          1.5
Be       2          -0.2
Mg       2          -0.4
In       3          -0.3
Al       3          -0.3

ᵃ These are roughly the limiting values assumed by R_H as the
field becomes very large (of order 10⁴ G), and the temperature
very low, in carefully prepared specimens. The data are quoted
in the form n_0/n, where n_0 is the density for which the Drude
form (1.21) agrees with the measured R_H: n_0 = -1/R_H ec.
Evidently the alkali metals obey the Drude result reasonably
well, the noble metals (Cu, Ag, Au) less well, and the remaining
entries, not at all.

Before leaving the subject of DC phenomena in a uniform magnetic field, we note
for future applications that the quantity ω_c τ is an important, dimensionless measure
of the strength of a magnetic field. When ω_c τ is small, Eq. (1.19) gives j very nearly
parallel to E, as in the absence of a magnetic field. In general, however, j is at an
angle φ (known as the Hall angle) to E, where (1.19) gives tan φ = ω_c τ. The quantity
ω_c, known as the cyclotron frequency, is simply the angular frequency of revolution19
of a free electron in the magnetic field H. Thus ω_c τ will be small if electrons can
complete only a small part of a revolution between collisions, and large if they can
complete many revolutions. Alternatively, when ω_c τ is small the magnetic field deforms
the electronic orbits only slightly, but when ω_c τ is comparable to unity or
larger, the effect of the magnetic field on the electronic orbits is quite drastic. A useful
numerical evaluation of the cyclotron frequency is

$$\nu_c\ (10^9\ \text{hertz}) = 2.80 \times H\ (\text{kilogauss}), \qquad \omega_c = 2\pi\nu_c. \tag{1.22}$$

19 In a uniform magnetic field the orbit of an electron is a spiral along the field whose projection in a
plane perpendicular to the field is a circle. The angular frequency ω_c is determined by the condition that the
centripetal acceleration ω_c² r be provided by the Lorentz force, (e/c)(ω_c r)H.
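The steady-state relations σ_0 E_x = ω_c τ j_y + j_x and σ_0 E_y = -ω_c τ j_x + j_y, together with the numerical value of the cyclotron frequency, lend themselves to a short check. The sketch below is illustrative: it inverts the two linear equations for j, confirms that j makes an angle φ with tan φ = ω_c τ to E, and evaluates ν_c in SI units, where ω_c = eB/m and 1 kilogauss corresponds to 0.1 tesla (the SI form is an assumption of the example; the text uses Gaussian units). All function names are hypothetical.

```python
import math

E_CHARGE = 1.602e-19  # elementary charge, C
M_E = 9.109e-31       # electron mass, kg


def cyclotron_frequency_hz(b_tesla):
    """nu_c = omega_c / (2 pi), with omega_c = e B / m in SI units."""
    return E_CHARGE * b_tesla / (2.0 * math.pi * M_E)


def current_from_field(ex, ey, sigma0, wc_tau):
    """Solve sigma0*Ex = wc_tau*jy + jx and sigma0*Ey = -wc_tau*jx + jy for (jx, jy)."""
    det = 1.0 + wc_tau**2
    jx = sigma0 * (ex - wc_tau * ey) / det
    jy = sigma0 * (wc_tau * ex + ey) / det
    return jx, jy


# Hall angle: the angle between j and E has tangent wc_tau.
wc_tau = 0.5
jx, jy = current_from_field(ex=1.0, ey=0.0, sigma0=1.0, wc_tau=wc_tau)
print(f"tan(phi) = {math.tan(math.atan2(jy, jx)):.3f}  (wc_tau = {wc_tau})")

# Hall geometry: with Ex = jx/sigma0 and Ey = -wc_tau*jx/sigma0 there is no transverse current.
jx_h, jy_h = current_from_field(ex=1.0, ey=-wc_tau, sigma0=1.0, wc_tau=wc_tau)
print(f"Hall geometry: jx = {jx_h:.3f}, jy = {jy_h:.3f}")

# Cyclotron frequency in a field of 1 kilogauss (0.1 tesla).
print(f"nu_c at 1 kG = {cyclotron_frequency_hz(0.1) / 1.0e9:.2f} x 10^9 hertz")
```

In the Hall geometry the longitudinal current equals σ_0 E_x whatever the value of ω_c τ, which is the field-independent resistance noted above.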


Figure 1.4
The quantity n_0/n = -1/R_H nec, for aluminum, as a function of ω_c τ. The free
electron density n is based on a nominal chemical valence of 3. The high field value
suggests only one carrier per primitive cell, with a positive charge. (From R. Lück,
Phys. Stat. Sol. 18, 49 (1966).)

AC ELECTRICAL CONDUCTIVITY OF A METAL


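To calculate the current induced in a metal by a time-dependent electric field, we
write the field in the form

$$\mathbf{E}(t) = \operatorname{Re}\left(\mathbf{E}(\omega)e^{-i\omega t}\right). \tag{1.23}$$

The equation of motion (1.12) for the momentum per electron becomes

$$\frac{d\mathbf{p}}{dt} = -\frac{\mathbf{p}}{\tau} - e\mathbf{E}. \tag{1.24}$$

We seek a steady-state solution of the form

$$\mathbf{p}(t) = \operatorname{Re}\left(\mathbf{p}(\omega)e^{-i\omega t}\right). \tag{1.25}$$

Substituting the complex p and E into (1.24), which must be satisfied by both the
real and imaginary parts of any complex solution, we find that p(ω) must satisfy

$$-i\omega\,\mathbf{p}(\omega) = -\frac{\mathbf{p}(\omega)}{\tau} - e\mathbf{E}(\omega). \tag{1.26}$$

Since j = -nep/m, the current density is just

$$\mathbf{j}(t) = \operatorname{Re}\left(\mathbf{j}(\omega)e^{-i\omega t}\right), \qquad \mathbf{j}(\omega) = -\frac{ne\,\mathbf{p}(\omega)}{m} = \frac{(ne^2/m)\,\mathbf{E}(\omega)}{(1/\tau) - i\omega}. \tag{1.27}$$

One customarily writes this result as

$$\mathbf{j}(\omega) = \sigma(\omega)\mathbf{E}(\omega), \tag{1.28}$$

where σ(ω), known as the frequency-dependent (or AC) conductivity, is given by

$$\sigma(\omega) = \frac{\sigma_0}{1 - i\omega\tau}, \qquad \sigma_0 = \frac{ne^2\tau}{m}. \tag{1.29}$$

Note that this correctly reduces to the DC Drude result (1.6) at zero frequency.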
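The frequency-dependent conductivity σ(ω) = σ_0/(1 - iωτ) is a one-line computation with complex numbers. The sketch below, in units where σ_0 = τ = 1 (a choice made only for illustration), shows the DC limit and the growth of the out-of-phase part as ωτ increases.

```python
def ac_conductivity(omega, sigma0, tau):
    """Drude AC conductivity sigma(omega) = sigma0 / (1 - i omega tau)."""
    return sigma0 / (1.0 - 1j * omega * tau)


sigma0 = 1.0  # DC conductivity (arbitrary units)
tau = 1.0     # relaxation time (arbitrary units)
for omega in (0.0, 0.1, 1.0, 10.0):
    sigma = ac_conductivity(omega, sigma0, tau)
    print(f"omega*tau = {omega * tau:5.1f}:  Re sigma = {sigma.real:.4f},  Im sigma = {sigma.imag:.4f}")
```

The real part is σ_0/(1 + ω²τ²) and the imaginary part is σ_0 ωτ/(1 + ω²τ²), so that for ωτ ≫ 1 the conductivity becomes almost purely imaginary, σ ≈ iσ_0/ωτ.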

The most important application of this result is to the propagation of electromagnetic
radiation in a metal. It might appear that the assumptions we made to
derive (1.29) would render it inapplicable to this case, since (a) the E field in an
electromagnetic wave is accompanied by a perpendicular magnetic field H of the same
magnitude,20 which we have not included in (1.24), and (b) the fields in an
electromagnetic wave vary in space as well as time, whereas Eq. (1.12) was derived by
assuming a spatially uniform force.

The first complication can always be ignored. It leads to an additional term
-(ep/mc) × H in (1.24), which is smaller than the term in E by a factor v/c, where
v is the magnitude of the mean electronic velocity. But even in a current as large as
1 amp/mm², v = j/ne is only of order 0.1 cm/sec. Hence the term in the magnetic
field is typically 10⁻¹⁰ of the term in the electric field and can quite correctly be
ignored.

20 One of the more appealing features of CGS units.

The second point raises more serious questions. Equation (1.12) was derived by
assuming that at any time the same force acts on each electron, which is not the case
if the electric field varies in space. Note, however, that the current density at point
r is entirely determined by what the electric field has done to each electron at r since
its last collision. This last collision, in the overwhelming majority of cases, takes place
no more than a few mean free paths away from r. Therefore if the field does not vary
appreciably over distances comparable to the electronic mean free path, we may
correctly calculate j(r, t), the current density at point r, by taking the field everywhere
in space to be given by its value E(r, t) at the point r. The result,

$$\mathbf{j}(\mathbf{r}, \omega) = \sigma(\omega)\mathbf{E}(\mathbf{r}, \omega), \tag{1.30}$$

is therefore valid whenever the wavelength λ of the field is large compared to the
electronic mean free path ℓ. This is ordinarily satisfied in a metal by visible light (whose
wavelength is of the order of 10³ to 10⁴ Å). When it is not satisfied, one must resort
to so-called nonlocal theories, of greater complexity.

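Assuming, then, that the wavelength is large compared to the mean free path, we
may proceed as follows: in the presence of a specified current density j we may write
Maxwell’s equations as21

$$\nabla \cdot \mathbf{E} = 0; \quad \nabla \cdot \mathbf{H} = 0; \quad \nabla \times \mathbf{E} = -\frac{1}{c}\frac{\partial \mathbf{H}}{\partial t}; \quad \nabla \times \mathbf{H} = \frac{4\pi}{c}\,\mathbf{j} + \frac{1}{c}\frac{\partial \mathbf{E}}{\partial t}. \tag{1.31}$$

We look for a solution with time dependence e^{-iωt}, noting that in a metal we can
write j in terms of E via (1.28). We then find

$$\nabla \times (\nabla \times \mathbf{E}) = -\nabla^2 \mathbf{E} = \frac{i\omega}{c}\,\nabla \times \mathbf{H} = \frac{i\omega}{c}\left(\frac{4\pi\sigma}{c}\,\mathbf{E} - \frac{i\omega}{c}\,\mathbf{E}\right), \tag{1.32}$$

or

$$-\nabla^2 \mathbf{E} = \frac{\omega^2}{c^2}\left(1 + \frac{4\pi i\sigma}{\omega}\right)\mathbf{E}. \tag{1.33}$$

This has the form of the usual wave equation,

$$-\nabla^2 \mathbf{E} = \frac{\omega^2}{c^2}\,\epsilon(\omega)\,\mathbf{E}, \tag{1.34}$$

with a complex dielectric constant given by

$$\epsilon(\omega) = 1 + \frac{4\pi i\sigma}{\omega}. \tag{1.35}$$

If we are at frequencies high enough to satisfy

$$\omega\tau \gg 1, \tag{1.36}$$

then, to a first approximation, Eqs. (1.35) and (1.29) give

$$\epsilon(\omega) = 1 - \frac{\omega_p^2}{\omega^2}, \tag{1.37}$$

where ω_p, known as the plasma frequency, is given by

$$\omega_p^2 = \frac{4\pi n e^2}{m}. \tag{1.38}$$

21 We are considering here an electromagnetic wave, in which the induced charge density ρ vanishes.
Below we examine the possibility of oscillations in the charge density.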


When $\epsilon$ is real and negative ($\omega < \omega_p$) the solutions to (1.34) decay exponentially in
space; i.e., no radiation can propagate. However, when $\epsilon$ is positive ($\omega > \omega_p$) the
solutions to (1.34) become oscillatory, radiation can propagate, and the metal should
become transparent. This conclusion is only valid, of course, if our high-frequency
assumption (1.36) is satisfied in the neighborhood of $\omega = \omega_p$. If we express $\tau$ in terms
of the resistivity through Eq. (1.8), then we can use the definition (1.38) of the plasma
frequency to compute that

$$\omega_p\tau = 1.6 \times 10^2 \left(\frac{r_s}{a_0}\right)^{3/2}\left(\frac{1}{\rho_\mu}\right). \tag{1.39}$$

Since the resistivity in microhm centimeters, $\rho_\mu$, is of the order of unity or less, and
since $r_s/a_0$ is in the range from 2 to 6, the high frequency condition (1.36) will be well
satisfied at the plasma frequency.

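The alkali metals have, in fact, been observed to become transparent in the ultraviolet.
A numerical evaluation of (1.38) gives the frequency at which transparency
should set in as

$$\nu_p = \frac{\omega_p}{2\pi} = 11.4 \times \left(\frac{r_s}{a_0}\right)^{-3/2} \times 10^{15}\ \text{Hz} \tag{1.40}$$

or

$$\lambda_p = \frac{c}{\nu_p} = 0.26\left(\frac{r_s}{a_0}\right)^{3/2} \times 10^3\ \text{\AA}. \tag{1.41}$$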

In Table 1.5 we list the threshold wavelengths calculated from (1.41), along with the
observed thresholds. The agreement between theory and experiment is rather good.
As we shall see, the actual dielectric constant of a metal is far more complicated than
(1.37) and it is to some extent a piece of good fortune that the alkali metals so strikingly
display this Drude behavior. In other metals different contributions to the dielectric
constant compete quite substantially with the "Drude term" (1.37).

Table 1.5

OBSERVED AND THEORETICAL WAVELENGTHS BELOW
WHICH THE ALKALI METALS BECOME TRANSPARENT

ELEMENT    THEORETICAL(a) $\lambda$    OBSERVED $\lambda$
           ($10^3$ Å)                  ($10^3$ Å)
Li         1.5                         2.0
Na         2.0                         2.1
K          2.8                         3.1
Rb         3.1                         3.6
Cs         3.5                         4.4

(a) From Eq. (1.41).

Source: M. Born and E. Wolf, Principles of Optics, Pergamon, New York, 1964.
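The theoretical thresholds can be checked numerically from the definition (1.38). The following is a minimal sketch in Python, not part of the original treatment: the CGS constants are rounded standard values, the function name is our own, and the values of $r_s/a_0$ for the alkali metals are taken as given, since they are not stated in this section.

```python
import math

# Rounded CGS constants (standard values, assumed here).
E_CHARGE = 4.803e-10   # electron charge, esu
E_MASS = 9.109e-28     # electron mass, g
A0 = 0.529e-8          # Bohr radius, cm
C_LIGHT = 2.998e10     # speed of light, cm/s

def plasma_wavelength_angstrom(rs_over_a0):
    """Threshold wavelength c / nu_p, with omega_p^2 = 4 pi n e^2 / m."""
    r_s = rs_over_a0 * A0
    n = 3.0 / (4.0 * math.pi * r_s**3)            # electrons per cm^3
    omega_p = math.sqrt(4.0 * math.pi * n * E_CHARGE**2 / E_MASS)
    nu_p = omega_p / (2.0 * math.pi)
    return (C_LIGHT / nu_p) * 1.0e8               # cm -> angstrom

# r_s / a_0 for the alkali metals (assumed input values).
RS_OVER_A0 = {"Li": 3.25, "Na": 3.93, "K": 4.86, "Rb": 5.20, "Cs": 5.62}

for element, rs in RS_OVER_A0.items():
    lam = plasma_wavelength_angstrom(rs) / 1.0e3
    print(f"{element}: {lam:.1f} x 10^3 angstrom")
```

Because the only material-dependent input is $r_s/a_0$, the same function gives the Drude estimate for any metal, although the agreement with experiment should not be expected to be as good outside the alkalis.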

A second important consequence of (1.37) is that the electron gas can sustain
charge density oscillations. By this we mean a disturbance in which the electric charge
density22 has an oscillatory time dependence $e^{-i\omega t}$. From the equation of continuity,

$$\nabla \cdot \mathbf{j} = -\frac{\partial \rho}{\partial t}, \quad \nabla \cdot \mathbf{j}(\omega) = i\omega\rho(\omega), \tag{1.42}$$

and Gauss's law,

$$\nabla \cdot \mathbf{E}(\omega) = 4\pi\rho(\omega), \tag{1.43}$$

we find, in view of Eq. (1.30), that

$$i\omega\rho(\omega) = 4\pi\sigma(\omega)\rho(\omega). \tag{1.44}$$

This has a solution provided that

$$1 + \frac{4\pi i\sigma(\omega)}{\omega} = 0, \tag{1.45}$$

which is precisely the condition we found above for the onset of propagation of
radiation. In the present context it emerges as the condition the frequency must meet
if a charge density wave is to propagate.
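To see how the condition $1 + 4\pi i\sigma(\omega)/\omega = 0$ singles out the plasma frequency, one can evaluate the complex dielectric constant directly. The sketch below assumes the Drude AC conductivity $\sigma(\omega) = \sigma_0/(1 - i\omega\tau)$ with $\sigma_0 = ne^2\tau/m$, so that $4\pi\sigma_0 = \omega_p^2\tau$; frequencies are measured in units of $\omega_p$, and the function name and the value of $\tau$ are illustrative choices.

```python
def drude_epsilon(omega, omega_p=1.0, tau=1000.0):
    """Complex dielectric constant 1 + 4 pi i sigma(omega) / omega.

    Assumes sigma(omega) = sigma_0 / (1 - i omega tau) with
    4 pi sigma_0 = omega_p^2 tau (Drude model).
    """
    four_pi_sigma = omega_p**2 * tau / (1.0 - 1j * omega * tau)
    return 1.0 + 1j * four_pi_sigma / omega

# Below, at, and above the plasma frequency (omega_p * tau = 1000 >> 1).
for omega in (0.5, 1.0, 2.0):
    eps = drude_epsilon(omega)
    print(f"omega/omega_p = {omega}: Re = {eps.real:+.5f}, Im = {eps.imag:+.5f}")
```

With $\omega_p\tau \gg 1$ the real part follows $1 - \omega_p^2/\omega^2$ closely, changing sign at $\omega = \omega_p$, while the imaginary part is small and shrinks further as $\tau$ is increased.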

The nature of this charge density wave, known as a plasma oscillation or plasmon,
can be understood in terms of a very simple model.23 Imagine displacing the entire
electron gas, as a whole, through a distance d with respect to the fixed positive background
of the ions (Figure 1.5).24 The resulting surface charge gives rise to an electric
field of magnitude $4\pi\sigma$, where $\sigma$ is the charge per unit area25 at either end of the slab.


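Figure 1.5

Simple model of a plasma oscillation. [Figure: a slab of N electrons displaced with respect to
the background of N/Z ions; the surface charges on the two faces produce the field
$E = 2\pi\sigma + 2\pi\sigma = 4\pi nde$.]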


22 The charge density $\rho$ should not be confused with the resistivity, also generally denoted by $\rho$.
The context will always make it clear which is being referred to.

23 Since the field of a uniform plane of charge is independent of the distance from the plane, this crude
argument, which places all of the charge density on two opposite surfaces, is not as crude as it appears at
first glance.

24 We observed earlier that the Drude model does take the electron-ion interaction into account by
acknowledging that the attraction to the positively charged ions confines the electrons to the interior of the
metal. In this simple model of a plasma oscillation it is precisely this attraction that provides the restoring
force.

25 The surface charge density $\sigma$ should not be confused with the conductivity, also generally denoted
by $\sigma$.

Consequently the electron gas as a whole will obey the equation of motion:

$$Nm\ddot{d} = -Ne\,|4\pi\sigma| = -Ne(4\pi nde) = -4\pi ne^2 Nd, \tag{1.46}$$

which leads to oscillation at the plasma frequency.
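The equation of motion for the displacement has the harmonic form $\ddot{d} = -\omega_p^2 d$ with $\omega_p^2 = 4\pi ne^2/m$. As a simple check, the sketch below integrates this equation with a velocity Verlet step (our choice of integrator; any standard scheme would do) over one period $2\pi/\omega_p$ and confirms that the slab returns to its starting displacement. Units with $\omega_p = 1$ and the function name are assumptions made for illustration.

```python
import math

def integrate_displacement(omega_p, d0, t_end, steps):
    """Velocity Verlet integration of d'' = -omega_p^2 d, starting at rest."""
    dt = t_end / steps
    d, v = d0, 0.0
    a = -omega_p**2 * d
    for _ in range(steps):
        d += v * dt + 0.5 * a * dt**2
        a_new = -omega_p**2 * d
        v += 0.5 * (a + a_new) * dt
        a = a_new
    return d, v

omega_p = 1.0
period = 2.0 * math.pi / omega_p
d_final, v_final = integrate_displacement(omega_p, d0=1.0, t_end=period, steps=10_000)
print(d_final, v_final)
```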

Few direct observations have been made of plasmons. Perhaps the most notable
is the observation of energy losses in multiples of $\hbar\omega_p$ when electrons are fired through
thin, metallic films.26 Nevertheless, the possibility of their excitation in the course
of other electronic processes must always be borne in mind.

THERMAL  CONDUCTIVITY  OF  A  METAL 

The most impressive success of the Drude model at the time it was proposed was its
explanation of the empirical law of Wiedemann and Franz (1853). The Wiedemann-Franz
law states that the ratio, $\kappa/\sigma$, of the thermal to the electrical conductivity of
a great number of metals is directly proportional to the temperature, with a proportionality
constant which is to a fair accuracy the same for all metals. This remarkable
regularity can be seen in Table 1.6, where measured thermal conductivities are given
for several metals at 273 K and 373 K, along with the ratios $\kappa/\sigma T$ (known as the
Lorenz number) at the two temperatures.

In  accounting  for  this  the  Drude  model  assumes  that  the  bulk  of  the  thermal  current 
in  a  metal  is  carried  by  the  conduction  electrons.  This  assumption  is  based  on  the 
empirical  observation  that  metals  conduct  heat  much  better  than  insulators  do.  Thus 
thermal  conduction  by  the  ions27  (present  in  both  metals  and  insulators)  is  much 
less  important  than  thermal  conduction  by  the  conduction  electrons  (present  only 
in  metals). 

To define and estimate the thermal conductivity, consider a metal bar along which
the temperature varies slowly. If there were no sources and sinks of heat at the ends
of the bar to maintain the temperature gradient, then the hot end would cool and the
cool end would warm, i.e., thermal energy would flow in a sense opposite to the
temperature gradient. By supplying heat to the hot end as fast as it flows away, one
can produce a steady state in which both a temperature gradient and a uniform flow
of thermal energy are present. We define the thermal current density $\mathbf{j}^q$ to be a vector
parallel to the direction of heat flow, whose magnitude gives the thermal energy per
unit time crossing a unit area perpendicular to the flow.28 For small temperature
gradients the thermal current is observed to be proportional to $\nabla T$ (Fourier's law):

$$\mathbf{j}^q = -\kappa\nabla T. \tag{1.47}$$

The proportionality constant $\kappa$ is known as the thermal conductivity, and is positive,
since the thermal current flows opposite to the direction of the temperature gradient.


26 C. J. Powell and J. B. Swan, Phys. Rev. 115, 869 (1959).

27 Although the metallic ions cannot wander through the metal, there is a way in which they can
transport thermal energy (though not electric charge): the ions can vibrate a little about their mean positions,
leading to the transmission of thermal energy in the form of elastic waves propagating through the
network of ions. See Chapter 25.

28 Note the analogy to the definition of the electrical current density j, as well as the analogy between
the laws of Ohm and Fourier.

Table 1.6

EXPERIMENTAL THERMAL CONDUCTIVITIES AND LORENZ NUMBERS
OF SELECTED METALS

                   273 K                                        373 K
ELEMENT    $\kappa$       $\kappa/\sigma T$                $\kappa$       $\kappa/\sigma T$
           (watt/cm-K)    ($10^{-8}$ watt-ohm/K$^2$)       (watt/cm-K)    ($10^{-8}$ watt-ohm/K$^2$)
Li         0.71           2.22                             0.73           2.43
Na         1.38           2.12
K          1.0            2.23
Rb         0.6            2.42
Cu         3.85           2.20                             3.82           2.29
Ag         4.18           2.31                             4.17           2.38
Au         3.1            2.32                             3.1            2.36
Be         2.3            2.36                             1.7            2.42
Mg         1.5            2.14                             1.5            2.25
Nb         0.52           2.90                             0.54           2.78
Fe         0.80           2.61                             0.73           2.88
Zn         1.13           2.28                             1.1            2.30
Cd         1.0            2.49                             1.0
Al         2.38           2.14                             2.30           2.19
In         0.88           2.58                             0.80           2.60
Tl         0.5            2.75                             0.45           2.75
Sn         0.64           2.48                             0.60           2.54
Pb         0.38           2.64                             0.35           2.53
Bi         0.09           3.53                             0.08           3.35
Sb         0.18           2.57                             0.17           2.69

Source: G. W. C. Kaye and T. H. Laby, Table of Physical and Chemical Constants, Longmans
Green, London, 1966.

As  a  concrete  example  let  us  examine  a  case  where  the  temperature  drop  is  uniform 
in  the  positive  x-direction.  In  the  steady  state  the  thermal  current  will  also  flow  in 
the x-direction and have a magnitude $j^q = -\kappa\, dT/dx$. To calculate the thermal
current  we  note  (assumption  4,  page  6)  that  after  each  collision  an  electron  emerges 
with  a  speed  appropriate  to  the  local  temperature;  the  hotter  the  place  of  the  collision, 
the  more  energetic  the  emerging  electron.  Consequently,  even  though  the  mean 
electronic  velocity  at  a  point  may  vanish  (in  contrast  to  the  case  when  an  electric 
current  flows)  electrons  arriving  at  the  point  from  the  high-temperature  side  will  have 
higher  energies  than  those  arriving  from  the  low-temperature  side  leading  to  a  net 
flow  of  thermal  energy  toward  the  low-temperature  side  (Figure  1.6). 

Figure 1.6

Schematic view of the relation between temperature gradient and thermal current.
Electrons arriving at the center of the bar from the left had their last collision
in the high-temperature region. Those arriving at the center from the right had their
last collision in the low-temperature region. Hence electrons moving to the right at
the center of the bar tend to be more energetic than those moving to the left, yielding
a net thermal current to the right. [Figure: a bar with its high-temperature end on the
left and its low-temperature end on the right.]

To extract a quantitative estimate of the thermal conductivity from this picture,
consider first an oversimplified "one-dimensional" model, in which the electrons can
only move along the x-axis, so that at a point x half the electrons come from the
high-temperature side of x, and half from the low. If $\mathcal{E}(T)$ is the thermal energy per electron
in a metal in equilibrium at temperature T, then an electron whose last collision was
at $x'$ will, on the average, have a thermal energy $\mathcal{E}(T[x'])$. The electrons arriving at
x from the high-temperature side will, on the average, have had their last collision at
$x - v\tau$, and will therefore carry a thermal energy per electron of size $\mathcal{E}(T[x - v\tau])$.
Their contribution to the thermal current density at x will therefore be the number of
such electrons per unit volume, $n/2$, times their velocity, $v$, times this energy, or
$(n/2)v\,\mathcal{E}(T[x - v\tau])$. The electrons arriving at x from the low-temperature side, on
the other hand, will contribute $(n/2)(-v)[\mathcal{E}(T[x + v\tau])]$, since they have come from
the positive x-direction and are moving toward negative x. Adding these together gives

$$j^q = \tfrac{1}{2}nv\left[\mathcal{E}(T[x - v\tau]) - \mathcal{E}(T[x + v\tau])\right]. \tag{1.48}$$

Provided that the variation in temperature over a mean free path ($\ell = v\tau$) is very
small,29 we may expand this about the point x to find:

$$j^q = nv^2\tau\,\frac{d\mathcal{E}}{dT}\left(-\frac{dT}{dx}\right). \tag{1.49}$$

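To go from this to the three-dimensional case we need only replace $v$ by the x-component
$v_x$ of the electronic velocity $\mathbf{v}$, and average over all directions. Since30
$\langle v_x^2\rangle = \langle v_y^2\rangle = \langle v_z^2\rangle = \tfrac{1}{3}v^2$, and since
$n\,d\mathcal{E}/dT = (N/V)\,d\mathcal{E}/dT = (dE/dT)/V = c_v$, the electronic specific heat, we have:

$$\mathbf{j}^q = \tfrac{1}{3}v^2\tau c_v(-\nabla T) \tag{1.50}$$

or

$$\kappa = \tfrac{1}{3}v^2\tau c_v = \tfrac{1}{3}\ell v c_v, \tag{1.51}$$

where $v^2$ is the mean square electronic speed.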
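The step from the two-sided difference in the one-dimensional model to its first-order expansion can be checked numerically. The sketch below is only an illustration: the cubic energy function, the linear temperature profile, and all parameter values are hypothetical choices in arbitrary units, picked so that the expansion is not trivially exact.

```python
def energy(T):
    """Hypothetical thermal energy per electron (nonlinear on purpose)."""
    return 1.0e-3 * T**3

def d_energy_dT(T):
    return 3.0e-3 * T**2

GRAD = -0.5   # dT/dx; negative, so the temperature falls along +x

def temperature(x, T0=300.0):
    """Linear temperature profile T(x) = T0 + GRAD * x."""
    return T0 + GRAD * x

def current_two_sided(x, n, v, tau):
    """Half the electrons arrive from x - v*tau, half from x + v*tau."""
    return 0.5 * n * v * (energy(temperature(x - v * tau))
                          - energy(temperature(x + v * tau)))

def current_first_order(x, n, v, tau):
    """First-order expansion: n v^2 tau (dE/dT) (-dT/dx)."""
    return n * v**2 * tau * d_energy_dT(temperature(x)) * (-GRAD)

n, v, tau = 1.0, 2.0, 0.01   # arbitrary units; mean free path v*tau = 0.02
exact = current_two_sided(0.0, n, v, tau)
approx = current_first_order(0.0, n, v, tau)
print(exact, approx, abs(exact - approx) / abs(exact))
```

The two values agree closely because the temperature changes very little over one mean free path; the current is positive, i.e., directed toward the low-temperature side.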

We  emphasize  the  roughness  of  this  argument.  We  have  spoken  rather  glibly  about 
the  thermal  energy  per  electron  carried  by  a  particular  group  of  electrons,  a  quantity 
one  might  be  hard  pressed  to  define  with  precision.  We  have  also  been  quite  careless 
about  replacing  quantities,  at  various  stages  of  the  calculation,  by  their  thermal 
averages.  One  might  object,  for  example,  that  if  the  thermal  energy  per  electron 
depends  on  the  direction  the  electrons  come  from,  so  will  their  average  speed,  since 
this  too  depends  on  the  temperature  at  the  place  of  their  last  collision.  We  shall  note 
below  that  this  last  oversight  is  canceled  by  yet  another  oversight,  and  in  Chapter  13 


29 Its change in $\ell$ is $(\ell/L)$ times its change in the sample length L.

30 In equilibrium the velocity distribution is isotropic. Corrections to this due to the temperature
gradient are exceedingly small.

we shall find by a more rigorous argument that the result (1.51) is quite close to (and,
in special circumstances, precisely) the correct one.

Given the estimate (1.51), we can derive another result independent of the mysteries
buried in the relaxation time $\tau$, by dividing the thermal conductivity by the electrical
conductivity (1.6):

$$\frac{\kappa}{\sigma} = \frac{\tfrac{1}{3}c_v m v^2}{ne^2}. \tag{1.52}$$

It was natural for Drude to apply the classical ideal gas laws in evaluating the
electronic specific heat and mean square velocity. He therefore in effect took $c_v$ to
be $\tfrac{3}{2}nk_B$ and $\tfrac{1}{2}mv^2$ to be $\tfrac{3}{2}k_BT$, where $k_B$ is Boltzmann's constant, $1.38 \times 10^{-16}$ erg/K.
This leads to the result

$$\frac{\kappa}{\sigma} = \frac{3}{2}\left(\frac{k_B}{e}\right)^2 T. \tag{1.53}$$

The right side of (1.53) is proportional to T and depends only on the universal constants
$k_B$ and $e$, in complete agreement with the law of Wiedemann and Franz.
Equation (1.53) gives a Lorenz number31

$$\frac{\kappa}{\sigma T} = \frac{3}{2}\left(\frac{k_B}{e}\right)^2 = 1.24 \times 10^{-13}\ (\text{erg/esu-K})^2 = 1.11 \times 10^{-8}\ \text{watt-ohm/K}^2, \tag{1.54}$$


which is about half the typical value given in Table 1.6. In his original calculation
of the electrical conductivity, Drude erroneously found half the correct result (1.6),
as a result of which he found a value $\kappa/\sigma T = 2.22 \times 10^{-8}$ watt-ohm/K$^2$, in extraordinary
agreement with experiment.
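The numerical value of the classical Lorenz number is easily reproduced. The sketch below works in SI units, where $(k_B/e)^2$ comes out directly in watt-ohm/K$^2$; the constants are standard values and the function name is our own.

```python
K_B = 1.380649e-23            # Boltzmann constant, J/K
E_CHARGE = 1.602176634e-19    # elementary charge, C

def drude_lorenz_number():
    """(3/2)(k_B/e)^2 in watt-ohm/K^2, from the classical values
    c_v = (3/2) n k_B and (1/2) m v^2 = (3/2) k_B T."""
    return 1.5 * (K_B / E_CHARGE) ** 2

lorenz = drude_lorenz_number()
print(f"classical Drude estimate:   {lorenz:.3e} watt-ohm/K^2")
print(f"with Drude's factor of two: {2.0 * lorenz:.3e} watt-ohm/K^2")
```

Doubling the classical estimate, as Drude's erroneous conductivity effectively did, brings it into the range of the measured values in Table 1.6.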

This success, though wholly fortuitous, was so impressive as to spur further investigations
with the model. It was, however, quite puzzling, since no electronic
contribution to the specific heat remotely comparable to $\tfrac{3}{2}nk_B$ was ever observed.
Indeed,  at  room  temperature  there  appeared  to  be  no  electronic  contribution  to  the 
specific  heat  measured  at  all.  In  Chapter  2  we  shall  find  that  the  classical  ideal  gas 
laws  cannot  be  applied  to  the  electron  gas  in  a  metal.  Drude’s  impressive  success, 
aside  from  his  factor-of-two  mistake,  is  a  consequence  of  two  errors  of  about  100  that 
cancel:  at  room  temperature  the  actual  electronic  contribution  to  the  specific  heat 
is  about  100  times  smaller  than  the  classical  prediction,  but  the  mean  square  electronic 
speed  is  about  100  times  larger. 

We  shall  examine  the  correct  theory  of  the  equilibrium  thermal  properties  of  the 
free  electron  gas  in  Chapter  2,  and  shall  return  to  a  more  correct  analysis  of  the  thermal 
conductivity  of  a  metal  in  Chapter  13.  Before  leaving  the  subject  of  thermal  transport, 
however,  we  should  correct  one  oversimplification  in  our  analysis  that  obscures  an 
important  physical  phenomenon: 

We  calculated  the  thermal  conductivity  by  ignoring  all  manifestations  of  the 
temperature  gradient  except  for  the  fact  that  the  thermal  energy  carried  by  a  group 


31 Since (joule/coulomb)$^2$ = (watt/amp)$^2$ = watt-ohm, the practical units in which Lorenz numbers
are quoted are often called watt-ohm/K$^2$, instead of (joule/coulomb-K)$^2$.

of electrons depends on the temperature at the place of their last collision. But if
electrons emerge from a collision with higher energies when the temperature is higher,
they will also have higher speeds. It would therefore appear that we should let the
electron's speed $v$ as well as its contribution to the thermal energy depend on the place
of the last collision. As it turns out such an additional term only alters the result
by a factor of the order of unity, but we were in fact quite right to ignore such a correction.
It is true that immediately after the temperature gradient is applied there will be
a nonvanishing mean electronic velocity directed toward the low-temperature region.
Since the electrons are charged, however, this velocity will result in an electric current.
But thermal conductivity measurements are performed under open-circuit conditions,
in which no electric current can flow. Therefore the electric current can continue only
until enough charge has accumulated at the surface of the sample to build up a retarding
electric field that opposes the further accumulation of charge, and hence
precisely cancels the effect of the temperature gradient on the electronic mean velocity.32
When the steady state is reached there will be no electric current flow, and
we were therefore correct in assuming that the mean electronic velocity at a point
vanished.

In this way we are led to consider another physical effect: A temperature gradient
in a long, thin bar should be accompanied by an electric field directed opposite to
the temperature gradient. The existence of such a field, known as the thermoelectric
field, has been known for some time (the Seebeck effect). The field is conventionally
written as

$$\mathbf{E} = Q\nabla T, \tag{1.55}$$

and the proportionality constant Q is known as the thermopower. To estimate the
thermopower note that in our "one-dimensional" model the mean electronic velocity
at a point x due to the temperature gradient is

$$v_Q = \tfrac{1}{2}\left[v(x - v\tau) - v(x + v\tau)\right] = -\tau v\frac{dv}{dx} = -\tau\frac{d}{dx}\left(\frac{v^2}{2}\right). \tag{1.56}$$

We can again generalize to three dimensions33 by letting $v^2 \to v_x^2$, and noting that
$\langle v_x^2\rangle = \langle v_y^2\rangle = \langle v_z^2\rangle = \tfrac{1}{3}v^2$, so that

$$\mathbf{v}_Q = -\frac{\tau}{6}\frac{dv^2}{dT}(\nabla T). \tag{1.57}$$

The mean velocity due to the electric field is34

$$\mathbf{v}_E = -\frac{e\mathbf{E}\tau}{m}. \tag{1.58}$$

32 See the analogous discussion of the genesis of the Hall field on page 12.

33 Cf. the discussion leading from Eq. (1.49) to Eq. (1.50).

34 See the discussion on page 7.

To have $\mathbf{v}_Q + \mathbf{v}_E = 0$, we require that

$$Q = -\left(\frac{1}{3e}\right)\frac{d}{dT}\frac{mv^2}{2} = -\frac{c_v}{3ne}. \tag{1.59}$$

This result is also independent of the relaxation time. Drude evaluated it by another
inappropriate application of classical statistical mechanics, setting $c_v$ equal to $3nk_B/2$
to find that

$$Q = -\frac{k_B}{2e} = -0.43 \times 10^{-4}\ \text{volt/K}. \tag{1.60}$$


Observed  metallic  thermopowers  at  room  temperature  are  of  the  order  of  microvolts 
per  degree,  a  factor  of  100  smaller.  This  is  the  same  error  of  100  that  appeared  twice 
in  Drude’s  derivation  of  the  Wiedemann-Franz  law,  but  being  now  uncompensated, 
it  offers  unambiguous  evidence  of  the  inadequacy  of  classical  statistical  mechanics  in 
describing  the  metallic  electron  gas. 
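The size of the discrepancy is easy to make concrete. The sketch below evaluates the classical estimate $Q = -c_v/3ne$ with $c_v = \tfrac{3}{2}nk_B$ in SI units and compares it with a scale of one microvolt per degree; the constants are standard values, and the reference scale is only a representative figure for the observed order of magnitude.

```python
K_B = 1.380649e-23            # Boltzmann constant, J/K
E_CHARGE = 1.602176634e-19    # elementary charge, C

def drude_thermopower():
    """Classical estimate -c_v / (3 n e) with c_v = (3/2) n k_B,
    which reduces to -k_B / (2 e); result in volt/K."""
    return -K_B / (2.0 * E_CHARGE)

Q = drude_thermopower()
MICROVOLT_PER_K = 1.0e-6      # representative observed scale (assumed)

print(f"Q (classical) = {Q:.2e} volt/K")
print(f"|Q| / (1 microvolt/K) = {abs(Q) / MICROVOLT_PER_K:.0f}")
```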

With the use of quantum statistical mechanics one removes this discrepancy.
However, in some metals the sign of the thermopower (the direction of the thermoelectric
field) is opposite to what the Drude model predicts. This is as mysterious as
the discrepancies in the sign of the Hall coefficient. The quantum theory of solids can
account for a sign reversal in the thermopower as well, but one's sense of triumph
is somewhat tempered in this case, for a really quantitative theory of the thermoelectric
field is still lacking. We shall note in later discussions some of the peculiarities
of this phenomenon that make it particularly difficult to calculate with precision.


These  last  examples  have  made  it  clear  that  we  cannot  proceed  very  far  with  a 
free  electron  theory  without  a  proper  use  of  quantum  statistics.  This  is  the  subject  of 
Chapter  2. 


PROBLEMS 


1. Poisson Distribution

In the Drude model the probability of an electron suffering a collision in any infinitesimal interval
$dt$ is just $dt/\tau$.

(a) Show that an electron picked at random at a given moment had no collision during the
preceding $t$ seconds with probability $e^{-t/\tau}$. Show that it will have no collision during the next
$t$ seconds with the same probability.

(b) Show that the probability that the time interval between two successive collisions of an
electron falls in the range between $t$ and $t + dt$ is $(dt/\tau)e^{-t/\tau}$.

(c) Show as a consequence of (a) that at any moment the mean time back to the last collision
(or up to the next collision) averaged over all electrons is $\tau$.

(d) Show as a consequence of (b) that the mean time between successive collisions of an
electron is $\tau$.

(e) Part (c) implies that at any moment the time T between the last and next collision
averaged over all electrons is $2\tau$. Explain why this is not inconsistent with the result in (d). (A
thorough explanation should include a derivation of the probability distribution for T.) A failure
to appreciate this subtlety led Drude to a conductivity only half of (1.6). He did not make the
same mistake in the thermal conductivity, whence the factor of two in his calculation of the
Lorenz number (see page 23).
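The distinction between parts (d) and (e) can be explored numerically before it is derived. The following Monte Carlo sketch is an illustration only, not a solution: it assumes exponentially distributed intervals with mean $\tau$, builds one long collision history, and compares the plain average interval with the average length of the interval that happens to contain a randomly chosen instant. The function name, sample sizes, and seed are arbitrary choices.

```python
import bisect
import random

def simulate(tau=1.0, n_intervals=200_000, n_probes=50_000, seed=0):
    rng = random.Random(seed)
    # Intervals between successive collisions: exponential with mean tau.
    intervals = [rng.expovariate(1.0 / tau) for _ in range(n_intervals)]

    # Collision times along one long history; times[k] ends interval k.
    times = []
    t = 0.0
    for dt in intervals:
        t += dt
        times.append(t)

    mean_between = sum(intervals) / n_intervals

    # Choose random instants and record the interval containing each one.
    total = 0.0
    for _ in range(n_probes):
        instant = rng.uniform(0.0, times[-1])
        k = min(bisect.bisect_left(times, instant), n_intervals - 1)
        total += intervals[k]
    mean_spanning = total / n_probes
    return mean_between, mean_spanning

mean_between, mean_spanning = simulate()
print(mean_between, mean_spanning)
```

A randomly chosen instant is more likely to fall in a long interval than in a short one, which is why the second average comes out near twice the first.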

2. Joule Heating

Consider a metal at uniform temperature in a static uniform electric field E. An electron experiences
a collision, and then, after a time $t$, a second collision. In the Drude model, energy is not
conserved in collisions, for the mean speed of an electron emerging from a collision does not
depend on the energy that the electron acquired from the field since the time of the preceding
collision (assumption 4, page 6).

(a) Show that the average energy lost to the ions in the second of two collisions separated
by a time $t$ is $(eEt)^2/2m$. (The average is over all directions in which the electron emerged from
the first collision.)

(b) Show, using the result of Problem 1(b), that the average energy loss to the ions per
electron per collision is $(eE\tau)^2/m$, and hence that the average loss per cubic centimeter per second
is $(ne^2\tau/m)E^2 = \sigma E^2$. Deduce that the power loss in a wire of length L and cross section A is
$I^2R$, where $I$ is the current flowing and $R$ is the resistance of the wire.
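The averaging step in part (b) can be checked by direct quadrature. The sketch below assumes that the energy lost after a free flight of duration $t$ is $(eEt)^2/2m$ and that flight times are distributed as $e^{-t/\tau}\,dt/\tau$; it integrates with a simple midpoint rule, with $(eE)^2/m$ and $\tau$ set to unity. The function name and step count are illustrative.

```python
import math

def mean_energy_loss(tau=1.0, force_sq_over_m=1.0, steps=200_000, t_max_in_tau=50.0):
    """Average of (eEt)^2 / 2m over P(t) dt = exp(-t/tau) dt / tau.

    force_sq_over_m stands for (eE)^2 / m in arbitrary units; the integral
    is truncated at t_max_in_tau * tau, where the integrand is negligible.
    """
    dt = t_max_in_tau * tau / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * dt                       # midpoint of the i-th slice
        weight = math.exp(-t / tau) / tau
        total += 0.5 * force_sq_over_m * t**2 * weight * dt
    return total

loss = mean_energy_loss()
print(loss)   # compare with (eE tau)^2 / m, which equals 1 in these units
```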

3. Thomson Effect

Suppose that in addition to the applied electric field in Problem 2 there is also a uniform temperature
gradient $\nabla T$ in the metal. Since an electron emerges from a collision at an energy determined
by the local temperature, the energy lost in collisions will depend on how far down the
temperature gradient the electron travels between collisions, as well as on how much energy it
has gained from the electric field. Consequently the power lost will contain a term proportional
to $\mathbf{E} \cdot \nabla T$ (which is easily isolated from other terms since it is the only term in the second-order
energy loss that changes sign when the sign of E is reversed). Show that this contribution is given
in the Drude model by a term of order $(ne\tau/m)(d\mathcal{E}/dT)(\mathbf{E} \cdot \nabla T)$, where $\mathcal{E}$ is the mean thermal
energy per electron. (Calculate the energy lost by a typical electron colliding at $\mathbf{r}$, which made its
last collision at $\mathbf{r} - \mathbf{d}$. Assuming a fixed (that is, energy-independent) relaxation time $\tau$, $\mathbf{d}$ can be
found to linear order in the field and temperature gradient by simple kinematic arguments, which
is enough to give the energy loss to second order.)

4. Helicon Waves

Suppose that a metal is placed in a uniform magnetic field H along the z-axis. Let an AC electric
field $\mathbf{E}e^{-i\omega t}$ be applied perpendicular to H.

(a) If the electric field is circularly polarized ($E_y = \pm iE_x$) show that Eq. (1.28) must be
generalized to

$$j_x = \left(\frac{\sigma_0}{1 - i(\omega \mp \omega_c)\tau}\right)E_x, \quad j_y = \pm ij_x, \quad j_z = 0. \tag{1.61}$$

(b) Show that, in conjunction with (1.61), Maxwell's equations (1.31) have a solution

$$E_x = E_0e^{i(kz - \omega t)}, \quad E_y = \pm iE_x, \quad E_z = 0, \tag{1.62}$$

provided that $k^2c^2 = \epsilon\omega^2$, where

$$\epsilon(\omega) = 1 - \frac{\omega_p^2}{\omega}\left(\frac{1}{\omega \mp \omega_c + i/\tau}\right). \tag{1.63}$$

(c) Sketch $\epsilon(\omega)$ for $\omega > 0$ (choosing the polarization $E_y = iE_x$) and demonstrate that solutions
to $k^2c^2 = \epsilon\omega^2$ exist for arbitrary $k$ at frequencies $\omega > \omega_p$ and $\omega < \omega_c$. (Assume the
high field condition $\omega_c\tau \gg 1$, and note that even for hundreds of kilogauss, $\omega_p/\omega_c \gg 1$.)

(d) Show that when $\omega \ll \omega_c$ the relation between $k$ and $\omega$ for the low-frequency solution is

$$\omega = \omega_c\left(\frac{k^2c^2}{\omega_p^2}\right). \tag{1.64}$$

This low-frequency wave, known as a helicon, has been observed in many metals.35 Estimate
the helicon frequency if the wavelength is 1 cm and the field is 10 kilogauss, at typical metallic
densities.
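For orientation, the order of magnitude asked for in part (d) can be evaluated with a few lines of code. This is a rough sketch under stated assumptions: it takes the low-frequency dispersion relation to be $\omega = \omega_c k^2c^2/\omega_p^2$, uses $\omega_c = eH/mc$ in CGS units, and picks $r_s/a_0 = 3$ as a stand-in for a "typical" metallic density. The constants are rounded standard values and the function name is our own.

```python
import math

# Rounded CGS constants (standard values, assumed here).
E_CHARGE = 4.803e-10   # esu
E_MASS = 9.109e-28     # g
C_LIGHT = 2.998e10     # cm/s
A0 = 0.529e-8          # Bohr radius, cm

def helicon_frequency_hz(wavelength_cm, field_gauss, rs_over_a0):
    """nu = omega / 2 pi with omega = omega_c (k c)^2 / omega_p^2."""
    omega_c = E_CHARGE * field_gauss / (E_MASS * C_LIGHT)   # cyclotron frequency
    n = 3.0 / (4.0 * math.pi * (rs_over_a0 * A0) ** 3)      # electrons per cm^3
    omega_p_sq = 4.0 * math.pi * n * E_CHARGE**2 / E_MASS
    k = 2.0 * math.pi / wavelength_cm
    omega = omega_c * (k * C_LIGHT) ** 2 / omega_p_sq
    return omega / (2.0 * math.pi)

# Wavelength 1 cm, field 10 kilogauss, r_s/a_0 = 3 (assumed typical value).
nu = helicon_frequency_hz(1.0, 1.0e4, 3.0)
print(f"helicon frequency of order {nu:.1f} Hz")
```

The result is a frequency of only a few hertz, many orders of magnitude below both $\omega_c$ and $\omega_p$, which is what makes the helicon so unusual an electromagnetic wave.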


5. Surface Plasmons

An electromagnetic wave that can propagate along the surface of a metal complicates the observation
of ordinary (bulk) plasmons. Let the metal be contained in the half space $z > 0$, $z < 0$
being vacuum. Assume that the electric charge density $\rho$ appearing in Maxwell's equations
vanishes both inside and outside the metal. (This does not preclude a surface charge density
concentrated in the plane $z = 0$.) The surface plasmon is a solution to Maxwell's equations of
the form:

$$E_x = Ae^{iqx}e^{-Kz}, \quad E_y = 0, \quad E_z = Be^{iqx}e^{-Kz}, \quad z > 0;$$
$$E_x = Ce^{iqx}e^{K'z}, \quad E_y = 0, \quad E_z = De^{iqx}e^{K'z}, \quad z < 0; \tag{1.65}$$

$q$, $K$, $K'$ real, $K$, $K'$ positive.

(a) Assuming the usual boundary conditions ($\mathbf{E}_\parallel$ continuous, $(\epsilon\mathbf{E})_\perp$ continuous) and using
the Drude results (1.35) and (1.29) find three equations relating $q$, $K$, and $K'$ as functions of $\omega$.

(b) Assuming that $\omega\tau \gg 1$, plot $q^2c^2$ as a function of $\omega^2$.

(c) In the limit as $qc \gg \omega$, show that there is a solution at frequency $\omega = \omega_p/\sqrt{2}$. Show
from an examination of $K$ and $K'$ that the wave is confined to the surface. Describe its polarization.
This wave is known as a surface plasmon.

In Drude's time, and for many years thereafter, it seemed reasonable to assume that
the electronic velocity distribution, like that of an ordinary classical gas of density
$n = N/V$, was given in equilibrium at temperature $T$ by the Maxwell-Boltzmann
distribution. This gives the number of electrons per unit volume with velocities in the
range1 $d\mathbf{v}$ about $\mathbf{v}$ as $f_B(\mathbf{v})\,d\mathbf{v}$, where

$$f_B(\mathbf{v}) = n\left(\frac{m}{2\pi k_BT}\right)^{3/2}e^{-mv^2/2k_BT}. \tag{2.1}$$

We saw in Chapter 1 that in conjunction with the Drude model this leads to good
order of magnitude agreement with the Wiedemann-Franz law, but also predicts a
contribution to the specific heat of a metal of $\tfrac{3}{2}k_B$ per electron that was not observed.2

This paradox cast a shadow over the Drude model for a quarter of a century, which
was only removed by the advent of the quantum theory and the recognition that for
electrons3 the Pauli exclusion principle requires the replacement of the Maxwell-Boltzmann
distribution (2.1) with the Fermi-Dirac distribution:

$$f(\mathbf{v}) = \frac{(m/\hbar)^3}{4\pi^3}\,\frac{1}{\exp\left[\left(\tfrac{1}{2}mv^2 - k_BT_0\right)/k_BT\right] + 1}. \tag{2.2}$$

Here $\hbar$ is Planck's constant divided by $2\pi$, and $T_0$ is a temperature that is determined
by the normalization condition4

$$n = \int d\mathbf{v}\,f(\mathbf{v}), \tag{2.3}$$

and is typically tens of thousands of degrees. At temperatures of interest (that is,
less than $10^3$ K) the Maxwell-Boltzmann and Fermi-Dirac distributions are spectacularly
different at metallic electronic densities (Figure 2.1).
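The contrast between the two distributions can be tabulated directly. The sketch below expresses both in units of the Fermi-Dirac prefactor $(m/\hbar)^3/4\pi^3$ and in terms of $x = mv^2/2k_BT$. To put the Maxwell-Boltzmann form on the same scale it assumes that $T \ll T_0$, so that $k_BT_0$ is close to the zero-temperature Fermi energy and $n = (mv_F/\hbar)^3/3\pi^2$ with $\tfrac{1}{2}mv_F^2 = k_BT_0$; this relation is taken here as an assumption, and the function names are our own.

```python
import math

def fermi_dirac(x, t0_over_t=100.0):
    """Fermi-Dirac distribution divided by its prefactor; x = m v^2 / (2 k_B T)."""
    arg = x - t0_over_t
    if arg > 700.0:               # avoid overflow in exp for very large x
        return 0.0
    return 1.0 / (math.exp(arg) + 1.0)

def maxwell_boltzmann(x, t0_over_t=100.0):
    """Maxwell-Boltzmann distribution in the same units.

    Assumes T << T0, so that n = (m v_F / hbar)^3 / (3 pi^2) with
    (1/2) m v_F^2 = k_B T0; the ratio to the Fermi-Dirac prefactor is then
    (4 / (3 sqrt(pi))) (T0 / T)^(3/2) exp(-x).
    """
    return 4.0 / (3.0 * math.sqrt(math.pi)) * t0_over_t**1.5 * math.exp(-x)

# T = 0.01 T0, i.e. T0 / T = 100.
for x in (0.0, 1.0, 5.0, 10.0, 50.0, 100.0, 150.0):
    print(f"x = {x:6.1f}   FD = {fermi_dirac(x):.4e}   MB = {maxwell_boltzmann(x):.4e}")
```

On this scale the classical distribution is several hundred times larger than the Fermi-Dirac one at small $x$ but falls to negligible values long before $x$ reaches $T_0/T$, where the Fermi-Dirac distribution is still of order unity.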

In  this  chapter  we  shall  describe  the  theory  underlying  the  Fermi-Dirac  distribution 
(2.2)  and  survey  the  consequences  of  Fermi-Dirac  statistics  for  the  metallic  electron 
gas. 

Shortly after the discovery that the Pauli exclusion principle was needed to account
for the bound electronic states of atoms, Sommerfeld applied the same principle to
the free electron gas of metals, and thereby resolved the most flagrant thermal anomalies
of the early Drude model. In most applications Sommerfeld's model is nothing
more than Drude's classical electron gas with the single modification that the electronic
velocity distribution is taken to be the quantum Fermi-Dirac distribution


1  We  use  standard  vector  notation.  Thus  by  r  we  mean  the  magnitude  of  the  vector  > ;  a  velocity  is  in 
the  range  dv  about  v  if  its  ith  component  lies  between  and  r,  +  dvh  for  i  =  x ,  \\  z;  we  also  use  dv  to 
denote  the  volume  of  the  region  of  velocity  space  in  the  range  dx  about  v:  dv  ~  dvxdvydvz  (thereby  following 
the  practice  common  among  physicists  of  failing  to  distinguish  notaiionally  between  a  region  and  its 
volume,  the  significance  of  the  symbol  being  clear  from  context). 

z  Because,  as  we  shall  see.  the  actual  electronic  contribution  is  about  100  times  smaller  at  room 
temperature,  becoming  smaller  still  as  the  temperature  drops. 

3  And  any  other  particles  obeying  Fermi-Dirac  statistics. 

4  Note  that  the  constants  in  the  Maxwell-Boltzmann  distribution  (2.1)  have  already  been  chosen  so 
that  (2.3)  is  satisfied.  Equation  (2.2)  is  derived  below;  see  Eq.  (2.89).  In  Problem  3d  the  prefactor  appearing 
in  Eq.  (2.2)  is  cast  in  a  form  that  facilitates  direct  comparison  with  Eq.  (2.1). 


Figure 2.1

(a)  The  Maxwell-Boltzmann  and  Fermi-Dirac  distributions  for  typical  metallic  densities 
at  room  temperature.  (Both  curves  are  for  the  density  given  by  T  =  0.01  T0.)  The  scale 
is  the  same  for  both  distributions,  and  has  been  normalized  so  that  the  Fermi-Dirac 
distribution  approaches  1  at  low  energies.  Below  room  temperature  the  differences  between 
the two distributions are even more marked. (b) A view of that part of (a) between x = 0
and x = 10. The x-axis has been stretched by about a factor of 10, and the f-axis has been
compressed by about 500 to get all of the Maxwell-Boltzmann distribution in the figure.
On this scale the graph of the Fermi-Dirac distribution is indistinguishable from the x-axis.
rather than the classical Maxwell-Boltzmann distribution. To justify both the use of
the  Fermi-Dirac  distribution  and  its  bold  grafting  onto  an  otherwise  classical  theory, 
we  must  examine  the  quantum  theory  of  the  electron  gas.5 

For  simplicity  we  shall  examine  the  ground  state  (i.e.,  T  =  0)  of  the  electron  gas 
before studying it at nonzero temperatures. As it turns out, the properties of the
ground state are of considerable interest in themselves: we shall find that room temperature,
for the electron gas at metallic densities, is a very low temperature indeed,
for  many  purposes  indistinguishable  from  T  =  0.  Thus  many  (though  not  all)  of  the 
electronic  properties  of  a  metal  hardly  differ  from  their  values  at  T  =  0,  even  at  room 
temperature. 


GROUND-STATE  PROPERTIES  OF  THE  ELECTRON  GAS 


We  must  calculate  the  ground-state  properties  of  N  electrons  confined  to  a  volume 
V.  Because  the  electrons  do  not  interact  with  one  another  (independent  electron 
approximation) we can find the ground state of the N-electron system by first finding
the energy levels of a single electron in the volume V and then filling these levels up
in  a  manner  consistent  with  the  Pauli  exclusion  principle,  which  permits  at  most  one 
electron  to  occupy  any  single  electron  level.6 

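A single electron can be described by a wave function $\psi(\mathbf{r})$ and the specification of
which of two possible orientations its spin possesses. If the electron has no interactions,
the one electron wave function associated with a level of energy $\mathcal{E}$ satisfies the
time-independent Schrödinger equation7:

$$-\frac{\hbar^2}{2m}\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right)\psi(\mathbf{r}) = -\frac{\hbar^2}{2m}\nabla^2\psi(\mathbf{r}) = \mathcal{E}\,\psi(\mathbf{r}). \tag{2.4}$$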


We  shall  represent  the  confinement  of  the  electron  (by  the  attraction  of  the  ions) 
to  the  volume  V  by  a  boundary  condition  on  Eq.  (2.4).  The  choice  of  boundary 
condition,  whenever  one  is  dealing  with  problems  that  are  not  explicitly  concerned 
with  effects  of  the  metallic  surface,  is  to  a  considerable  degree  at  one’s  disposal  and 
can  be  determined  by  mathematical  convenience,  for  if  the  metal  is  sufficiently  large 
we  should  expect  its  bulk  properties  to  be  unaffected  by  the  detailed  configuration  of 
its  surface.8  In  this  spirit  we  first  select  the  shape  of  the  metal  to  suit  our  analytic 
convenience. The time-honored choice is a cube9 of side $L = V^{1/3}$.

Next  we  must  append  a  boundary  condition  to  the  Schrodinger  equation  (2.4). 


5 Throughout this chapter we shall take "electron gas" to mean a gas of free and independent electrons
(see page 4) unless we are explicitly considering corrections due to electron-electron or electron-ion
interactions.

6 Note that here and later we shall reserve the term "state" for the state of the N-electron system, and
the term "level" for a one-electron state.

7 We also make the free electron approximation, so that no potential energy term appears in the
Schrödinger equation.

8 This is the approach that is almost universally followed in theories of macroscopic matter. Rigorous
proofs that bulk properties are independent of the boundary conditions can now be constructed in a variety
of contexts. The work most pertinent to solid state physics is by J. L. Lebowitz and E. H. Lieb, Phys.
Rev. Lett. 22, 631 (1969).

9 We shall subsequently find it far more convenient to take not a cube but a parallelepiped with
reflecting the fact that the electron is confined to this cube. We also make this choice
in the belief that it will not affect calculated bulk properties. One possibility
is to require the wave function $\psi(\mathbf{r})$ to vanish whenever $\mathbf{r}$ is on the surface of the cube.
This, however, is often unsatisfactory, for it leads to standing-wave solutions of (2.4),
while the transport of charge and energy by the electrons is far more conveniently
discussed  in  terms  of  running  waves.  A  more  satisfactory  choice  is  to  emphasize  the 
inconsequence  of  the  surface  by  disposing  of  it  altogether.  We  can  do  this  by  imagining 
each face of the cube to be joined to the face opposite it, so that an electron coming
to  the  surface  is  not  reflected  back  in,  but  leaves  the  metal,  simultaneously  reentering 
at a corresponding point on the opposite surface. Thus, if our metal were one-dimensional,
we would simply replace the line from 0 to L to which the electrons were
confined, by a circle of circumference L. In three dimensions the geometrical embodiment
of the boundary condition, in which the three pairs of opposite faces on the cube
are joined, becomes topologically impossible to construct in three-dimensional space.
Nevertheless,  the  analytic  form  of  the  boundary  condition  is  easily  generalized.  In 
one  dimension  the  circular  model  of  a  metal  results  in  the  boundary  condition 
$\psi(x + L) = \psi(x)$, and the generalization to a three-dimensional cube is evidently

$$\begin{aligned}
\psi(x, y, z + L) &= \psi(x, y, z),\\
\psi(x, y + L, z) &= \psi(x, y, z),\\
\psi(x + L, y, z) &= \psi(x, y, z).
\end{aligned} \tag{2.5}$$

Equation (2.5) is known as the Born-von Karman (or periodic) boundary condition.
We shall encounter it often (sometimes in a slightly generalized9 form).

We now solve (2.4) subject to the boundary condition (2.5). One can verify by
differentiation that a solution, neglecting the boundary condition, is

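$$\psi_{\mathbf{k}}(\mathbf{r}) = \frac{1}{\sqrt{V}}\,e^{i\mathbf{k}\cdot\mathbf{r}}, \tag{2.6}$$

with energy

$$\mathcal{E}(\mathbf{k}) = \frac{\hbar^2 k^2}{2m}, \tag{2.7}$$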

where $\mathbf{k}$ is any position independent vector. We have picked the normalization constant
in (2.6) so that the probability of finding the electron somewhere in the whole
volume V is unity:

$$1 = \int d\mathbf{r}\,|\psi(\mathbf{r})|^2. \tag{2.8}$$


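To see the significance of the vector $\mathbf{k}$, note that the level $\psi_{\mathbf{k}}(\mathbf{r})$ is an eigenstate of
the momentum operator,

$$\mathbf{p} = \frac{\hbar}{i}\frac{\partial}{\partial\mathbf{r}} = \frac{\hbar}{i}\nabla, \qquad \left(p_x = \frac{\hbar}{i}\frac{\partial}{\partial x},\ \text{etc.}\right), \tag{2.9}$$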


edges not necessarily equal or perpendicular. For the moment we use a cube to avoid minor geometrical
complexities, but it is a useful exercise to verify that all the results of this section remain valid for a
parallelepiped.
with eigenvalue $\mathbf{p} = \hbar\mathbf{k}$, for

$$\frac{\hbar}{i}\frac{\partial}{\partial\mathbf{r}}\,e^{i\mathbf{k}\cdot\mathbf{r}} = \hbar\mathbf{k}\,e^{i\mathbf{k}\cdot\mathbf{r}}. \tag{2.10}$$


Since a particle in an eigenstate of an operator has a definite value of the corresponding
observable given by the eigenvalue, an electron in the level $\psi_{\mathbf{k}}(\mathbf{r})$ has a
definite momentum proportional to $\mathbf{k}$:

$$\mathbf{p} = \hbar\mathbf{k}, \tag{2.11}$$

and a velocity $\mathbf{v} = \mathbf{p}/m$ of

$$\mathbf{v} = \frac{\hbar\mathbf{k}}{m}. \tag{2.12}$$

In view of this the energy (2.7) can be written in the familiar classical form,

$$\mathcal{E} = \frac{p^2}{2m} = \frac{1}{2}mv^2. \tag{2.13}$$

We can also interpret $\mathbf{k}$ as a wave vector. The plane wave $e^{i\mathbf{k}\cdot\mathbf{r}}$ is constant in any
plane perpendicular to $\mathbf{k}$ (since such planes are defined by the equation $\mathbf{k}\cdot\mathbf{r}$ = constant)
and it is periodic along lines parallel to $\mathbf{k}$, with wavelength

$$\lambda = \frac{2\pi}{k}, \tag{2.14}$$

known as the de Broglie wavelength.

We now invoke the boundary condition (2.5). This permits only certain discrete
values of $\mathbf{k}$, since (2.5) will be satisfied by the general wave function (2.6) only if

$$e^{ik_xL} = e^{ik_yL} = e^{ik_zL} = 1. \tag{2.15}$$

Since $e^z = 1$ only if $z = 2\pi i n$, where n is an integer,10 the components of the wave
vector $\mathbf{k}$ must be of the form:

$$k_x = \frac{2\pi n_x}{L}, \quad k_y = \frac{2\pi n_y}{L}, \quad k_z = \frac{2\pi n_z}{L}, \qquad n_x, n_y, n_z\ \text{integers}. \tag{2.16}$$


Thus in a three-dimensional space with Cartesian axes $k_x$, $k_y$, and $k_z$ (known as
k-space) the allowed wave vectors are those whose coordinates along the three axes
are given by integral multiples of $2\pi/L$. This is illustrated (in two dimensions) in
Figure  2.2. 

Generally  the  only  practical  use  one  makes  of  the  quantization  condition  (2.16) 
is  this:  One  often  needs  to  know  how  many  allowed  values  of  k  are  contained  in 
a region of k-space that is enormous on the scale of $2\pi/L$, and that therefore contains
a vast number of allowed points. If the region is very large,11 then to an excellent
approximation the number of allowed points is just the volume of k-space contained
within the region, divided by the volume of k-space per point in the network of

10 We shall always use the word "integer" to mean the negative integers and zero, as well as the
positive integers.

11 And not too irregularly shaped; only a negligible fraction of the points should be within $O(2\pi/L)$
of the surface.
Figure 2.2

Points in a two-dimensional k-space of the form $k_x = 2\pi n_x/L$, $k_y = 2\pi n_y/L$. Note that
the area per point is just $(2\pi/L)^2$. In $d$ dimensions the volume per point is $(2\pi/L)^d$.


allowed values of $\mathbf{k}$. That latter volume (see Figure 2.2) is just $(2\pi/L)^3$. We therefore
conclude that a region of k-space of volume $\Omega$ will contain

$$\frac{\Omega}{(2\pi/L)^3} = \frac{\Omega V}{8\pi^3} \tag{2.17}$$

allowed values of $\mathbf{k}$, or, equivalently, that the number of allowed k-values per unit
volume of k-space (also known as the k-space density of levels) is just

$$\frac{V}{8\pi^3}. \tag{2.18}$$


In practice we shall deal with k-space regions so large ($\sim 10^{22}$ points) and so regular
(typically  spheres)  that  to  all  intents  and  purposes  (2.17)  and  (2.18)  can  be  regarded 
as  exact.  We  shall  begin  to  apply  these  important  counting  formulas  shortly. 
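
As a rough numerical check of the counting formula (2.17), the sketch below counts the allowed wave vectors (2.16) inside a sphere directly and compares the result with the volume estimate. Lengths in k-space are measured in units of $2\pi/L$, so that $\Omega V/8\pi^3$ reduces to the volume of the sphere in those units; the function names are our own and are introduced only for illustration.

```python
import math

def count_allowed_k(radius_in_units):
    """Count wave vectors k = (2*pi/L)(nx, ny, nz) with |k| <= radius.

    The radius is given in units of 2*pi/L, so the condition becomes
    nx^2 + ny^2 + nz^2 <= radius^2 with nx, ny, nz integers (Eq. 2.16).
    """
    r = int(math.floor(radius_in_units))
    r2 = radius_in_units ** 2
    count = 0
    for nx in range(-r, r + 1):
        for ny in range(-r, r + 1):
            for nz in range(-r, r + 1):
                if nx * nx + ny * ny + nz * nz <= r2:
                    count += 1
    return count

def estimate_from_volume(radius_in_units):
    """Eq. (2.17): Omega * V / (8 pi^3), which in units of 2*pi/L
    is simply the volume of the sphere, 4 pi R^3 / 3."""
    return 4.0 * math.pi * radius_in_units ** 3 / 3.0

for radius in (5, 10, 20):
    exact = count_allowed_k(radius)
    approx = estimate_from_volume(radius)
    rel_err = abs(exact - approx) / approx
    print(f"R = {radius:2d}: counted {exact}, estimated {approx:.1f}, "
          f"relative error {rel_err:.4f}")
```

The relative error shrinks as the sphere grows, which is the sense in which (2.17) and (2.18) become exact for regions containing of order $10^{22}$ points.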

Because we assume the electrons are noninteracting we can build up the N-electron
ground  state  by  placing  electrons  into  the  allowed  one-electron  levels  we  have  just 
found.  The  Pauli  exclusion  principle  plays  a  vital  role  in  this  construction  (as  it  does 
in  building  up  the  states  of  many  electron  atoms):  we  may  place  at  most  one  electron 
in  each  single  electron  level.  The  one-electron  levels  are  specified  by  the  wave  vectors 
k  and  by  the  projection  of  the  electron’s  spin  along  an  arbitrary  axis,  which  can  take 
either of the two values $\hbar/2$ or $-\hbar/2$. Therefore associated with each allowed wave
vector  k  are  two  electronic  levels,  one  for  each  direction  of  the  electron’s  spin. 

Thus in building up the N-electron ground state we begin by placing two electrons
in the one-electron level $\mathbf{k} = 0$, which has the lowest possible one-electron energy
$\mathcal{E} = 0$. We then continue to add electrons, successively filling the one-electron levels
of lowest energy that are not already occupied. Since the energy of a one-electron
level is directly proportional to the square of its wave vector (see (2.7)), when N is
enormous the occupied region will be indistinguishable from a sphere.12 The radius
of this sphere is called $k_F$ (F for Fermi), and its volume $\Omega$ is $4\pi k_F^3/3$. According to
(2.17) the number of allowed values of $\mathbf{k}$ within the sphere is

$$\left(\frac{4\pi k_F^3}{3}\right)\left(\frac{V}{8\pi^3}\right) = \frac{k_F^3}{6\pi^2}\,V. \tag{2.19}$$

12 If it were not spherical it would not be the ground state, for we could then construct a state of lower
energy by moving the electrons in levels farthest away from $\mathbf{k} = 0$ into unoccupied levels closer to the origin.
Since each allowed k-value leads to two one-electron levels (one for each spin value),
in order to accommodate N electrons we must have

$$N = 2\cdot\frac{k_F^3}{6\pi^2}\,V = \frac{k_F^3}{3\pi^2}\,V. \tag{2.20}$$

Thus if we have N electrons in a volume V (i.e., an electronic density n = N/V),
then the ground state of the N-electron system is formed by occupying all single-particle
levels with k less than $k_F$, and leaving all those with k greater than $k_F$ unoccupied,
where $k_F$ is given by the condition:

$$n = \frac{k_F^3}{3\pi^2}. \tag{2.21}$$

This  free  and  independent  electron  ground  state  is  described  by  some  rather 
unimaginative  nomenclature: 

The  sphere  of  radius  kF  (the  Fermi  wave  vector )  containing  the  occupied  one 
electron  levels  is  called  the  Fermi  sphere. 

The  surface  of  the  Fermi  sphere,  which  separates  the  occupied  from  the  unoccupied 
levels  is  called  the  Fermi  surface.  (We  shall  see,  starting  with  Chapter  8,  that  the 
Fermi  surface  is  one  of  the  fundamental  constructions  in  the  modern  theory  of  metals; 
in  general  it  is  not  spherical.) 

The momentum $\hbar k_F = p_F$ of the occupied one-electron levels of highest energy
is known as the Fermi momentum; their energy, $\mathcal{E}_F = \hbar^2k_F^2/2m$, is the Fermi energy;
and their velocity, $v_F = p_F/m$, is the Fermi velocity. The Fermi velocity plays a role
in the theory of metals comparable to the thermal velocity, $v = (3k_BT/m)^{1/2}$, in a
classical gas.

All  these  quantities  can  be  evaluated  in  terms  of  the  conduction  electron  density, 
via  Eq.  (2.21).  For  estimating  them  numerically  it  is  often  more  convenient  to  express 
them  in  terms  of  the  dimensionless  parameter  rs/a0  (see  page  4),  which  varies  from 
about  2  to  6  in  the  metallic  elements.  Taken  together,  Eqs.  (1.2)  and  (2.21)  give 


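$$k_F = \frac{(9\pi/4)^{1/3}}{r_s} = \frac{1.92}{r_s}, \tag{2.22}$$

or

$$k_F = \frac{3.63}{r_s/a_0}\ \text{Å}^{-1}. \tag{2.23}$$

Since the Fermi wave vector is of the order of inverse angstroms, the de Broglie
wavelength of the most energetic electrons is of the order of angstroms.

The Fermi velocity is

$$v_F = \left(\frac{\hbar}{m}\right)k_F = \frac{4.20}{r_s/a_0}\times 10^{8}\ \text{cm/sec}. \tag{2.24}$$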

This is a substantial velocity (about 1 percent of the velocity of light). From the
viewpoint of classical statistical mechanics this is quite a surprising result, for we are
describing the ground state (T = 0), and all particles in a classical gas have zero
velocity at T = 0. Even at room temperature the thermal (i.e., average) velocity for
a classical particle with the electronic mass is only of order $10^7$ cm/sec.
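
The classical estimate quoted here is easy to reproduce. The following sketch evaluates the thermal velocity $(3k_BT/m)^{1/2}$ for a particle of electronic mass in CGS units; the constants are standard values that we supply ourselves, and the function name is hypothetical.

```python
import math

K_B = 1.380649e-16    # Boltzmann constant in erg/K (assumed standard value)
M_E = 9.1093837e-28   # electron mass in grams (assumed standard value)

def classical_thermal_velocity(temperature_kelvin):
    """Thermal velocity v = sqrt(3 k_B T / m) in cm/sec for an electron."""
    return math.sqrt(3.0 * K_B * temperature_kelvin / M_E)

v_room = classical_thermal_velocity(300.0)
print(f"classical thermal velocity at 300 K: {v_room:.3e} cm/sec")
```

At 300 K this gives a velocity of order $10^7$ cm/sec, roughly an order of magnitude below the Fermi velocity of a typical metal.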

The Fermi energy is conveniently written in the form (since $a_0 = \hbar^2/me^2$)

$$\mathcal{E}_F = \frac{\hbar^2k_F^2}{2m} = \left(\frac{e^2}{2a_0}\right)(k_Fa_0)^2. \tag{2.25}$$

Here $e^2/2a_0$, known as the rydberg (Ry), is the ground-state binding energy of the
hydrogen atom, 13.6 electron volts.13 The rydberg is as convenient a unit of atomic
energies as the Bohr radius is of atomic distances. Since $k_Fa_0$ is of the order of
unity, Eq. (2.25) demonstrates that the Fermi energy has the magnitude of a typical
atomic binding energy. Using (2.23) and $a_0 = 0.529 \times 10^{-8}$ cm, we find the explicit
numerical form:

$$\mathcal{E}_F = \frac{50.1\ \text{eV}}{(r_s/a_0)^2}, \tag{2.26}$$


indicating  a  range  of  Fermi  energies  for  the  densities  of  metallic  elements  between 
1.5  and  15  electron  volts. 
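
The ground-state quantities defined above follow from $r_s/a_0$ alone. As an illustrative sketch (not the procedure used to prepare any table in this chapter), the code below evaluates $k_F$ from (2.22) and then $v_F = \hbar k_F/m$, $\mathcal{E}_F = \hbar^2k_F^2/2m$, and the temperature $\mathcal{E}_F/k_B$, in CGS units. The physical constants and the function name are our own assumptions.

```python
import math

HBAR = 1.054571817e-27      # erg sec (assumed standard value)
M_E = 9.1093837e-28         # electron mass, g
K_B = 1.380649e-16          # Boltzmann constant, erg/K
A_0 = 0.529177e-8           # Bohr radius, cm
ERG_PER_EV = 1.602176634e-12

def fermi_parameters(rs_over_a0):
    """Free electron k_F, v_F, E_F and E_F/k_B for a given r_s/a_0."""
    r_s = rs_over_a0 * A_0
    k_f = (9.0 * math.pi / 4.0) ** (1.0 / 3.0) / r_s   # Eq. (2.22), cm^-1
    v_f = HBAR * k_f / M_E                             # cm/sec
    e_f = (HBAR * k_f) ** 2 / (2.0 * M_E)              # erg
    return {
        "k_F": k_f,
        "v_F": v_f,
        "E_F_eV": e_f / ERG_PER_EV,
        "T_F": e_f / K_B,
    }

# r_s/a_0 values for sodium and copper
for name, x in (("Na", 3.93), ("Cu", 2.67)):
    p = fermi_parameters(x)
    print(f"{name}: k_F = {p['k_F']:.3e} cm^-1, v_F = {p['v_F']:.3e} cm/sec, "
          f"E_F = {p['E_F_eV']:.2f} eV, E_F/k_B = {p['T_F']:.3e} K")
```

For copper this gives a Fermi energy of about 7 eV and a Fermi velocity of about $1.6 \times 10^8$ cm/sec; small differences from tabulated values reflect rounding of the constants.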

Table  2.1  lists  the  Fermi  energy,  velocity,  and  wave  vector  for  the  metals  whose 
conduction  electron  densities  are  given  in  Table  1.1. 

To  calculate  the  ground-state  energy  of  N  electrons  in  a  volume  V  we  must  add 
up  the  energies  of  all  the  one-electron  levels  inside  the  Fermi  sphere14: 


$$E = 2\sum_{k<k_F}\frac{\hbar^2}{2m}\,k^2. \tag{2.27}$$


Quite  generally,  in  summing  any  smooth  function  F(k)  over  all  allowed  values  of 
k,  one  may  proceed  as  follows: 

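Because the volume of k-space per allowed $\mathbf{k}$ value is $\Delta\mathbf{k} = 8\pi^3/V$ (see Eq. (2.18))
it is convenient to write

$$\sum_{\mathbf{k}}F(\mathbf{k}) = \frac{V}{8\pi^3}\sum_{\mathbf{k}}F(\mathbf{k})\,\Delta\mathbf{k}, \tag{2.28}$$

for in the limit as $\Delta\mathbf{k} \to 0$ (i.e., $V \to \infty$) the sum $\sum F(\mathbf{k})\,\Delta\mathbf{k}$ approaches the integral
$\int d\mathbf{k}\,F(\mathbf{k})$, provided only that $F(\mathbf{k})$ does not vary appreciably15 over distances in k-space
of order $2\pi/L$. We may therefore rearrange (2.28) and write

$$\lim_{V\to\infty}\frac{1}{V}\sum_{\mathbf{k}}F(\mathbf{k}) = \int\frac{d\mathbf{k}}{8\pi^3}\,F(\mathbf{k}). \tag{2.29}$$

In applying (2.29) to finite, but macroscopically large, systems one always assumes
that $(1/V)\sum_{\mathbf{k}}F(\mathbf{k})$ differs negligibly from its infinite volume limit (for example, one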


13 Strictly speaking, the rydberg is the binding energy in the approximation of infinite proton mass.
An electron volt is the energy gained by an electron crossing a potential of 1 volt; 1 eV = $1.602 \times 10^{-12}$ erg
= $1.602 \times 10^{-19}$ joule.

14  The  factor  of  2  is  for  the  two  spin  levels  allowed  for  each  k. 

15  The  most  celebrated  case  in  which  F  fails  to  satisfy  this  condition  is  the  condensation  of  the  ideal 
Bose  gas.  In  applications  to  metals  the  problem  never  arises. 
Table 2.1

FERMI ENERGIES, FERMI TEMPERATURES, FERMI WAVE VECTORS, AND
FERMI VELOCITIES FOR REPRESENTATIVE METALS(a)

ELEMENT   r_s/a_0   E_F (eV)   T_F (10^4 K)   k_F (10^8 cm^-1)   v_F (10^8 cm/sec)
Li        3.25      4.74       5.51           1.12               1.29
Na        3.93      3.24       3.77           0.92               1.07
K         4.86      2.12       2.46           0.75               0.86
Rb        5.20      1.85       2.15           0.70               0.81
Cs        5.62      1.59       1.84           0.65               0.75
Cu        2.67      7.00       8.16           1.36               1.57
Ag        3.02      5.49       6.38           1.20               1.39
Au        3.01      5.53       6.42           1.21               1.40
Be        1.87      14.3       16.6           1.94               2.25
Mg        2.66      7.08       8.23           1.36               1.58
Ca        3.27      4.69       5.44           1.11               1.28
Sr        3.57      3.93       4.57           1.02               1.18
Ba        3.71      3.64       4.23           0.98               1.13
Nb        3.07      5.32       6.18           1.18               1.37
Fe        2.12      11.1       13.0           1.71               1.98
Mn        2.14      10.9       12.7           1.70               1.96
Zn        2.30      9.47       11.0           1.58               1.83
Cd        2.59      7.47       8.68           1.40               1.62
Hg        2.65      7.13       8.29           1.37               1.58
Al        2.07      11.7       13.6           1.75               2.03
Ga        2.19      10.4       12.1           1.66               1.92
In        2.41      8.63       10.0           1.51               1.74
Tl        2.48      8.15       9.46           1.46               1.69
Sn        2.22      10.2       11.8           1.64               1.90
Pb        2.30      9.47       11.0           1.58               1.83
Bi        2.25      9.90       11.5           1.61               1.87
Sb        2.14      10.9       12.7           1.70               1.96

(a) The table entries are calculated from the values of r_s/a_0 given in Table 1.1 using m =
9.11 x 10^-28 grams.

assumes  that  the  electronic  energy  per  unit  volume  in  a  1-cm  cube  of  copper  is  the 
same  as  in  a  2-cm  cube). 

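Using (2.29) to evaluate (2.27), we find that the energy density of the electron gas is:

$$\frac{E}{V} = \frac{1}{4\pi^3}\int_{k<k_F}d\mathbf{k}\,\frac{\hbar^2k^2}{2m} = \frac{1}{\pi^2}\,\frac{\hbar^2k_F^5}{10m}. \tag{2.30}$$

To find the energy per electron, E/N, in the ground state, we must divide this by
$N/V = k_F^3/3\pi^2$, which gives

$$\frac{E}{N} = \frac{3}{10}\,\frac{\hbar^2k_F^2}{m} = \frac{3}{5}\,\mathcal{E}_F. \tag{2.31}$$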


We can also write this result as

$$\frac{E}{N} = \frac{3}{5}\,k_BT_F, \tag{2.32}$$

where $T_F$, the Fermi temperature, is

$$T_F = \frac{\mathcal{E}_F}{k_B} = \frac{58.2}{(r_s/a_0)^2}\times 10^4\ \text{K}. \tag{2.33}$$

Note, in contrast to this, that the energy per electron in a classical ideal gas, $\tfrac{3}{2}k_BT$,
vanishes at T = 0 and achieves a value as large as (2.32) only at $T = \tfrac{2}{5}T_F \approx 10^4$ K.
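
The factor 3/5 in (2.31) can be checked without performing the integral (2.30), by averaging $k^2$ directly over the allowed wave vectors inside a sphere. The sketch below does this in units of $2\pi/L$; the factor of 2 for spin cancels in the average, and the function name is our own.

```python
def mean_energy_over_fermi_energy(n_max):
    """Average of k^2 / k_F^2 over allowed k inside a sphere.

    n_max is the sphere radius k_F in units of 2*pi/L. Since the level
    energy is proportional to k^2, the result approximates (E/N) / E_F.
    """
    r2 = n_max * n_max
    total_k2 = 0
    count = 0
    for nx in range(-n_max, n_max + 1):
        for ny in range(-n_max, n_max + 1):
            for nz in range(-n_max, n_max + 1):
                k2 = nx * nx + ny * ny + nz * nz
                if k2 <= r2:
                    total_k2 += k2
                    count += 1
    return total_k2 / (count * r2)

for n_max in (5, 10, 20):
    ratio = mean_energy_over_fermi_energy(n_max)
    print(f"k_F = {n_max:2d} (2 pi / L): (E/N) / E_F = {ratio:.4f}")
```

As the sphere grows the ratio approaches 0.6, in agreement with (2.31).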

Given the ground-state energy E, one can calculate the pressure exerted by the
electron gas from the relation $P = -(\partial E/\partial V)_N$. Since $E = \tfrac{3}{5}N\mathcal{E}_F$ and $\mathcal{E}_F$ is proportional
to $k_F^2$, which depends on V only through a factor $n^{2/3} = (N/V)^{2/3}$, it follows
that16

$$P = \frac{2}{3}\,\frac{E}{V}. \tag{2.34}$$


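One can also calculate the compressibility, K, or bulk modulus, B = 1/K, defined
by:

$$B = \frac{1}{K} = -V\,\frac{\partial P}{\partial V}. \tag{2.35}$$

Since E is proportional to $V^{-2/3}$, Eq. (2.34) shows that P varies as $V^{-5/3}$, and therefore

$$B = \frac{5}{3}P = \frac{10}{9}\,\frac{E}{V} = \frac{2}{3}\,n\mathcal{E}_F \tag{2.36}$$

or

$$B = \left(\frac{6.13}{r_s/a_0}\right)^5 \times 10^{10}\ \text{dynes/cm}^2. \tag{2.37}$$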
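
To see how (2.36) leads to the numerical form (2.37), the sketch below evaluates $B = \tfrac{2}{3}n\mathcal{E}_F$ directly from $r_s/a_0$, using (2.22) for $k_F$ and (2.21) for n, and compares it with (2.37). The constants are standard CGS values that we supply; the function names are hypothetical.

```python
import math

HBAR = 1.054571817e-27   # erg sec (assumed standard value)
M_E = 9.1093837e-28      # electron mass, g
A_0 = 0.529177e-8        # Bohr radius, cm

def free_electron_bulk_modulus(rs_over_a0):
    """B = (2/3) n E_F in dynes/cm^2, from Eqs. (2.21), (2.22), (2.36)."""
    r_s = rs_over_a0 * A_0
    k_f = (9.0 * math.pi / 4.0) ** (1.0 / 3.0) / r_s   # Eq. (2.22)
    n = k_f ** 3 / (3.0 * math.pi ** 2)                # Eq. (2.21), cm^-3
    e_f = (HBAR * k_f) ** 2 / (2.0 * M_E)              # Fermi energy, erg
    return (2.0 / 3.0) * n * e_f                       # erg/cm^3 = dynes/cm^2

def bulk_modulus_from_2_37(rs_over_a0):
    """Numerical form, Eq. (2.37)."""
    return (6.13 / rs_over_a0) ** 5 * 1.0e10

# r_s/a_0 values for potassium and cesium
for name, x in (("K", 4.86), ("Cs", 5.62)):
    direct = free_electron_bulk_modulus(x) / 1.0e10
    formula = bulk_modulus_from_2_37(x) / 1.0e10
    print(f"{name}: direct {direct:.2f}, Eq. (2.37) {formula:.2f} "
          f"(units of 10^10 dynes/cm^2)")
```

The two evaluations agree to within the rounding of the coefficient 6.13.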

In Table 2.2 we compare the free electron bulk moduli (2.37) calculated from $r_s/a_0$,
with  the  measured  bulk  moduli,  for  several  metals.  The  agreement  for  the  heavier 
alkali  metals  is  fortuitously  good,  but  even  when  (2.37)  is  substantially  off,  as  it  is  in 


Table 2.2

BULK MODULI IN 10^10 DYNES/CM^2 FOR SOME
TYPICAL METALS(a)

METAL   FREE ELECTRON B   MEASURED B
Li      23.9              11.5
Na      9.23              6.42
K       3.19              2.81
Rb      2.28              1.92
Cs      1.54              1.43
Cu      63.8              134.3
Ag      34.5              99.9
Al      228               76.0

(a) The free electron value is that for a free electron gas at the observed
density of the metal, as calculated from Eq. (2.37).

16 At nonzero temperatures the pressure and energy density continue to obey this relation. See (2.101).
the noble metals, it is still of about the right order of magnitude (though it varies from
three  times  too  large  to  three  times  too  small,  through  the  table).  It  is  absurd  to  expect 
that  the  free  electron  gas  pressure  alone  should  completely  determine  the  resistance 
of  a  metal  to  compression,  but  Table  2.2  demonstrates  that  this  pressure  is  at  least  as 
important  as  any  other  effects. 


THERMAL  PROPERTIES  OF  THE  FREE  ELECTRON  GAS: 
THE  FERMI-DIRAC  DISTRIBUTION 


When the temperature is not zero it is necessary to examine the excited states of the
N-electron system as well as its ground state, for according to the basic principles
of statistical mechanics, if an N-particle system is in thermal equilibrium at temperature
T, then its properties should be calculated by averaging over all N-particle
stationary states, assigning to each state of energy E a weight $P_N(E)$ proportional to
$e^{-E/k_BT}$:

$$P_N(E) = \frac{e^{-E/k_BT}}{\sum_\alpha e^{-E_\alpha^N/k_BT}}. \tag{2.38}$$

(Here $E_\alpha^N$ is the energy of the $\alpha$th stationary state of the N-electron system, the sum
being over all such states.)

The denominator of (2.38) is known as the partition function, and is related to
the Helmholtz free energy, F = U - TS (where U is the internal energy and S, the
entropy) by

$$\sum_\alpha e^{-E_\alpha^N/k_BT} = e^{-F_N/k_BT}. \tag{2.39}$$

We can therefore write (2.38) more compactly as:

$$P_N(E) = e^{-(E - F_N)/k_BT}. \tag{2.40}$$

Because of the exclusion principle, to construct an N-electron state one must fill
N different one-electron levels. Thus each N-electron stationary state can be specified
by listing which of the N one-electron levels are filled in that state. A very useful
quantity to know is $f_i^N$, the probability of there being an electron in the particular
one-electron level i, when the N-electron system is in thermal equilibrium.17 This
probability is simply the sum of the independent probabilities of finding the
N-electron system in any one of those N-electron states in which the ith level is occupied:

$$f_i^N = \sum P_N(E_\alpha^N) \quad \text{(summation over all N-electron states } \alpha \text{ in which there is an electron in the one-electron level } i\text{).} \tag{2.41}$$

We can evaluate $f_i^N$ by the following three observations:


1.  Since  the  probability  of  an  electron  being  in  the  level  i  is  just  one  minus  the 
probability  of  no  electron  being  in  the  level  i  (those  being  the  only  two  possibilities 


17  In  the  case  we  are  interested  in  the  level  i  is  specified  by  the  electron’s  wave  vector  k  and  the  pro- 
jection  s  of  its  spin  along  some  axis. 
allowed by the exclusion principle) we could equally well write (2.41) as

$$f_i^N = 1 - \sum P_N(E_\gamma^N) \quad \text{(summation over all N-electron states } \gamma \text{ in which there is no electron in the one-electron level } i\text{).} \tag{2.42}$$

2. By taking any (N + 1)-electron state in which there is an electron in the
one-electron level i, we can construct an N-electron state in which there is no electron
in the level i, by simply removing the electron in the ith level, leaving the occupation
of all the other levels unaltered. Furthermore, any N-electron state with no electron
in the one-electron level i can be so constructed from just one (N + 1)-electron state
with an electron in the level i.18 Evidently the energies of any N-electron state and
the corresponding (N + 1)-electron state differ by just $\mathcal{E}_i$, the energy of the only
one-electron level whose occupation is different in the two states. Thus the set of
energies of all N-electron states with the level i unoccupied is the same as the set of
energies of all (N + 1)-electron states with the level i occupied, provided that each
energy in the latter set is reduced by $\mathcal{E}_i$. We can therefore rewrite (2.42) in the peculiar
form

$$f_i^N = 1 - \sum P_N(E_\alpha^{N+1} - \mathcal{E}_i) \quad \text{(summation over all (N + 1)-electron states } \alpha \text{ in which there is an electron in the one-electron level } i\text{).} \tag{2.43}$$

But Eq. (2.40) permits us to write the summand as

$$P_N(E_\alpha^{N+1} - \mathcal{E}_i) = e^{(\mathcal{E}_i - \mu)/k_BT}\,P_{N+1}(E_\alpha^{N+1}), \tag{2.44}$$

where $\mu$, known as the chemical potential, is given at temperature T by

$$\mu = F_{N+1} - F_N. \tag{2.45}$$

Substituting this into (2.43), we find:

$$f_i^N = 1 - e^{(\mathcal{E}_i - \mu)/k_BT}\sum P_{N+1}(E_\alpha^{N+1}) \quad \text{(summation over all (N + 1)-electron states } \alpha \text{ in which there is an electron in the one-electron level } i\text{).} \tag{2.46}$$

Comparing the summation in (2.46) with that in (2.41) one finds that (2.46) simply
asserts that

$$f_i^N = 1 - e^{(\mathcal{E}_i - \mu)/k_BT}\,f_i^{N+1}. \tag{2.47}$$

3. Equation (2.47) gives an exact relation between the probability of the
one-electron level i being occupied at temperature T in an N-electron system, and in an
(N + 1)-electron system. When N is very large (and we are typically interested in
N of the order of $10^{22}$) it is absurd to imagine that by the addition of a single extra
electron we could appreciably alter this probability for more than an insignificant
handful of one-electron levels.19 We may therefore replace $f_i^{N+1}$ by $f_i^N$ in (2.47), which


18 Namely the one obtained by occupying all those levels occupied in the N-electron state plus the
ith level.

19 For a typical level, changing N by one alters the probability of occupation by order 1/N. See
Problem 4.
makes it possible to solve for $f_i^N$:

$$f_i^N = \frac{1}{e^{(\mathcal{E}_i - \mu)/k_BT} + 1}. \tag{2.48}$$
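
The exact relation (2.47) can be verified by brute force for a system small enough to enumerate. The sketch below is a toy check under stated assumptions: the eight level energies and the value of $k_BT$ are arbitrary illustrative numbers, the states are generated by listing every way of filling N distinct levels, and $\mu$ is computed from (2.45) with $F_N$ obtained from (2.39). The function name is our own.

```python
import math
from itertools import combinations

def canonical_occupations(levels, n_electrons, kT):
    """Return (Z_N, [f_i^N]) by enumerating all N-electron states.

    Each state is a choice of n_electrons distinct one-electron levels
    (exclusion principle); its weight is exp(-E / kT), as in Eq. (2.38).
    """
    partition_sum = 0.0
    occupied_weight = [0.0] * len(levels)
    for state in combinations(range(len(levels)), n_electrons):
        energy = sum(levels[i] for i in state)
        weight = math.exp(-energy / kT)
        partition_sum += weight
        for i in state:
            occupied_weight[i] += weight
    return partition_sum, [w / partition_sum for w in occupied_weight]

# Hypothetical one-electron energies and temperature (arbitrary units).
levels = [0.0, 0.4, 0.9, 1.3, 1.8, 2.5, 3.1, 3.6]
kT = 0.5
N = 4

z_n, f_n = canonical_occupations(levels, N, kT)
z_n1, f_n1 = canonical_occupations(levels, N + 1, kT)

# Eq. (2.45) with F_N = -kT ln Z_N from Eq. (2.39).
mu = -kT * math.log(z_n1 / z_n)

for i, energy in enumerate(levels):
    rhs = 1.0 - math.exp((energy - mu) / kT) * f_n1[i]   # Eq. (2.47)
    print(f"level {i}: f_i^N = {f_n[i]:.6f}, right side of (2.47) = {rhs:.6f}")
```

The two columns agree to rounding error for every level. For so small a system $f_i^{N+1}$ differs appreciably from $f_i^N$, so the replacement that leads to (2.48) is justified only when N is very large.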


In subsequent formulas we shall drop the explicit reference to the N dependence
of $f_i$, which is, in any event, carried through the chemical potential $\mu$; see (2.45). The
value of N can always be computed, given the $f_i$, by noting that $f_i$ is the mean number
of electrons in the one-electron level20 i. Since the total number of electrons N is
just the sum over all levels of the mean number in each level,

$$N = \sum_i f_i = \sum_i \frac{1}{e^{(\mathcal{E}_i - \mu)/k_BT} + 1}, \tag{2.49}$$

which determines N as a function of the temperature T and chemical potential $\mu$. In
many applications, however, it is the temperature and N (or rather the density,
n = N/V) that are given. In such cases (2.49) is used to determine the chemical
potential $\mu$ as a function of n and T, permitting it to be eliminated from subsequent
formulas  in  favor  of  the  temperature  and  density.  However  the  chemical  potential 
is  of  considerable  thermodynamic  interest  in  its  own  right.  Some  of  its  important 
properties  are  summarized  in  Appendix  B.21 


THERMAL  PROPERTIES  OF  THE  FREE  ELECTRON  GAS: 
APPLICATIONS  OF  THE  FERMI-DIRAC  DISTRIBUTION 

In  a  gas  of  free  and  independent  electrons  the  one-electron  levels  are  specified  by 
the  wave  vector  k  and  spin  quantum  number  s,  with  energies  that  are  independent 
of  s  (in  the  absence  of  a  magnetic  field)  and  given  by  Eq.  (2.7);  i.e., 

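$$\mathcal{E}(\mathbf{k}) = \frac{\hbar^2k^2}{2m}. \tag{2.50}$$

We first verify that the distribution function (2.49) is consistent with the ground-state
(T = 0) properties derived above. In the ground state those and only those levels are
occupied with $\mathcal{E}(\mathbf{k}) \le \mathcal{E}_F$, so the ground-state distribution function must be

$$f_{\mathbf{k}s} = \begin{cases} 1, & \mathcal{E}(\mathbf{k}) < \mathcal{E}_F;\\ 0, & \mathcal{E}(\mathbf{k}) > \mathcal{E}_F. \end{cases} \tag{2.51}$$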


20  Proof:  A  level  can  contain  either  0  or  1  electron  (more  than  one  being  prohibited  by  the  exclusion 
principle).  The  mean  number  of  electrons  is  therefore  1  times  the  probability  of  1  electron  plus  0  times  the 
probability  of  0  electrons.  Thus  the  mean  number  of  electrons  in  the  level  is  numerically  equal  to  the 
probability  of  its  being  occupied.  Note  that  this  would  not  be  so  if  multiple  occupation  of  levels  were 
permitted. 

21  The  chemical  potential  plays  a  more  fundamental  role  when  the  distribution  (2.48)  is  derived  in 
the  grand  canonical  ensemble.  See,  for  example,  F.  Reif,  Statistical  and  Thermal  Physics ,  McGraw-Hill, 
New  York,  1965,  p.  350.  Our  somewhat  unorthodox  derivation,  which  can  also  be  found  in  Reif,  uses  only 
the  canonical  ensemble. 
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On  the  other  hand,  as  T—  0.  the  limiting  form  of  the  Fermi-Dirac  distribution 
(2.48)  is 

lim  /k,  =  1,  £(k)  <  fi ; 

r~°  =  0,  £(k)  >  n- 


(2.52) 


For  these  to  be  consistent  it  is  necessary  that 


(2.53) 


lim  /t  =  £F. 
r-o 

We  shall  see  shortly  that  for  metals  the  chemical  potential  remains  equal  to  the 
Fermi  energy  to  a  high  degree  of  precision,  all  the  way  up  to  room  temperature.  As 
a  result,  people  frequently  fail  to  make  any  distinction  between  the  two  when  dealing 
with  metals.  This,  however,  can  be  dangerously  misleading.  In  precise  calculations 
it is essential to keep track of the extent to which the chemical potential, $\mu$, differs from its zero temperature value, $\mathcal{E}_F$.

The  most  important  single  application  of  Fermi-Dirac  statistics  is  the  calculation 
of  the  electronic  contribution  to  the  constant-volume  specific  heat  of  a  metal. 


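$$c_v = \frac{T}{V}\left(\frac{\partial S}{\partial T}\right)_V = \left(\frac{\partial u}{\partial T}\right)_V, \qquad u = \frac{U}{V}. \tag{2.54}$$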


In  the  independent  electron  approximation  the  internal  energy  U  is  just  the  sum 
over one-electron levels of $\mathcal{E}(\mathbf{k})$ times the mean number of electrons in the level²²:

$$U = 2\sum_{\mathbf{k}} \mathcal{E}(\mathbf{k})\, f(\mathcal{E}(\mathbf{k})). \tag{2.55}$$


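We have introduced the Fermi function $f(\mathcal{E})$ to emphasize that $f_{\mathbf{k}}$ depends on $\mathbf{k}$ only through the electronic energy $\mathcal{E}(\mathbf{k})$:

$$f(\mathcal{E}) = \frac{1}{e^{(\mathcal{E}-\mu)/k_BT} + 1}. \tag{2.56}$$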
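As a numerical aside, the Fermi function (2.56) can be sketched in a few lines of Python. The function name `fermi`, the convention that $k_BT$ is passed in the same energy units as $\mathcal{E}$ and $\mu$, and the value 1/2 returned at exactly $\mathcal{E} = \mu$ when $T = 0$ are choices made for this illustration only.

```python
import math

def fermi(energy, mu, kT):
    """Fermi function f(E) = 1 / (exp((E - mu)/kT) + 1), Eq. (2.56).

    energy, mu and kT must share one energy unit. kT == 0 returns the
    ground-state step function of Eq. (2.51), with 1/2 at E == mu
    (a convention chosen here, not fixed by the text).
    """
    if kT == 0.0:
        if energy < mu:
            return 1.0
        if energy > mu:
            return 0.0
        return 0.5
    x = (energy - mu) / kT
    # Two algebraically equivalent forms; each keeps exp() from overflowing.
    if x >= 0.0:
        e = math.exp(-x)
        return e / (1.0 + e)
    return 1.0 / (1.0 + math.exp(x))

if __name__ == "__main__":
    mu = 1.0
    for kT in (0.0, 0.01, 0.1):
        row = [round(fermi(e, mu, kT), 4) for e in (0.9, 0.99, 1.0, 1.01, 1.1)]
        print(f"kT/mu = {kT}: {row}")
```

Lowering `kT` in the loop shows the occupation sharpening toward the step function, which is the content of the limit (2.52).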


If  we  divide  both  sides  of  (2.55)  by  the  volume  V,  then  (2.29)  permits  us  to  write  the 
energy  density  u  =  U/V as 

$$u = \int \frac{d\mathbf{k}}{4\pi^3}\, \mathcal{E}(\mathbf{k})\, f(\mathcal{E}(\mathbf{k})). \tag{2.57}$$


If  we  also  divide  both  sides  of  (2.49)  by  V,  then  we  can  supplement  (2.57)  by  an  equation 
for  the  electronic  density  n  =  N/V,  and  use  it  to  eliminate  the  chemical  potential : 


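$$n = \int \frac{d\mathbf{k}}{4\pi^3}\, f(\mathcal{E}(\mathbf{k})). \tag{2.58}$$

In evaluating integrals like (2.57) and (2.58) of the form

$$\int \frac{d\mathbf{k}}{4\pi^3}\, F(\mathcal{E}(\mathbf{k})), \tag{2.59}$$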


22  As usual, the factor of 2 reflects the fact that each k-level can contain two electrons of opposite spin orientations.
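one often exploits the fact that the integrand depends on $\mathbf{k}$ only through the electronic energy $\mathcal{E} = \hbar^2k^2/2m$, by evaluating the integral in spherical coordinates and changing variables from $k$ to $\mathcal{E}$:

$$\int \frac{d\mathbf{k}}{4\pi^3}\, F(\mathcal{E}(\mathbf{k})) = \int_0^\infty \frac{k^2\,dk}{\pi^2}\, F(\mathcal{E}(\mathbf{k})) = \int_{-\infty}^{\infty} d\mathcal{E}\, g(\mathcal{E})\, F(\mathcal{E}). \tag{2.60}$$

Here

$$g(\mathcal{E}) = \begin{cases} \dfrac{m}{\hbar^2\pi^2}\sqrt{\dfrac{2m\mathcal{E}}{\hbar^2}}, & \mathcal{E} > 0; \\ 0, & \mathcal{E} < 0. \end{cases} \tag{2.61}$$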


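Since the integral (2.59) is an evaluation of $(1/V)\sum_{\mathbf{k}s} F(\mathcal{E}(\mathbf{k}))$, the form in (2.60) shows that

$$g(\mathcal{E})\,d\mathcal{E} = \frac{1}{V} \times [\text{the number of one-electron levels in the energy range from } \mathcal{E} \text{ to } \mathcal{E} + d\mathcal{E}]. \tag{2.62}$$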


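For this reason $g(\mathcal{E})$ is known as the density of levels per unit volume (or often simply as the density of levels). A dimensionally more transparent way of writing $g$ is

$$g(\mathcal{E}) = \begin{cases} \dfrac{3}{2}\,\dfrac{n}{\mathcal{E}_F}\left(\dfrac{\mathcal{E}}{\mathcal{E}_F}\right)^{1/2}, & \mathcal{E} > 0; \\ 0, & \mathcal{E} < 0, \end{cases} \tag{2.63}$$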

where $\mathcal{E}_F$ and $k_F$ are defined by the zero-temperature equations (2.21) and (2.25). A quantity of particular numerical importance is the density of levels at the Fermi energy, which (2.61) and (2.63) give in either of the two equivalent forms:

$$g(\mathcal{E}_F) = \frac{m k_F}{\hbar^2\pi^2} \tag{2.64}$$

or

$$g(\mathcal{E}_F) = \frac{3}{2}\,\frac{n}{\mathcal{E}_F}. \tag{2.65}$$
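The equivalence of (2.64) and (2.65) is easy to confirm numerically. The sketch below assumes the free electron relations $n = k_F^3/3\pi^2$ and $\mathcal{E}_F = \hbar^2k_F^2/2m$ from earlier in the chapter, uses SI values of the constants, and takes a conduction electron density of $8.47 \times 10^{28}\ \mathrm{m^{-3}}$ (roughly that of copper) purely as an illustrative input; the function name is hypothetical.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
M_E = 9.1093837015e-31   # electron mass, kg
EV = 1.602176634e-19     # one electron volt in joules

def free_electron_fermi_quantities(n):
    """Return (k_F, E_F, g via (2.64), g via (2.65)) for density n in m^-3."""
    k_f = (3.0 * math.pi**2 * n) ** (1.0 / 3.0)      # from n = k_F^3 / (3 pi^2)
    e_f = HBAR**2 * k_f**2 / (2.0 * M_E)             # Fermi energy, J
    g_264 = M_E * k_f / (HBAR**2 * math.pi**2)       # Eq. (2.64), levels per J per m^3
    g_265 = 1.5 * n / e_f                            # Eq. (2.65), same units
    return k_f, e_f, g_264, g_265

if __name__ == "__main__":
    n_example = 8.47e28  # illustrative density (assumed, roughly copper)
    k_f, e_f, g_a, g_b = free_electron_fermi_quantities(n_example)
    print(f"k_F = {k_f:.3e} m^-1, E_F = {e_f / EV:.2f} eV")
    print(f"g(E_F): (2.64) -> {g_a:.4e}, (2.65) -> {g_b:.4e} per J per m^3")
```

Both forms agree to rounding error for any positive `n`, since (2.65) is just (2.64) rewritten with $n$ and $\mathcal{E}_F$ in place of $k_F$.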


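Using this notation, we rewrite (2.57) and (2.58) as:

$$u = \int_{-\infty}^{\infty} d\mathcal{E}\, g(\mathcal{E})\, \mathcal{E}\, f(\mathcal{E}) \tag{2.66}$$

and

$$n = \int_{-\infty}^{\infty} d\mathcal{E}\, g(\mathcal{E})\, f(\mathcal{E}). \tag{2.67}$$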


We do this both for notational simplicity and because in this form the free electron approximation enters only through the particular evaluation (2.61) or (2.63) of the density of levels $g$. We can define a density of levels, via (2.62), in terms of which (2.66) and (2.67) remain valid for any set of noninteracting (that is, independent) electrons.²³ Thus we shall later be able to apply results deduced from (2.66) and (2.67) to considerably more sophisticated models of independent electrons in metals.

Figure 2.3
The Fermi function, $f(\mathcal{E}) = 1/[e^{(\mathcal{E}-\mu)/k_BT} + 1]$, versus $\mathcal{E}$ for given $\mu$, at (a) $T = 0$ and (b) $k_BT \approx 0.01\mu$ (of order room temperature, at typical metallic densities). The two curves differ only in a region of order $k_BT$ about $\mu$.

In general, the integrals (2.66) and (2.67) have a rather complex structure. There is, however, a simple systematic expansion that exploits the fact that at almost all temperatures of interest in metals, $T$ is very much smaller than the Fermi temperature (2.33). In Figure 2.3 the Fermi function $f(\mathcal{E})$ is plotted at $T = 0$ and at room temperature for typical metallic densities ($k_BT/\mu \approx 0.01$). Evidently $f$ differs from its zero temperature form only in a small region about $\mu$ of width a few $k_BT$. Thus the way in which integrals of the form $\int_{-\infty}^{\infty} H(\mathcal{E}) f(\mathcal{E})\, d\mathcal{E}$ differ from their zero temperature values, $\int_{-\infty}^{\mathcal{E}_F} H(\mathcal{E})\, d\mathcal{E}$, will be entirely determined by the form of $H(\mathcal{E})$ near $\mathcal{E} = \mu$. If $H(\mathcal{E})$ does not vary rapidly in the energy range of the order of $k_BT$ about $\mu$, the temperature dependence of the integral should be given quite accurately by replacing $H(\mathcal{E})$ by the first few terms in its Taylor expansion about $\mathcal{E} = \mu$:

$$H(\mathcal{E}) = \sum_{n=0}^{\infty} \left.\frac{d^n}{d\mathcal{E}^n} H(\mathcal{E})\right|_{\mathcal{E}=\mu} \frac{(\mathcal{E}-\mu)^n}{n!}. \tag{2.68}$$

This procedure is carried out in Appendix C. The result is a series of the form:

$$\int_{-\infty}^{\infty} H(\mathcal{E}) f(\mathcal{E})\, d\mathcal{E} = \int_{-\infty}^{\mu} H(\mathcal{E})\, d\mathcal{E} + \sum_{n=1}^{\infty} (k_BT)^{2n} a_n \left.\frac{d^{2n-1}}{d\mathcal{E}^{2n-1}} H(\mathcal{E})\right|_{\mathcal{E}=\mu} \tag{2.69}$$

which is known as the Sommerfeld expansion.²⁴ The $a_n$ are dimensionless constants of the order of unity. The functions $H$ one typically encounters have major variations on an energy scale of the order of $\mu$, and generally $(d/d\mathcal{E})^n H(\mathcal{E})|_{\mathcal{E}=\mu}$ is of the order of $H(\mu)/\mu^n$. When this is the case, successive terms in the Sommerfeld expansion are

23  See  Chapter  8. 

24  The expansion is not always exact, but is highly reliable unless $H(\mathcal{E})$ has a singularity very close to $\mathcal{E} = \mu$. If, for example, $H$ is singular at $\mathcal{E} = 0$ (as is the free electron density of levels (2.63)) then the expansion will neglect terms of the order of $\exp(-\mu/k_BT)$, which are typically of order $e^{-100} \sim 10^{-43}$. See also Problem 1.
smaller by $O(k_BT/\mu)^2$ which is $O(10^{-4})$ at room temperature. Consequently, in actual calculations only the first and (very occasionally) the second terms are retained in the sum in (2.69). The explicit form for these is (Appendix C):

$$\int_{-\infty}^{\infty} H(\mathcal{E}) f(\mathcal{E})\, d\mathcal{E} = \int_{-\infty}^{\mu} H(\mathcal{E})\, d\mathcal{E} + \frac{\pi^2}{6}(k_BT)^2 H'(\mu) + \frac{7\pi^4}{360}(k_BT)^4 H'''(\mu) + O\!\left(\frac{k_BT}{\mu}\right)^6. \tag{2.70}$$
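A direct numerical integration gives a feel for how quickly the series (2.70) converges. The sketch below is an illustration under stated assumptions rather than part of the derivation: the test function $H(\mathcal{E}) = \mathcal{E}^{3/2}$ for $\mathcal{E} > 0$ (zero otherwise), the units $\mu = 1$, the value $k_BT = 0.02\mu$, and the helper names are all chosen here for convenience.

```python
import math

def fermi(energy, mu, kT):
    """Fermi function (2.56), written so that exp() cannot overflow."""
    x = (energy - mu) / kT
    if x >= 0.0:
        e = math.exp(-x)
        return e / (1.0 + e)
    return 1.0 / (1.0 + math.exp(x))

def simpson(func, a, b, n=20000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * func(a + i * h)
    return total * h / 3.0

MU = 1.0    # chemical potential; sets the unit of energy
KT = 0.02   # k_B T in units of mu (illustrative value)

def H(energy):
    """Test function H(E) = E^(3/2) for E > 0; it vanishes for E < 0."""
    return energy ** 1.5

H1_AT_MU = 1.5      # H'(mu)   = (3/2) mu^(1/2)  at mu = 1
H3_AT_MU = -0.375   # H'''(mu) = -(3/8) mu^(-3/2) at mu = 1

# Left side of (2.70); the Fermi tail beyond mu + 40 kT is negligible.
exact = simpson(lambda e: H(e) * fermi(e, MU, KT), 0.0, MU + 40.0 * KT)

zero_temperature = 0.4  # integral of E^(3/2) from 0 to mu = (2/5) mu^(5/2)
one_term = zero_temperature + (math.pi**2 / 6.0) * KT**2 * H1_AT_MU
two_terms = one_term + (7.0 * math.pi**4 / 360.0) * KT**4 * H3_AT_MU

if __name__ == "__main__":
    print(f"numerical integral          : {exact:.12f}")
    print(f"error keeping the T^2 term  : {abs(exact - one_term):.3e}")
    print(f"error keeping the T^4 term  : {abs(exact - two_terms):.3e}")
```

Each additional term should reduce the residual by a factor of order $(k_BT/\mu)^2$, in line with the estimate in the text.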


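To evaluate the specific heat of a metal at temperatures small compared with $T_F$ we apply the Sommerfeld expansion (2.70) to the electronic energy and number densities (Eqs. (2.66) and (2.67)):

$$u = \int_0^{\mu} \mathcal{E} g(\mathcal{E})\, d\mathcal{E} + \frac{\pi^2}{6}(k_BT)^2\left[\mu g'(\mu) + g(\mu)\right] + O(T^4), \tag{2.71}$$

$$n = \int_0^{\mu} g(\mathcal{E})\, d\mathcal{E} + \frac{\pi^2}{6}(k_BT)^2 g'(\mu) + O(T^4). \tag{2.72}$$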


Equation (2.72), as we shall presently see in detail, implies that $\mu$ differs from its $T = 0$ value, $\mathcal{E}_F$, by terms of order $T^2$. Thus, correctly to order $T^2$, we may write

$$\int_0^{\mu} H(\mathcal{E})\, d\mathcal{E} = \int_0^{\mathcal{E}_F} H(\mathcal{E})\, d\mathcal{E} + (\mu - \mathcal{E}_F) H(\mathcal{E}_F). \tag{2.73}$$


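If we apply this expansion to the integrals in (2.71) and (2.72), and replace $\mu$ by $\mathcal{E}_F$ in the terms already of order $T^2$ in these equations, we find

$$u = \int_0^{\mathcal{E}_F} \mathcal{E} g(\mathcal{E})\, d\mathcal{E} + \mathcal{E}_F\left\{(\mu - \mathcal{E}_F) g(\mathcal{E}_F) + \frac{\pi^2}{6}(k_BT)^2 g'(\mathcal{E}_F)\right\} + \frac{\pi^2}{6}(k_BT)^2 g(\mathcal{E}_F) + O(T^4), \tag{2.74}$$

$$n = \int_0^{\mathcal{E}_F} g(\mathcal{E})\, d\mathcal{E} + \left\{(\mu - \mathcal{E}_F) g(\mathcal{E}_F) + \frac{\pi^2}{6}(k_BT)^2 g'(\mathcal{E}_F)\right\}. \tag{2.75}$$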


The  temperature-independent  first  terms  on  the  right  sides  of  (2.74)  and  (2.75)  are 
just  the  values  of  u  and  n  in  the  ground  state.  Since  we  are  calculating  the  specific 
heat  at  constant  density,  n  is  independent  of  temperature,  and  (2.75)  reduces  to 


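$$0 = (\mu - \mathcal{E}_F) g(\mathcal{E}_F) + \frac{\pi^2}{6}(k_BT)^2 g'(\mathcal{E}_F), \tag{2.76}$$

which determines the deviation of the chemical potential from $\mathcal{E}_F$:

$$\mu = \mathcal{E}_F - \frac{\pi^2}{6}(k_BT)^2\, \frac{g'(\mathcal{E}_F)}{g(\mathcal{E}_F)}. \tag{2.77}$$

Since for free electrons $g(\mathcal{E})$ varies as $\mathcal{E}^{1/2}$ (see Eq. (2.63)), this gives

$$\mu = \mathcal{E}_F\left[1 - \frac{1}{3}\left(\frac{\pi k_BT}{2\mathcal{E}_F}\right)^2\right], \tag{2.78}$$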

which  is,  as  we  asserted  above,  a  shift  of  the  order  of  T2  and  typically  only  about 
0.01  percent,  even  at  room  temperature. 
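The size of this shift can be checked by solving the constant-density condition (2.67) for $\mu$ numerically and comparing with (2.78). The sketch below is a rough check under stated assumptions: energies are measured in units of $\mathcal{E}_F$ and densities in units of $n$, so that (2.63) reads $g(\mathcal{E}) = \tfrac{3}{2}\mathcal{E}^{1/2}$; the substitution $\mathcal{E} = x^2$, the bisection bracket, and all function names are choices made for this illustration.

```python
import math

def fermi(energy, mu, kT):
    """Fermi function (2.56), written so that exp() cannot overflow."""
    x = (energy - mu) / kT
    if x >= 0.0:
        e = math.exp(-x)
        return e / (1.0 + e)
    return 1.0 / (1.0 + math.exp(x))

def simpson(func, a, b, n):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * func(a + i * h)
    return total * h / 3.0

def density(mu, t, n_steps=4000):
    """Right side of (2.67) in reduced units (E_F = 1, n = 1 at T = 0).

    With g(E) = (3/2) sqrt(E) and E = x^2 the integrand becomes
    3 x^2 f(x^2), which is smooth at the lower limit.
    """
    x_max = math.sqrt(mu + 40.0 * t)   # Fermi tail beyond this is negligible
    return simpson(lambda x: 3.0 * x * x * fermi(x * x, mu, t), 0.0, x_max, n_steps)

def chemical_potential(t):
    """Solve density(mu, t) = 1 for mu by bisection; t = k_B T / E_F."""
    lo, hi = 0.5, 1.5
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if density(mid, t) < 1.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def mu_sommerfeld(t):
    """Eq. (2.78) in units of E_F."""
    return 1.0 - (math.pi * t / 2.0) ** 2 / 3.0

if __name__ == "__main__":
    for t in (0.01, 0.05, 0.1):
        print(f"t = {t}: numerical {chemical_potential(t):.8f}, "
              f"Eq. (2.78) {mu_sommerfeld(t):.8f}")
```

At $k_BT/\mathcal{E}_F = 0.01$ the two values should agree to well within the size of the shift itself; the agreement degrades slowly as $t$ grows, reflecting the neglected $O(T^4)$ terms.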

Equation  (2.76)  sets  the  term  in  braces  in  (2.74)  equal  to  zero,  thereby  simplifying 
the  form  of  the  thermal  energy  density  at  constant  electronic  density: 


$$u = u_0 + \frac{\pi^2}{6}(k_BT)^2 g(\mathcal{E}_F), \tag{2.79}$$

where $u_0$ is the energy density in the ground state. The specific heat of the electron gas is therefore

$$c_v = \left(\frac{\partial u}{\partial T}\right)_n = \frac{\pi^2}{3} k_B^2 T g(\mathcal{E}_F) \tag{2.80}$$

or, for free electrons (see (2.65)),

$$c_v = \frac{\pi^2}{2}\left(\frac{k_BT}{\mathcal{E}_F}\right) n k_B. \tag{2.81}$$

Comparing this with the classical result for an ideal gas, $c_v = 3nk_B/2$, we see that the effect of Fermi-Dirac statistics is to depress the specific heat by a factor of $(\pi^2/3)(k_BT/\mathcal{E}_F)$, which is proportional to the temperature, and even at room temperature is only of order $10^{-2}$. This explains the absence of any observable contribution of the electronic degrees of freedom to the specific heat of a metal at room temperature.

If one is willing to dispense with the precise numerical coefficient, one can understand this behavior of the specific heat quite simply from the temperature dependence of the Fermi function itself. The increase in energy of the electrons when the temperature is raised from $T = 0$ comes about entirely because some electrons with energies within $O(k_BT)$ below $\mathcal{E}_F$ (the darkly shaded region of Figure 2.4) have been excited to an energy range of $O(k_BT)$ above $\mathcal{E}_F$ (the lightly shaded region of Figure 2.4). The number of electrons per unit volume that have been so excited is the width, $k_BT$, of the energy interval times the density of levels per unit volume $g(\mathcal{E}_F)$. Furthermore, the excitation energy is of order $k_BT$, and hence the total thermal energy density is of order $g(\mathcal{E}_F)(k_BT)^2$ above the ground-state energy. This misses the precise result (2.79) by a factor of $\pi^2/6$, but it gives a simple physical picture, and is useful for rough estimates.


Figure 2.4
The Fermi function at nonzero $T$. The distribution differs from its $T = 0$ form because some electrons just below $\mathcal{E}_F$ (darkly shaded region) have been excited to levels just above $\mathcal{E}_F$ (lightly shaded region).
The prediction of a linear specific heat is one of the most important consequences of Fermi-Dirac statistics, and provides a further simple test of the electron gas theory of a metal, provided one can be sure that degrees of freedom other than the electronic ones do not make comparable or even bigger contributions. As it happens, the ionic degrees of freedom completely dominate the specific heat at high temperatures. However, well below room temperature their contribution falls off as the cube of the

Table 2.3

SOME ROUGH EXPERIMENTAL VALUES FOR THE COEFFICIENT OF THE LINEAR TERM IN T OF THE MOLAR SPECIFIC HEATS OF METALS, AND THE VALUES GIVEN BY SIMPLE FREE ELECTRON THEORY

(γ in units of 10⁻⁴ cal-mole⁻¹-K⁻²)

ELEMENT   FREE ELECTRON γ   MEASURED γ   RATIO(a) (m*/m)
Li        1.8               4.2          2.3
Na        2.6               3.5          1.3
K         4.0               4.7          1.2
Rb        4.6               5.8          1.3
Cs        5.3               7.7          1.5
Cu        1.2               1.6          1.3
Ag        1.5               1.6          1.1
Au        1.5               1.6          1.1
Be        1.2               0.5          0.42
Mg        2.4               3.2          1.3
Ca        3.6               6.5          1.8
Sr        4.3               8.7          2.0
Ba        4.7               6.5          1.4
Nb        1.6               20           12
Fe        1.5               12           8.0
Mn        1.5               40           27
Zn        1.8               1.4          0.78
Cd        2.3               1.7          0.74
Hg        2.4               5.0          2.1
Al        2.2               3.0          1.4
Ga        2.4               1.5          0.62
In        2.9               4.3          1.5
Tl        3.1               3.5          1.1
Sn        3.3               4.4          1.3
Pb        3.6               7.0          1.9
Bi        4.3               0.2          0.047
Sb        3.9               1.5          0.38

(a)  Since the theoretical value of γ is proportional to the density of levels at the Fermi level, which in turn is proportional to the electronic mass m, one sometimes defines a specific heat effective mass m* so that m*/m is the ratio of the measured γ to the free electron γ. Beware of identifying this specific heat effective mass with any of the many other effective masses used in solid-state theory. (See, for example, the index entries under “effective mass.”)
temperature (Chapter 23), and at very low temperatures it drops below the electronic contribution, which only decreases linearly with $T$. In order to separate out these two contributions it has become the practice to plot $c_v/T$ against $T^2$, for if the electronic and ionic contributions together result in the low-temperature form,

$$c_v = \gamma T + AT^3, \tag{2.82}$$

then

$$\frac{c_v}{T} = \gamma + AT^2. \tag{2.83}$$

One can thus find $\gamma$ by extrapolating the $c_v/T$ curve linearly down to $T^2 = 0$, and noting where it intercepts the $c_v/T$-axis. Measured metallic specific heats typically contain a linear term that becomes comparable to the cubic one at a few degrees Kelvin.²⁵
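The extrapolation described above amounts to a straight-line fit of $c_v/T$ against $T^2$. A minimal sketch follows; the data are synthetic, generated from (2.82) with made-up values of $\gamma$ and $A$ (they are not measurements of any metal), and the function name `fit_gamma` is hypothetical.

```python
def fit_gamma(temperatures, heat_capacities):
    """Least-squares line through the points (T^2, c_v / T).

    Returns (gamma, A): the intercept and slope of Eq. (2.83).
    """
    xs = [t * t for t in temperatures]
    ys = [c / t for c, t in zip(heat_capacities, temperatures)]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return intercept, slope

# Synthetic low-temperature data built from c_v = gamma*T + A*T^3, Eq. (2.82).
GAMMA_TRUE = 1.6e-4   # hypothetical value, cal/(mole K^2)
A_TRUE = 1.1e-5       # hypothetical value, cal/(mole K^4)
temps = [1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0]              # kelvin
c_v = [GAMMA_TRUE * t + A_TRUE * t**3 for t in temps]

gamma_fit, a_fit = fit_gamma(temps, c_v)

if __name__ == "__main__":
    print(f"gamma = {gamma_fit:.3e}, A = {a_fit:.3e}")
    print(f"linear and cubic terms are equal at T = {(gamma_fit / a_fit) ** 0.5:.2f} K")
```

With real data the points scatter about the line, and the intercept carries an uncertainty that this noise-free example does not show.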

Specific heat data are usually quoted in joules (or calories) per mole per degree Kelvin. Since a mole of free electron metal contains $ZN_A$ conduction electrons (where $Z$ is the valence and $N_A$ is Avogadro’s number) and occupies a volume $ZN_A/n$, we must multiply the heat capacity per unit volume, $c_v$, by $ZN_A/n$, in order to get the heat capacity per mole, $C$:

$$C = \frac{\pi^2}{3}\, ZR\, \frac{k_BT g(\mathcal{E}_F)}{n}, \tag{2.84}$$


where $R = k_BN_A = 8.314$ joules/mole-K $= 1.99$ calories/mole-K. Using the free electron density of levels (2.65) and the evaluation (2.33) of $\mathcal{E}_F/k_B$, we find a free electron contribution to the heat capacity per mole of $C = \gamma T$, where

$$\gamma = \frac{1}{2}\pi^2 R\, \frac{Z}{T_F} = 0.169\, Z \left(\frac{r_s}{a_0}\right)^2 \times 10^{-4}\ \text{cal-mole}^{-1}\text{-K}^{-2}. \tag{2.85}$$

Some rough, measured values of $\gamma$ are displayed in Table 2.3, together with the free electron values implied by (2.85) and the values of $r_s/a_0$ in Table 1.1. Note that the alkali metals continue to be reasonably well described by free electron theory, as do the noble metals (Cu, Ag, Au). Note also, however, the striking disparities in Fe and Mn (experiment of the order of 10 times theory) as well as those in Bi and Sb (experiment of the order of 0.1 times theory). These large deviations are now qualitatively understood on fairly general grounds, and we shall return to them in Chapter 15.
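The free electron column of Table 2.3 can be reproduced from (2.85) in a few lines. In the sketch below the $r_s/a_0$ values are assumed inputs of the kind listed in Table 1.1 (that table is not reproduced in this section, so treat the numbers as illustrative), the measured values are copied from Table 2.3, and the function name is hypothetical.

```python
def gamma_free_electron(valence, rs_over_a0):
    """Free electron gamma from Eq. (2.85), in units of 1e-4 cal/(mole K^2)."""
    return 0.169 * valence * rs_over_a0 ** 2

# (valence Z, assumed r_s/a_0, measured gamma from Table 2.3)
METALS = {
    "Li": (1, 3.25, 4.2),
    "Na": (1, 3.93, 3.5),
    "K":  (1, 4.86, 4.7),
    "Cu": (1, 2.67, 1.6),
}

if __name__ == "__main__":
    for name, (z, rs, measured) in METALS.items():
        theory = gamma_free_electron(z, rs)
        # measured/theory is the specific heat effective mass ratio m*/m
        print(f"{name}: free electron {theory:.1f}, measured {measured}, "
              f"ratio {measured / theory:.2f}")
```

Ratios computed this way can differ in the last digit from those in the table, which appear to have been formed from the already rounded free electron values.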
To find the velocity distribution for electrons in metals, consider a small²⁶ volume element of k-space about a point $\mathbf{k}$, of volume $d\mathbf{k}$. Allowing for the twofold spin


25  Since constant density is hard to arrange experimentally, one generally measures the specific heat at constant pressure, $c_p$. However, one can show (Problem 2) that for the metallic free electron gas at room temperature and below, $c_p/c_v = 1 + O(k_BT/\mathcal{E}_F)^2$. Thus at temperatures where the electronic contribution to the specific heat becomes observable (a few degrees Kelvin) the two specific heats differ by a negligible amount.

26  Small  enough  that  the  Fermi  function  and  other  functions  of  physical  interest  vary  negligibly 
throughout  the  volume  element,  but  large  enough  that  it  contains  very  many  one-electron  levels. 
degeneracy, the number of one-electron levels in this volume element is (see (2.18))

$$\left(\frac{V}{4\pi^3}\right) d\mathbf{k}. \tag{2.86}$$

The probability of each level being occupied is just $f(\mathcal{E}(\mathbf{k}))$, and therefore the total number of electrons in the k-space volume element is

$$\frac{V}{4\pi^3}\, f(\mathcal{E}(\mathbf{k}))\, d\mathbf{k}, \qquad \mathcal{E}(\mathbf{k}) = \frac{\hbar^2k^2}{2m}. \tag{2.87}$$


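Since the velocity of a free electron with wave vector $\mathbf{k}$ is $\mathbf{v} = \hbar\mathbf{k}/m$ (Eq. (2.12)), the number of electrons in an element of volume $d\mathbf{v}$ about $\mathbf{v}$ is the same as the number in an element of volume $d\mathbf{k} = (m/\hbar)^3\, d\mathbf{v}$ about $\mathbf{k} = m\mathbf{v}/\hbar$. Consequently the total number of electrons per unit volume of real space in a velocity space element of volume $d\mathbf{v}$ about $\mathbf{v}$ is

$$f(\mathbf{v})\, d\mathbf{v}, \tag{2.88}$$

where

$$f(\mathbf{v}) = \frac{(m/\hbar)^3}{4\pi^3}\, \frac{1}{\exp\left[(\tfrac{1}{2}mv^2 - \mu)/k_BT\right] + 1}. \tag{2.89}$$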


Sommerfeld reexamined the Drude model, replacing the classical Maxwell-Boltzmann velocity distribution (2.1) by the Fermi-Dirac distribution (2.89). Using a velocity distribution constructed from quantum-mechanical arguments in an otherwise classical theory requires some justification.²⁷ One can describe the motion of an electron classically if one can specify its position and momentum as accurately as necessary, without violating the uncertainty principle.²⁸

A typical electron in a metal has a momentum of the order of $\hbar k_F$, so the uncertainty in its momentum, $\Delta p$, must be small compared with $\hbar k_F$ for a good classical description. Since, from (2.22), $k_F \sim 1/r_s$, the uncertainty in position must satisfy

$$\Delta x \sim \frac{\hbar}{\Delta p} \gg \frac{1}{k_F} \sim r_s, \tag{2.90}$$


where, from (1.2), $r_s$ is of the order of the mean interelectronic distance, i.e., angstroms. Thus a classical description is impossible if one has to consider electrons localized to within atomic distances (also of the order of angstroms). However, the conduction electrons in a metal are not bound to particular ions, but can wander freely through the volume of the metal. In a macroscopic specimen, for most purposes there is no need to specify their position to an accuracy of $10^{-8}$ cm. The Drude model assumes a knowledge of the position of an electron primarily in only the two following contexts:


27  A detailed analytical justification is fairly complicated to construct, just as it is a fairly subtle matter to specify with generality and precision when the quantum theory can be replaced by its classical limit. The underlying physics, however, is straightforward.

28  There is also a somewhat more specialized limitation on the use of classical mechanics in describing conduction electrons. The energy of motion of an electron in the plane perpendicular to a uniform applied magnetic field is quantized in multiples of $\hbar\omega_c$ (Chapter 14). Even for fields as large as $10^4$ gauss, this is a very small energy, but in suitably prepared samples at temperatures of a few degrees Kelvin, these quantum effects become observable, and are, in fact, of great practical importance.
1.  When spatially varying electromagnetic fields or temperature gradients are applied, one must be able to specify the position of an electron on a scale small compared with the distance $\lambda$ over which the fields or temperature gradients vary. For most applications the applied fields or temperature gradients do not vary appreciably on the scale of angstroms, and the necessary precision of definition in the electron’s position need not lead to an unacceptably large uncertainty in its momentum. For example, the electric field associated with visible light varies appreciably only over a distance of order $10^3$ Å. If, however, the wavelength is very much shorter than this (for example, X rays), one must use quantum mechanics to describe the electronic motion induced by the field.

2.  There is also an implicit assumption in the Drude model that one can localize an electron to within substantially less than a mean free path $\ell$, and one should therefore be suspicious of classical arguments when mean free paths much shorter than tens of angstroms occur. Fortunately, as we shall see below, mean free paths in metals are of the order of 100 Å at room temperature, and become longer still as the temperature drops.

There  is  thus  a  wide  range  of  phenomena  in  which  the  behavior  of  a  metallic 
electron  is  well  described  by  classical  mechanics.  It  is  not,  however,  immediately 
evident  from  this  that  the  behavior  of  N  such  electrons  can  be  described  by  classical 
mechanics.  Since  the  Pauli  exclusion  principle  so  profoundly  affects  the  statistics  of 
N  electrons,  why  should  it  not  have  similarly  drastic  effects  on  their  dynamics?  That 
it  does  not  follows  from  an  elementary  theorem,  which  we  state  without  proof,  since 
the  proof,  though  simple,  is  notationally  rather  cumbersome: 

Consider a system of $N$ electrons whose interactions with one another are ignored, and which are exposed to an arbitrary space- and time-dependent electromagnetic field. Let the $N$-electron state at time 0 be formed by occupying a particular group of $N$ one-electron levels, $\psi_1(0), \ldots, \psi_N(0)$. Let $\psi_j(t)$ be the level $\psi_j(0)$ would evolve into in time $t$ under the influence of the electromagnetic field if there were only a single electron present, which was in the level $\psi_j(0)$ at time zero. Then the correct $N$-electron state at time $t$ will be the one formed by occupying the set of $N$ one-electron levels $\psi_1(t), \ldots, \psi_N(t)$.

Thus the dynamical behavior of $N$ noninteracting electrons is completely determined by considering $N$ independent one-electron problems. In particular, if the classical approximation is valid for each of these one-electron problems, it will also be valid for the whole $N$-electron system.²⁹

The use of Fermi-Dirac statistics affects only those predictions of the Drude model that require some knowledge of the electronic velocity distribution for their evaluation. If the rate $1/\tau$ at which an electron experiences collisions does not depend on its energy, then only our estimate of the electronic mean free path and our calculation of the thermal conductivity and thermopower are at all affected by a change in the equilibrium distribution function.


29  Note that this implies that any classical configuration consistent with the exclusion principle at time $t = 0$ (i.e., having less than one electron of each spin per unit volume, in any momentum space region of volume $d\mathbf{p} = (2\pi\hbar)^3/V$) will remain consistent with the exclusion principle at all future times. This result can also be proved by purely classical reasoning as a direct corollary of Liouville’s theorem. See Chapter 12.
Mean Free Path  Using $v_F$ (Eq. (2.24)) as a measure of the typical electronic speed, we can evaluate the mean free path $\ell = v_F\tau$ from Eq. (1.8) as follows:

$$\ell = \frac{(r_s/a_0)^2}{\rho_\mu} \times 92\ \text{Å}. \tag{2.91}$$

Since the resistivity in microhm centimeters, $\rho_\mu$, is typically 1 to 100 at room temperature, and since $r_s/a_0$ is typically 2 to 6, mean free paths of order a hundred angstroms long are possible even at room temperature.³⁰
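The numerical factor of 92 Å in (2.91) can be recovered from fundamental constants. The sketch below assumes the standard free electron relations from earlier in the book, which are not restated in this section: the Drude relaxation time $\tau = m/\rho n e^2$, the Fermi velocity $v_F = \hbar k_F/m$, $k_F = (9\pi/4)^{1/3}/r_s$, and $n = 3/4\pi r_s^3$. SI units are used internally, and the function name is hypothetical.

```python
import math

HBAR = 1.054571817e-34       # reduced Planck constant, J s
E_CHARGE = 1.602176634e-19   # elementary charge, C
A0 = 5.29177210903e-11       # Bohr radius, m

def mean_free_path_angstrom(rs_over_a0, rho_microhm_cm):
    """Mean free path l = v_F * tau in angstroms, cf. Eq. (2.91)."""
    r_s = rs_over_a0 * A0
    rho = rho_microhm_cm * 1.0e-8                      # microhm cm -> ohm m
    n = 3.0 / (4.0 * math.pi * r_s**3)                 # density from r_s
    k_f = (9.0 * math.pi / 4.0) ** (1.0 / 3.0) / r_s   # Fermi wave vector
    # v_F = hbar k_F / m and tau = m / (rho n e^2); the mass cancels in l.
    ell = HBAR * k_f / (rho * n * E_CHARGE**2)
    return ell * 1.0e10                                # metres -> angstroms

if __name__ == "__main__":
    print(f"prefactor: {mean_free_path_angstrom(1.0, 1.0):.1f} angstroms")
    for rs, rho in ((2.0, 100.0), (4.0, 10.0), (6.0, 1.0)):
        print(f"r_s/a_0 = {rs}, rho = {rho} microhm cm -> "
              f"{mean_free_path_angstrom(rs, rho):.0f} angstroms")
```

The loop spans the ranges of $r_s/a_0$ and $\rho_\mu$ quoted in the text, so the printed values bracket the mean free paths one should expect.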


Thermal Conductivity  We continue to estimate the thermal conductivity by Eq. (1.51):

$$\kappa = \tfrac{1}{3} v^2 \tau c_v. \tag{2.92}$$

The correct specific heat (2.81) is smaller than the classical guess of Drude by a factor of order $k_BT/\mathcal{E}_F$; the correct estimate of $v^2$ is not the classical thermal mean square velocity of order $k_BT/m$, but $v_F^2 = 2\mathcal{E}_F/m$, which is larger than the classical value by a factor of order $\mathcal{E}_F/k_BT$. Inserting these values in (2.92) and eliminating the relaxation time in favor of the conductivity through (1.6), we find

$$\frac{\kappa}{\sigma T} = \frac{\pi^2}{3}\left(\frac{k_B}{e}\right)^2 = 2.44 \times 10^{-8}\ \text{watt-ohm/K}^2. \tag{2.93}$$

This is remarkably close to Drude’s fortuitously good value, thanks to the two compensating corrections of order $k_BT/\mathcal{E}_F$, and in excellent agreement with the data in Table 1.6. We shall see (Chapter 13) that this value of the Lorenz number is a far better one than the very rough derivation of (2.93) would suggest.


Thermopower  Drude’s overestimate of the thermopower is also resolved by the use of Fermi-Dirac statistics. Substituting the specific heat, from Eq. (2.81), into Eq. (1.59) we find

$$Q = -\frac{\pi^2}{6}\, \frac{k_B}{e}\left(\frac{k_BT}{\mathcal{E}_F}\right) = -1.42\left(\frac{k_BT}{\mathcal{E}_F}\right) \times 10^{-4}\ \text{volt/K}, \tag{2.94}$$

which is smaller than Drude’s estimate (Eq. (1.60)) by $O(k_BT/\mathcal{E}_F) \sim 0.01$ at room temperature.
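The numerical constants in (2.93) and (2.94) follow directly from $k_B/e$. A short check, using SI values of the constants; the variable and function names are chosen here for illustration:

```python
import math

K_B = 1.380649e-23           # Boltzmann constant, J/K
E_CHARGE = 1.602176634e-19   # elementary charge, C

k_over_e = K_B / E_CHARGE    # volts per kelvin

# Lorenz number kappa / (sigma T) of Eq. (2.93), in watt-ohm/K^2
lorenz_number = (math.pi**2 / 3.0) * k_over_e**2

# Magnitude of the coefficient of (k_B T / E_F) in Eq. (2.94), in volt/K
thermopower_prefactor = (math.pi**2 / 6.0) * k_over_e

def thermopower(kT_over_EF):
    """Free electron thermopower Q of Eq. (2.94), in volt/K."""
    return -thermopower_prefactor * kT_over_EF

if __name__ == "__main__":
    print(f"Lorenz number         : {lorenz_number:.3e} watt-ohm/K^2")
    print(f"thermopower prefactor : {thermopower_prefactor:.3e} volt/K")
    print(f"Q at k_B T/E_F = 0.01 : {thermopower(0.01):.3e} volt/K")
```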

Other  Properties  Since  the  form  of  the  electronic  velocity  distribution  did  not  play 
a  role  in  the  calculation  of  the  DC  or  AC  conductivities,  the  Hall  coefficient,  or  the 
magnetoresistance,  the  estimates  given  in  Chapter  1  remain  the  same  whether  one 
uses  Maxwell-Boltzmann  or  Fermi-Dirac  statistics. 

This is not the case, however, if one uses an energy-dependent relaxation time. If, for example, one thought the electrons collided with fixed scattering centers, then it would be natural to take an energy-independent mean free path, and hence a relaxation time $\tau = \ell/v \sim \ell/\mathcal{E}^{1/2}$. Shortly after Drude set forth the electron gas model of a


30  It is perhaps just as well that Drude estimated $\ell$ using the very much lower classical thermal velocity, or he might have been sufficiently baffled by such long mean free paths to abandon further investigation.
metal, H. A. Lorentz showed, using the classical Maxwell-Boltzmann velocity distribution, that an energy-dependent relaxation time would lead to temperature dependence in the DC and AC conductivities, as well as to a nonvanishing magnetoresistance and a field- and temperature-dependent Hall coefficient. As one might now expect from the inappropriateness of the classical velocity distribution, none of these corrections were in any way able to bring the discrepancies of the Drude model into better alignment with the observed facts about metals.³¹ Furthermore, we shall see (Chapter 13) that when the correct Fermi-Dirac velocity distribution is used, adding an energy dependence to the relaxation time has little significant effect on most of the quantities of interest in a metal.³² If one calculates the DC or AC conductivities, the magnetoresistance, or the Hall coefficient assuming an energy-dependent $\tau(\mathcal{E})$, the results one finds are the same as those one would have calculated assuming an energy-independent $\tau$, equal to $\tau(\mathcal{E}_F)$. In metals these quantities are determined almost entirely by the way in which electrons near the Fermi level are scattered.³³ This is another very important consequence of the Pauli exclusion principle, the justification of which will be given in Chapter 13.


PROBLEMS 


1.  The Free and Independent Electron Gas in Two Dimensions

(a)  What is the relation between $n$ and $k_F$ in two dimensions?

(b)  What is the relation between $k_F$ and $r_s$ in two dimensions?

(c)  Prove that in two dimensions the free electron density of levels $g(\mathcal{E})$ is a constant independent of $\mathcal{E}$ for $\mathcal{E} > 0$, and 0 for $\mathcal{E} < 0$. What is the constant?

(d)  Show that because $g(\mathcal{E})$ is constant, every term in the Sommerfeld expansion for $n$ vanishes except the $T = 0$ term. Deduce that $\mu = \mathcal{E}_F$ at any temperature.

(e)  Deduce from (2.67) that when $g(\mathcal{E})$ is as in (c), then

$$\mu + k_BT \ln\left(1 + e^{-\mu/k_BT}\right) = \mathcal{E}_F. \tag{2.95}$$

(f)  Estimate from (2.95) the amount by which $\mu$ differs from $\mathcal{E}_F$. Comment on the numerical significance of this “failure” of the Sommerfeld expansion, and on the mathematical reason for the “failure.”


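2.  Thermodynamics of the Free and Independent Electron Gas

(a)  Deduce from the thermodynamic identities

$$c_v = \left(\frac{\partial u}{\partial T}\right)_n = T\left(\frac{\partial s}{\partial T}\right)_n, \tag{2.96}$$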


31  The  Lorentz  model  is,  however,  of  considerable  importance  in  the  description  of  semiconductors 
(Chapter  29). 

32  The  thermopower  is  a  notable  exception. 

33  These assertions are correct to leading order in $k_BT/\mathcal{E}_F$, but in metals this is always a good expansion parameter.
from Eqs. (2.56) and (2.57), and from the third law of thermodynamics ($s \to 0$ as $T \to 0$) that the entropy density, $s = S/V$, is given by:

$$s = -k_B \int \frac{d\mathbf{k}}{4\pi^3}\left[f \ln f + (1 - f)\ln(1 - f)\right], \tag{2.97}$$

where $f(\mathcal{E}(\mathbf{k}))$ is the Fermi function (Eq. (2.56)).

(b)  Since the pressure $P$ satisfies Eq. (B.5) in Appendix B, $P = -(u - Ts - \mu n)$, deduce from (2.97) that

$$P = k_BT \int \frac{d\mathbf{k}}{4\pi^3} \ln\left(1 + \exp\left[-\frac{(\hbar^2k^2/2m) - \mu}{k_BT}\right]\right). \tag{2.98}$$

Show that (2.98) implies that $P$ is a homogeneous function of $\mu$ and $T$ of degree 5/2; that is,

$$P(\lambda\mu, \lambda T) = \lambda^{5/2} P(\mu, T) \tag{2.99}$$

for any constant $\lambda$.

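(c)  Deduce from the thermodynamic relations in Appendix B that

$$\left(\frac{\partial P}{\partial \mu}\right)_T = n, \qquad \left(\frac{\partial P}{\partial T}\right)_\mu = s. \tag{2.100}$$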


(d)  By differentiating (2.99) with respect to $\lambda$ show that the ground-state relation (2.34) holds at any temperature, in the form

$$P = \tfrac{2}{3} u. \tag{2.101}$$


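(e)  Show that when $k_BT \ll \mathcal{E}_F$, the ratio of the constant-pressure to constant-volume specific heats satisfies

$$\left(\frac{c_p}{c_v}\right) - 1 = \frac{\pi^2}{3}\left(\frac{k_BT}{\mathcal{E}_F}\right)^2 + O\!\left(\frac{k_BT}{\mathcal{E}_F}\right)^4.$$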


(f )  Show,  by  retaining  further  terms  in  the  Sommerfeld  expansions  of  u  and  n,  that  correct 
to  order  T3  the  electronic  heat  capacity  is  given  by 


3.  The Classical Limit of Fermi-Dirac Statistics

The Fermi-Dirac distribution reduces to the Maxwell-Boltzmann distribution, provided that the Fermi function (2.56) is much less than unity for every positive $\mathcal{E}$, for in that case we must have

$$f(\mathcal{E}) \approx e^{-(\mathcal{E}-\mu)/k_BT}. \tag{2.103}$$

The necessary and sufficient condition for (2.103) to hold for all positive $\mathcal{E}$ is

$$e^{-\mu/k_BT} \gg 1. \tag{2.104}$$

(a)  Assuming that (2.104) holds, show that

$$r_s = e^{-\mu/3k_BT}\, 3^{1/3} \pi^{1/6} \hbar\, (2mk_BT)^{-1/2}. \tag{2.105}$$

In conjunction with (2.104) this requires that

$$r_s \gg \left(\frac{\hbar^2}{2mk_BT}\right)^{1/2}, \tag{2.106}$$

which can also be taken as the condition for the validity of classical statistics.

(b)  What is the significance of the length $r_s$ must exceed?

(c)  Show that (2.106) leads to the numerical condition

$$\frac{r_s}{a_0} \gg \left(\frac{10^5\ \text{K}}{T}\right)^{1/2}. \tag{2.107}$$

(d)  Show that the normalization constant $m^3/4\pi^3\hbar^3$ appearing in the Fermi-Dirac velocity distribution (2.2) can also be written as $(3\sqrt{\pi}/4)\, n\, (m/2\pi k_BT_F)^{3/2}$ so that $f_B(0)/f(0) = (4/3\sqrt{\pi})(T_F/T)^{3/2}$.


4.  Insensitivity  of  the  Distribution  Function  to  Small  Changes  in  the  Total  Number 
of  Electrons 

In  deriving  the  Fermi  distribution  (page  41)  we  argued  that  the  probability  of  a  given  level  being 
occupied  should  not  change  appreciably  when  the  total  number  of  electrons  is  changed  by  one. 
Verify that the Fermi function (2.56) is compatible with this assumption as follows:

(a) Show, when $k_B T \ll \mathcal{E}_F$, that when the number of electrons changes by one at fixed temperature,
the chemical potential changes by

$$\Delta\mu = \frac{1}{V g(\mathcal{E}_F)}, \tag{2.108}$$

where $g(\mathcal{E})$ is the density of levels.

(b) Show, as a consequence of this, that the most the probability of any level being occupied
can change by is

$$\Delta f = \frac{1}{6} \frac{\mathcal{E}_F}{k_B T} \frac{1}{N}. \tag{2.109}$$

[Use the free electron evaluation (2.65) of $g(\mathcal{E}_F)$.] Although temperatures of millidegrees Kelvin
can be reached, at which $\mathcal{E}_F/k_B T \approx 10^8$, when $N$ is of order $10^{22}$ then $\Delta f$ is still negligibly small.
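
The closing estimate can be made concrete with a few lines of arithmetic. This is only a sketch: it takes the bound in the form $\Delta f = (1/6)(\mathcal{E}_F/k_B T)(1/N)$ and uses a ratio of order $10^8$ and an electron count of order $10^{22}$ as illustrative inputs, not measured values.

```python
def max_occupation_change(ef_over_kt, n_electrons):
    """Largest change in occupation probability when N changes by one.

    Evaluates (1/6) * (E_F / k_B T) / N, the bound quoted in (2.109).
    """
    return ef_over_kt / (6.0 * n_electrons)


# Illustrative inputs: millikelvin temperatures and a macroscopic sample.
delta_f = max_occupation_change(1.0e8, 1.0e22)
print(f"Delta f is about {delta_f:.1e}")
```

Even with the temperature pushed down to millidegrees, the result is of order $10^{-15}$, which is what "negligibly small" means here.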
Free electron theory successfully accounts for a wide range of metallic properties. In
the form originally put forth by Drude the most striking deficiencies of the model
were due to the use of classical statistical mechanics in describing the conduction
electrons. As a result, predicted thermoelectric fields and heat capacities were hundreds
of times too large, even at room temperature. The difficulty was obscured by
the fact that classical statistics fortuitously gave a form for the Wiedemann-Franz
law that was not in such gross error. Sommerfeld’s application of Fermi-Dirac statistics
to the conduction electrons eliminated this class of difficulties while retaining
all of the other basic assumptions of the free electron model.

However, the Sommerfeld free electron model still makes many quantitative predictions
that are quite unambiguously contradicted by observation, and leaves many
fundamental  questions  of  principle  unresolved.  We  list  below  those  inadequacies  of 
the  free  electron  model  that  have  emerged  from  the  applications  made  in  the  preceding 
two  chapters.1 


DIFFICULTIES  WITH  THE  FREE  ELECTRON  MODEL 

1.  Inadequacies  in  the  Free  Electron  Transport  Coefficients 

(a) The Hall Coefficient Free electron theory predicts a Hall coefficient which
at metallic densities of electrons has the constant value $R_H = -1/nec$, independent
of the temperature, the relaxation time, or the strength of the magnetic
field. Although observed Hall coefficients have this order of magnitude,
generally speaking they depend on both the magnetic field strength and the
temperature (and presumably on the relaxation time, which is rather harder
to control experimentally). Often this dependence is quite dramatic. In aluminum,
for example, $R_H$ (see Figure 1.4) never gets within a factor of three of
the free electron value, depends strongly on the strength of the field, and at
high fields does not even have the sign predicted by free electron theory.
Such cases are not atypical. Only the Hall coefficients of the alkali metals
come even close to behaving in accordance with the predictions of free electron
theory.

(b)  The  Magnetoresistance  Free  electron  theory  predicts  that  the  resistance  of 
a  wire  perpendicular  to  a  uniform  magnetic  field  should  not  depend  on  the 
strength  of  the  field.  In  almost  all  cases  it  does.  In  some  cases  (notably  the 
noble  metals,  copper,  silver,  and  gold)  it  can  be  made  to  increase  apparently 
without limit as the field increases. In most metals the behavior of the resistance
in a field depends quite drastically on the manner in which the metallic
specimen  is  prepared  and,  for  suitable  specimens,  on  the  orientation  of  the 
specimen  with  respect  to  the  field. 


1  These  examples  and  the  remarks  making  up  the  rest  of  this  brief  chapter  are  not  intended  to  give 
a  detailed  picture  of  the  limitations  of  the  free  electron  model.  That  will  emerge  in  the  chapters  that  follow, 
together  with  the  solutions  to  the  difficulties  posed  by  the  model.  Our  purpose  in  this  chapter  is  only  to 
emphasize  how  varied  and  extensive  the  shortcomings  are,  thereby  indicating  why  one  must  resort  to  a 
considerably  more  elaborate  analysis. 
(c) The Thermoelectric Field The sign of the thermoelectric field, like the sign
of the Hall constant, is not always what free electron theory predicts it should
be. Only the order of magnitude is right.

(d) The Wiedemann-Franz Law That great triumph of free electron theory, the
Wiedemann-Franz law, is obeyed beautifully at high (room) temperatures
and also quite probably at very low (a few degrees K) temperatures. At intermediate
temperatures it fails, and $\kappa/\sigma T$ depends on the temperature.

(e) Temperature Dependence of the DC Electrical Conductivity Nothing in free
electron theory can account for the temperature dependence of the DC conductivity
(revealed, for example, in Table 1.2). It has to be mechanically
inserted into the theory as an ad hoc temperature dependence in the relaxation
time $\tau$.

(f)  Directional  Dependence  of  the  DC  Electrical  Conductivity  In  some  (but  by 
no  means  all)  metals  the  DC  conductivity  depends  on  the  orientation  of  the 
specimen  (if  suitably  prepared)  with  respect  to  the  field.  In  such  specimens  the 
current  j  need  not  even  be  parallel  to  the  field. 

(g)  AC  Conductivity  There  is  a  far  more  subtle  frequency  dependence  to  the 
optical  properties  of  metals  than  the  simple  free  electron  dielectric  constant 
can  hope  to  produce.  Even  sodium,  in  other  respects  a  fairly  good  free  electron 
metal,  appears  to  fail  this  test  in  the  detailed  frequency  dependence  of  its 
reflectivity.  In  other  metals  the  situation  is  far  worse.  We  cannot  begin  to 
explain  the  colors  of  copper  and  gold  in  terms  of  reflectivities  calculated  from 
the  free  electron  dielectric  constant. 

2.  Inadequacies  in  the  Static  Thermodynamic  Predictions 

(a) Linear Term in the Specific Heat The Sommerfeld theory accounts reasonably
well for the size of the term linear in T in the low-temperature specific
heat  of  the  alkali  metals,  rather  less  well  for  the  noble  metals,  and  very 
poorly  indeed  for  transition  metals  such  as  iron  and  manganese  (much  too 
small  a  prediction)  as  well  as  for  bismuth  and  antimony  (much  too  large  a 
prediction). 

(b)  Cubic  Term  in  the  Specific  Heat  There  is  nothing  in  the  free  electron  model 
to  explain  why  the  low-temperature  specific  heat  should  be  dominated  by 
the  electronic  contribution.  However,  it  is  evident  from  experiment  that  the 
$T^3$ correction to the linear term is very definitely dominated by something
else, since the simple Sommerfeld theory for the electronic contribution to the
$T^3$ term has the wrong sign and is millions of times too small.

(c) The Compressibility of Metals Although free electron theory does miraculously
well in estimating the bulk moduli (or compressibilities) of many metals,
it  is  clear  that  more  attention  must  be  paid  to  the  ions  and  to  electron- 
electron  interactions  if  one  is  to  achieve  a  more  accurate  estimate  of  the 
equation  of  state  of  a  metal. 

3.  Fundamental  Mysteries 

(a) What Determines the Number of Conduction Electrons? We have assumed
that all valence electrons become conduction electrons, while the others
remain bound to the ions. We have given no thought to the question of why
this should be, or how it is to be interpreted in the case of elements, like
iron, that display more than one chemical valence.

(b) Why Are Some Elements Nonmetals? A more acute inadequacy of our rule
of  thumb  for  determining  the  number  of  conduction  electrons  is  posed  by  the 
existence  of  insulators.  Why,  for  example,  is  boron  an  insulator  while  its 
vertical  neighbor  in  the  periodic  table,  aluminum,  an  excellent  metal?  Why 
is  carbon  an  insulator  when  in  the  form  of  diamond  and  a  conductor  when 
in  the  form  of  graphite?  Why  are  bismuth  and  antimony  such  very  poor 
conductors? 

REVIEW  OF  BASIC  ASSUMPTIONS 

To  make  further  progress  with  any  of  these  problems  we  must  reexamine  the  basic 
assumptions  on  which  free  electron  theory  rests.  The  most  notable  are  these: 

1. Free Electron Approximation2 The metallic ions play a very minor role. In between
collisions they have no effect at all on the motion of an electron, and though
Drude invoked them as a source of collisions, the quantitative information we
have been able to extract about the collision rate has made no sense when interpreted
in terms of electrons colliding with fixed ions. The only thing the ions
really  seem  to  do  properly  in  the  models  of  Drude  and  Sommerfeld  is  to  maintain 
overall  charge  neutrality. 

2.  Independent  Electron  Approximation3  The  interactions  of  the  electrons  with  one 
another  are  ignored. 

3.  Relaxation-Time  Approximation4  The  outcome  of  a  collision  is  assumed  not  to 
depend  on  the  configuration  of  the  electrons  at  the  moment  of  collision. 

All  these  oversimplifications  must  be  abandoned  if  we  are  to  achieve  an  accurate 
model  of  a  solid.  However,  a  remarkable  amount  of  progress  can  be  made  by  first 
concentrating  entirely  on  improving  some  aspects  of  the  free  electron  approximation 
while  continuing  to  use  the  independent  electron  and  relaxation  time  approximations. 
We  shall  return  to  a  critical  examination  of  these  last  two  approximations  in  Chapters 
16  and  17,  limiting  ourselves  here  to  the  following  general  observations: 

There  is  a  surprisingly  wide  range  of  circumstances  in  which  the  independent 
electron  approximation  does  not  drastically  diminish  the  validity  of  the  analysis.  In 
resolving the problems of free electron theory listed above, improving on the independent
electron approximation plays a major role only in the calculation of metallic
compressibilities (2c).5,6 An indication of why we apparently ignore electron-electron
interactions  is  given  in  Chapter  17,  together  with  further  examples  in  which  electron- 
electron  interactions  do  play  a  direct  and  crucial  role. 


2  See  page  4. 

3  See  page  4. 

4  See  page  6. 

5  Numbers  in  parentheses  refer  to  numbered  paragraphs  at  the  beginning  of  this  chapter. 

6 There are also some cases where a failure of the independent electron approximation (Chapter 10,
p. 186 and Chapter 32) invalidates the simple distinction between metals and insulators that we shall
draw  in  Chapters  8  and  12. 
As for the relaxation time approximation, even in Drude’s time there were methods
in kinetic theory for correcting this oversimplification. They lead to a much more
complex analysis and in many cases are primarily important in understanding metallic
phenomena with greater precision. Of the difficulties described previously, only the
problem of the Wiedemann-Franz law at intermediate temperatures (1d) has a resolution
that requires abandoning the relaxation time approximation even at the most
gross qualitative level of explanation.7 In Chapter 16 we shall describe the form a
theory  must  take  if  it  is  to  go  beyond  the  relaxation  time  approximation,  together  with 
further  examples  of  problems  requiring  such  a  theory  for  their  resolution. 

The  free  electron  approximation  is  the  major  source  of  the  difficulties  in  the  theories 
of  Drude  and  Sommerfeld.  It  makes  several  simplifications: 

(i)  The  effect  of  the  ions  on  the  dynamics  of  an  electron  between  collisions  is  ignored. 

(ii)  What  role  the  ions  play  as  a  source  of  collisions  is  left  unspecified. 

(iii)  The  possibility  that  the  ions  themselves,  as  independent  dynamical  entities, 
contribute to physical phenomena (such as the specific heat or thermal conductivity)
is ignored.

The failures of assumptions (ii) and (iii) play an essential role in accounting for
deviations from the Wiedemann-Franz law at intermediate temperatures (1d) and the
temperature dependence of the electrical conductivity (1e). The failure of assumption
(iii) accounts for the cubic term in the specific heat (2b). Relaxing these two assumptions
is also essential in accounting for a variety of phenomena yet to be discussed.
Such  phenomena  are  briefly  described  in  Chapter  21,  and  the  consequences  of 
abandoning  assumptions  (ii)  and  (iii)  are  explored  in  detail  in  Chapters  22  to  26. 

It  is  assumption  (i),  that  the  ions  have  no  significant  effect  on  the  motion  of  electrons 
between  collisions,  that  is  responsible  for  most  of  the  deficiencies  of  the  Drude  and 
Sommerfeld  theories  described  above.  The  reader  may  well  be  perplexed  at  how  one 
can  distinguish  between  assumptions  (i)  and  (ii),  for  it  is  far  from  clear  that  the  effect 
of  the  ions  on  the  electrons  can  be  unambiguously  resolved  into  “collisional”  and 
“noncollisional”  aspects.  We  shall  find,  however  (especially  in  Chapters  8  and  12), 
that  a  theory  that  takes  into  account  the  detailed  field  produced  by  an  appropriate 
static array of ions but ignores the possibility of ionic motion (the “static ion approximation”)
reduces under a wide range of circumstances to a relatively simple modification
of the Drude and Sommerfeld free electron theories, in which collisions are
entirely  absent!  It  is  only  when  one  allows  for  ionic  motion  that  their  role  as  a  source 
of  collisions  can  be  properly  understood. 

We  shall  therefore  relax  the  free  electron  approximation  in  two  stages.  First  we 
shall  examine  the  wealth  of  new  structure  and  the  subsequent  elucidation  that  emerges 
when  the  electrons  are  considered  to  move  not  in  empty  space,  but  in  the  presence  of  a 
specified  static  potential  due  to  a  fixed  array  of  stationary  ions.  Only  after  that  (from 
Chapter  21  onward)  will  we  examine  the  consequences  of  the  dynamical  deviations  of 
the  ionic  positions  from  that  static  array. 

The single most important fact about the ions is that they are not distributed at
random, but are arranged in a regular periodic array, or “lattice.” This was first
suggested by the macroscopic crystalline forms assumed by many solids (including
metals), first directly confirmed by X-ray diffraction experiments (Chapter 6) and
subsequently reconfirmed by neutron diffraction, electron microscopy, and many
other direct measurements.


7 It must also be abandoned to explain the detailed temperature dependence of the DC conductivity
(1e).

The  existence  of  a  periodic  lattice  of  ions  lies  at  the  heart  of  modern  solid  state 
physics.  It  provides  the  basis  for  the  entire  analytic  framework  of  the  subject,  and 
without  it  comparatively  little  progress  would  have  been  made.  If  there  is  one 
reason  why  the  theory  of  solids  is  so  much  more  highly  developed  than  the  theory 
of  liquids,  even  though  both  forms  of  matter  have  comparable  densities,  it  is  that 
the  ions  are  arranged  periodically  in  the  solid  state  but  are  spatially  disordered  in 
liquids.  It  is  the  lack  of  a  periodic  array  of  ions  that  has  left  the  subject  of  amorphous 
solids  in  so  primitive  a  state  compared  with  the  highly  developed  theory  of  crystalline 
solids.8 

To  make  further  progress  in  the  theory  of  solids,  whether  metallic  or  insulating, 
we  must  therefore  turn  to  the  subject  of  periodic  arrays.  The  fundamental  properties 
of  such  arrays  are  developed  in  Chapters  4,  5,  and  7,  without  regard  to  particular 
physical  applications.  In  Chapter  6  these  concepts  are  applied  to  an  elementary 
discussion of X-ray diffraction, which provides a direct demonstration of the periodicity
of solids and is a paradigm for the wide variety of other wave phenomena in
solids we shall subsequently encounter. Chapters 8 to 11 explore the direct consequences
of the periodicity of the array of ions on the electronic structure of any solid,
whether insulating or metallic. In Chapters 12 to 15 the resulting theory is used to
reexplore the properties of metals described in Chapters 1 and 2. Many of the anomalies
of free electron theory are thereby removed, and its mysteries are in large part
resolved. 


8 Although there has been a great burst of interest in amorphous solids (starting in the late 1960s),
the  subject  has  yet  to  develop  any  unifying  principles  of  a  power  even  remotely  comparable  to  that  provided 
by  the  consequences  of  a  periodic  array  of  ions.  Many  of  the  concepts  used  in  the  theory  of  amorphous 
solids  are  borrowed,  with  little  if  any  justification,  from  the  theory  of  crystalline  solids,  even  though  they 
are  only  well  understood  as  consequences  of  lattice  periodicity.  Indeed,  the  term  “solid  state  physics,”  if 
defined  as  the  subject  matter  of  solid  state  physics  textbooks  (including  this  one)  is  currently  confined 
almost  entirely  to  the  theory  of  crystalline  solids.  This  is  in  large  part  because  the  normal  condition  of 
solid  matter  is  crystalline,  and  also  because  in  its  present  form  the  subject  of  amorphous  solids  still  lacks 
the  kind  of  broad  basic  principles  suitable  for  inclusion  in  an  elementary  text. 
Those who have not wandered amidst the mineralogical departments of natural
history museums are often surprised to learn that metals, like most other solids, are
crystalline, for although one is used to the very obvious crystalline features of quartz,
diamond, and rock salt, the characteristic plane faces at sharp angles with one another
are absent from metals in their most commonly encountered forms. However, those
metals that occur naturally in the metallic state are quite often found in crystalline
forms, which are completely disguised in finished metal products by the great malleability
of metals, which permits them to be fashioned into whatever macroscopic
shape one wishes.

The  true  test  of  crystallinity  is  not  the  superficial  appearance  of  a  large  specimen, 
but  whether  on  the  microscopic  level  the  ions  are  arranged  in  a  periodic  array.1 
This  underlying  microscopic  regularity  of  crystalline  matter  was  long  hypothesized 
as  the  obvious  way  to  account  for  the  simple  geometric  regularities  of  macroscopic 
crystals,  in  which  plane  faces  make  only  certain  definite  angles  with  each  other.  It 
received  direct  experimental  confirmation  in  1913  through  the  work  of  W.  and 
L. Bragg, who founded the subject of X-ray crystallography and began the investigation
of how atoms are arranged in solids.

Before  we  describe  how  the  microscopic  structure  of  solids  is  determined  by  X-ray 
diffraction  and  how  the  periodic  structures  so  revealed  affect  fundamental  physical 
properties,  it  is  useful  to  survey  some  of  the  most  important  geometrical  properties 
of periodic arrays in three-dimensional space. These purely geometrical considerations
are implicit in almost all the analysis one encounters throughout solid state
physics,  and  shall  be  pursued  in  this  chapter  and  in  Chapters  5  and  7.  The  first  of 
many  applications  of  these  concepts  will  be  made  to  X-ray  diffraction  in  Chapter  6. 


BRAVAIS LATTICE

A  fundamental  concept  in  the  description  of  any  crystalline  solid  is  that  of  the  Bravais 
lattice,  which  specifies  the  periodic  array  in  which  the  repeated  units  of  the  crystal 
are  arranged.  The  units  themselves  may  be  single  atoms,  groups  of  atoms,  molecules, 
ions,  etc.,  but  the  Bravais  lattice  summarizes  only  the  geometry  of  the  underlying 
periodic  structure,  regardless  of  what  the  actual  units  may  be.  We  give  two  equivalent 
definitions  of  a  Bravais  lattice2 : 

(a)  A  Bravais  lattice  is  an  infinite  array  of  discrete  points  with  an  arrangement  and 
orientation  that  appears  exactly  the  same,  from  whichever  of  the  points  the 
array  is  viewed. 

(b)  A  (three-dimensional)  Bravais  lattice  consists  of  all  points  with  position  vectors 
R  of  the  form 

$$\mathbf{R} = n_1 \mathbf{a}_1 + n_2 \mathbf{a}_2 + n_3 \mathbf{a}_3, \tag{4.1}$$


1 Often a specimen is made up of many small pieces, each large on the microscopic scale and containing
large numbers of periodically arranged ions. This “polycrystalline” state is more commonly
encountered  than  a  single  macroscopic  crystal,  in  which  the  periodicity  is  perfect,  extending  through  the 
entire  specimen. 

2  Why  the  name  Bravais  appears  is  explained  in  Chapter  7. 
where $\mathbf{a}_1$, $\mathbf{a}_2$, and $\mathbf{a}_3$ are any three vectors not all in the same plane, and $n_1$,
$n_2$, and $n_3$ range through all integral values.3 Thus the point $\sum n_i \mathbf{a}_i$ is reached by
moving $n_i$ steps4 of length $a_i$ in the direction of $\mathbf{a}_i$ for $i = 1$, 2, and 3.

The vectors $\mathbf{a}_i$ appearing in definition (b) of a Bravais lattice are called primitive
vectors  and  are  said  to  generate  or  span  the  lattice. 

It takes some thought to see that the two definitions of a Bravais lattice are equivalent.
That any array satisfying (b) also satisfies (a) becomes evident as soon as both definitions
are understood. The argument that any array satisfying definition (a) can
be  generated  by  an  appropriate  set  of  three  vectors  is  not  as  obvious.  The  proof 
consists  of  an  explicit  recipe  for  constructing  three  primitive  vectors.  The  construction 
is  given  in  Problem  8a. 

Figure 4.1

A general two-dimensional Bravais lattice of no
particular symmetry: the oblique net. Primitive
vectors $\mathbf{a}_1$ and $\mathbf{a}_2$ are shown. All points in the net are
linear combinations of these with integral coefficients;
for example, $\mathbf{P} = \mathbf{a}_1 + 2\mathbf{a}_2$, and $\mathbf{Q} = -\mathbf{a}_1 + \mathbf{a}_2$.

Figure 4.1 shows a portion of a two-dimensional Bravais lattice.5 Clearly definition
(a) is satisfied, and the primitive vectors $\mathbf{a}_1$ and $\mathbf{a}_2$ required by definition (b) are
indicated  in  the  figure.  Figure  4.2  shows  one  of  the  most  familiar  of  three-dimensional 
Bravais  lattices,  the  simple  cubic.  It  owes  its  special  structure  to  the  fact  that  it  can 
be  spanned  by  three  mutually  perpendicular  primitive  vectors  of  equal  length. 

Figure  4.2 

A  simple  cubic  three-dimensional  Bravais  lattice.  The  three 
primitive  vectors  can  be  taken  to  be  mutually  perpendicular, 
and  with  a  common  magnitude. 


3 We continue with the convention that “integer” means a negative integer or zero as well as a
positive integer.

4 When $n$ is negative, $n$ steps in a direction means $-n$ steps in the opposite direction. The point reached
does not, of course, depend on the order in which the $n_1 + n_2 + n_3$ steps are taken.

5 A two-dimensional Bravais lattice is also known as a net.
Figure 4.3

The vertices of a two-dimensional honeycomb
do not form a Bravais lattice. The array of
points has the same appearance whether viewed
from point P or point Q. However, the view
from point R is rotated through 180°.


It  is  important  that  not  only  the  arrangement,  but  also  the  orientation  must 
appear  the  same  from  every  point  in  a  Bravais  lattice.  Consider  the  vertices  of  a 
two-dimensional  honeycomb  (Figure  4.3).  The  array  of  points  looks  the  same  when 
viewed  from  adjacent  points  only  if  the  page  is  rotated  through  180°  each  time  one 
moves  from  one  point  to  the  next.  Structural  relations  are  clearly  identical,  but  not 
orientational  relations,  so  the  vertices  of  a  honeycomb  do  not  form  a  Bravais  lattice. 
A  case  of  more  practical  interest,  satisfying  the  structural  but  not  the  orientational 
requirements  of  definition  (a),  is  the  three-dimensional  hexagonal  close-packed  lattice, 
described  below. 
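
The honeycomb argument can be checked numerically. The sketch below builds a patch of honeycomb vertices with unit bond length and compares the "view" (the set of displacement vectors to nearby points) from three vertices. The coordinates, the cutoff radius, and the function names are choices made for this illustration. The three viewpoints correspond to P, Q, and R of Figure 4.3 only in the sense that the first two have the same orientation and the third is an adjacent vertex.

```python
import math


def honeycomb_patch(size):
    """Vertices of a honeycomb with unit bond length (two vertices per cell)."""
    a1 = (math.sqrt(3.0), 0.0)
    a2 = (math.sqrt(3.0) / 2.0, 1.5)
    delta = (0.0, 1.0)  # offset of the second vertex in each cell
    points = []
    for n1 in range(-size, size + 1):
        for n2 in range(-size, size + 1):
            x = n1 * a1[0] + n2 * a2[0]
            y = n1 * a1[1] + n2 * a2[1]
            points.append((x, y))
            points.append((x + delta[0], y + delta[1]))
    return points


def view_from(origin, points, radius):
    """Rounded displacement vectors from origin to all other points within radius."""
    seen = set()
    for (x, y) in points:
        dx, dy = x - origin[0], y - origin[1]
        if 0.0 < math.hypot(dx, dy) <= radius:
            seen.add((round(dx, 6), round(dy, 6)))
    return seen


points = honeycomb_patch(6)
view_p = view_from((0.0, 0.0), points, 2.5)             # a vertex
view_q = view_from((math.sqrt(3.0), 0.0), points, 2.5)  # same orientation
view_r = view_from((0.0, 1.0), points, 2.5)             # adjacent vertex
rotated_p = {(round(-dx, 6), round(-dy, 6)) for dx, dy in view_p}

same_orientation = view_p == view_q
adjacent_differs = view_p != view_r
related_by_180 = rotated_p == view_r
print(same_orientation, adjacent_differs, related_by_180)
```

All three flags come out true: the view from an adjacent vertex matches the original only after a rotation through 180°, so the arrangement is the same but the orientation is not.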


INFINITE  LATTICES  AND  FINITE  CRYSTALS 

Since  all  points  are  equivalent,  the  Bravais  lattice  must  be  infinite  in  extent.  Actual 
crystals  are,  of  course,  finite,  but  if  they  are  large  enough  the  vast  majority  of  points 
will  be  so  far  from  the  surface  as  to  be  unaffected  by  its  existence.  The  fiction  of  an 
infinite  system  is  thus  a  very  useful  idealization.  If  surface  effects  are  of  interest  the 
notion  of  a  Bravais  lattice  is  still  relevant,  but  now  one  must  think  of  the  physical 
crystal  as  filling  up  only  a  finite  portion  of  the  ideal  Bravais  lattice. 

Frequently one considers finite crystals, not because surface effects are important,
but simply for conceptual convenience, just as in Chapter 2 we placed the electron
gas in a cubical box of volume $V = L^3$. One then generally picks the finite region
of the Bravais lattice to have the simplest possible form. Given three primitive
vectors $\mathbf{a}_1$, $\mathbf{a}_2$, and $\mathbf{a}_3$, one usually considers the finite lattice of $N$ sites to be the set
of points of the form $\mathbf{R} = n_1\mathbf{a}_1 + n_2\mathbf{a}_2 + n_3\mathbf{a}_3$, where $0 \le n_1 < N_1$, $0 \le n_2 < N_2$,
$0 \le n_3 < N_3$, and $N = N_1 N_2 N_3$. This artifact is closely connected with the generalization
to the description of crystalline systems6 of the periodic boundary condition
we used in Chapter 2.
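
The finite lattice described above is easy to enumerate directly. The following sketch lists the sites $n_1\mathbf{a}_1 + n_2\mathbf{a}_2 + n_3\mathbf{a}_3$ with each $n_i$ running from 0 up to, but not including, $N_i$. The function name and the choice of simple cubic primitive vectors with unit lattice constant are assumptions made for the example.

```python
from itertools import product


def finite_lattice(a1, a2, a3, n1_max, n2_max, n3_max):
    """Sites n1*a1 + n2*a2 + n3*a3 with 0 <= n_i < N_i."""
    sites = []
    for n1, n2, n3 in product(range(n1_max), range(n2_max), range(n3_max)):
        site = tuple(n1 * a1[k] + n2 * a2[k] + n3 * a3[k] for k in range(3))
        sites.append(site)
    return sites


# Simple cubic primitive vectors with lattice constant 1 (illustrative).
sites = finite_lattice((1, 0, 0), (0, 1, 0), (0, 0, 1), 2, 3, 4)
print(len(sites))  # N = N1 * N2 * N3 = 24
```

Any three non-coplanar primitive vectors can be passed in place of the cubic ones. The count of sites is always the product $N_1 N_2 N_3$.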

FURTHER  ILLUSTRATIONS  AND  IMPORTANT  EXAMPLES 

Of  the  two  definitions  of  a  Bravais  lattice,  definition  (b)  is  mathematically  more 
precise  and  is  the  obvious  starting  point  for  any  analytic  work.  It  has,  however,  two 


6 We shall make particular use of it in Chapters 8 and 22.
minor shortcomings. First, for any given Bravais lattice the set of primitive vectors
is not unique; indeed, there are infinitely many nonequivalent choices (see Figure
4.4), and it is distasteful (and sometimes misleading) to rely too heavily on a definition
that emphasizes a particular choice. Second, when presented with a particular
array  of  points  one  usually  can  tell  at  a  glance  whether  the  first  definition  is  satisfied, 
although  the  existence  of  a  set  of  primitive  vectors  or  a  proof  that  there  is  no  such 
set  can  be  rather  more  difficult  to  perceive  immediately. 

Figure  4.4 

Several  possible  choices  of  pairs  of 
primitive vectors for a two-dimensional
Bravais lattice. They are
drawn,  for  clarity,  from  different 
origins. 


Consider,  for  example,  the  body-centered  cubic  (bcc)  lattice,  formed  by  adding  to 
the  simple  cubic  lattice  of  Figure  4.2  (whose  sites  we  now  label  A)  an  additional  point, 
B,  at  the  center  of  each  little  cube  (Figure  4.5).  One  might  at  first  feel  that  the  center 
points B bear a different relation to the whole than the corner points A. However,
the center points B can be thought of as corner points of a second simple cubic array.

Figure  4.5 

A  few  sites  from  a  body-centered  cubic  Bravais 
lattice.  Note  that  it  can  be  regarded  either  as  a  simple 
cubic  lattice  formed  from  the  points  A  with  the  points 
B  at  the  cube  centers,  or  as  a  simple  cubic  lattice 
formed  from  the  points  B  with  the  points  A  at  the 
cube  centers.  This  observation  establishes  that  it  is 
indeed  a  Bravais  lattice. 


In  this  new  array  the  corner  points  A  of  the  original  cubic  array  are  center  points. 
Thus  all  points  do  have  identical  surroundings,  and  the  body-centered  cubic  lattice 
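is a Bravais lattice. If the original simple cubic lattice is generated by primitive vectors

$$a\hat{\mathbf{x}}, \quad a\hat{\mathbf{y}}, \quad a\hat{\mathbf{z}}, \tag{4.2}$$

where $\hat{\mathbf{x}}$, $\hat{\mathbf{y}}$, and $\hat{\mathbf{z}}$ are three orthogonal unit vectors, then a set of primitive vectors
for the body-centered cubic lattice could be (Figure 4.6)

$$\mathbf{a}_1 = a\hat{\mathbf{x}}, \quad \mathbf{a}_2 = a\hat{\mathbf{y}}, \quad \mathbf{a}_3 = \frac{a}{2}(\hat{\mathbf{x}} + \hat{\mathbf{y}} + \hat{\mathbf{z}}). \tag{4.3}$$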


Figure  4.6 

Three  primitive  vectors,  specified  in  Eq.  (4.3), 
for  the  body-centered  cubic  Bravais  lattice.  The 
lattice is formed by taking all linear combinations
of the primitive vectors with integral
coefficients. The point P, for example, is
$\mathbf{P} = -\mathbf{a}_1 - \mathbf{a}_2 + 2\mathbf{a}_3$.


A  more  symmetric  set  (see  Figure  4.7)  is 

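$$\mathbf{a}_1 = \frac{a}{2}(\hat{\mathbf{y}} + \hat{\mathbf{z}} - \hat{\mathbf{x}}), \quad \mathbf{a}_2 = \frac{a}{2}(\hat{\mathbf{z}} + \hat{\mathbf{x}} - \hat{\mathbf{y}}), \quad \mathbf{a}_3 = \frac{a}{2}(\hat{\mathbf{x}} + \hat{\mathbf{y}} - \hat{\mathbf{z}}). \tag{4.4}$$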

It  is  important  to  convince  oneself  both  geometrically  and  analytically  that  these 
sets  do  indeed  generate  the  bcc  Bravais  lattice. 
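
One way to carry out the analytic half of this check is by brute force. The sketch below works in units of $a/2$, where bcc sites are the integer triples whose coordinates are either all even (cube corners) or all odd (cube centers). It takes the first set of primitive vectors as $a\hat{\mathbf{x}}$, $a\hat{\mathbf{y}}$, $(a/2)(\hat{\mathbf{x}}+\hat{\mathbf{y}}+\hat{\mathbf{z}})$ and the symmetric set as $(a/2)(\hat{\mathbf{y}}+\hat{\mathbf{z}}-\hat{\mathbf{x}})$ and its cyclic partners, then confirms that both sets reach exactly the bcc sites inside a small cube. The function names and the window sizes are assumptions of this illustration.

```python
from itertools import product


def generated_points(basis, n_max, box):
    """Integer combinations of `basis` that land in the cube |x|, |y|, |z| <= box."""
    points = set()
    for n in product(range(-n_max, n_max + 1), repeat=3):
        r = tuple(sum(n[i] * basis[i][k] for i in range(3)) for k in range(3))
        if all(abs(c) <= box for c in r):
            points.add(r)
    return points


def bcc_points(box):
    """bcc sites in units of a/2: coordinates all even or all odd."""
    return {
        r for r in product(range(-box, box + 1), repeat=3)
        if len({c % 2 for c in r}) == 1
    }


# Primitive vectors in units of a/2.
SET_4_3 = [(2, 0, 0), (0, 2, 0), (1, 1, 1)]
SET_4_4 = [(-1, 1, 1), (1, -1, 1), (1, 1, -1)]

target = bcc_points(3)
from_4_3 = generated_points(SET_4_3, 4, 3)
from_4_4 = generated_points(SET_4_4, 4, 3)
print(from_4_3 == target, from_4_4 == target)
```

A coefficient range of $\pm 4$ is enough here because every site in the cube $|x|, |y|, |z| \le 3$ has coefficients of magnitude at most 3 in either basis.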


Figure  4.7 

A  more  symmetric  set  of  primitive  vectors, 
specified in Eq. (4.4), for the body-centered
cubic Bravais lattice. The point
P, for example, has the form
$\mathbf{P} = 2\mathbf{a}_1 + \mathbf{a}_2 + \mathbf{a}_3$.


Another  equally  important  example  is  the  face-centered  cubic  (fee)  Bravais  lattice. 
To  construct  the  face-centered  cubic  Bravais  lattice  add  to  the  simple  cubic  lattice 
of  Figure  4.2  an  additional  point  in  the  center  of  each  square  face  (Figure  4.8).  For 
ease  in  description  think  of  each  cube  in  the  simple  cubic  lattice  as  having  horizontal 
bottom  and  top  faces,  and  four  vertical  side  faces  facing  north,  south,  east,  and  west. 
It  may  sound  as  if  all  points  in  this  new  array  are  not  equivalent,  but  in  fact  they  are. 
One can, for example, consider the new simple cubic lattice formed by the points added
to the centers of all the horizontal faces. The original simple cubic lattice points are
now centering points on the horizontal faces of the new simple cubic lattice, whereas
the points that were added to the centers of the north-south faces of the original cubic
lattice are in the centers of the east-west faces of the new one, and vice versa.

Figure 4.8

Some points from a face-centered
cubic Bravais lattice.

In  the  same  way  one  can  also  regard  the  simple  cubic  lattice  as  being  composed 
of  all  points  centering  the  north-south  faces  of  the  original  simple  cubic  lattice,  or 
all  points  centering  the  east-west  faces  of  the  original  cubic  lattice.  In  either  case  the 
remaining points will be found centered on the faces of the new simple cubic framework.
Thus any point can be thought of either as a corner point or as a face-centering
point  for  any  of  the  three  kinds  of  faces,  and  the  face-centered  cubic  lattice  is  indeed 
a  Bravais  lattice. 

A  symmetric  set  of  primitive  vectors  for  the  face-centered  cubic  lattice  (see  Figure 
4.9)  is 

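$$\mathbf{a}_1 = \frac{a}{2}(\hat{\mathbf{y}} + \hat{\mathbf{z}}), \quad \mathbf{a}_2 = \frac{a}{2}(\hat{\mathbf{z}} + \hat{\mathbf{x}}), \quad \mathbf{a}_3 = \frac{a}{2}(\hat{\mathbf{x}} + \hat{\mathbf{y}}). \tag{4.5}$$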


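Figure 4.9

A set of primitive vectors, as given in Eq. (4.5),
for the face-centered cubic Bravais lattice. The
labeled points are $\mathbf{P} = \mathbf{a}_1 + \mathbf{a}_2 + \mathbf{a}_3$, $\mathbf{Q} = 2\mathbf{a}_2$,
$\mathbf{R} = \mathbf{a}_2 + \mathbf{a}_3$, and $\mathbf{S} = -\mathbf{a}_1 + \mathbf{a}_2 + \mathbf{a}_3$.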


The  face-centered  cubic  and  body-centered  cubic  Bravais  lattices  are  of  great 
importance,  since  an  enormous  variety  of  solids  crystallize  in  these  forms  with  an 
atom  (or  ion)  at  each  lattice  site  (see  Tables  4.1  and  4.2).  (The  corresponding  simple 
cubic  form,  however,  is  very  rare,  the  alpha  phase  of  polonium  being  the  only  known 
example among the elements under normal conditions.)

Table 4.1

ELEMENTS WITH THE MONATOMIC FACE-CENTERED
CUBIC CRYSTAL STRUCTURE

ELEMENT   a (Å)           ELEMENT   a (Å)           ELEMENT   a (Å)
Ar        5.26 (4.2 K)    Ir        3.84            Pt        3.92
Ag        4.09            Kr        5.72 (58 K)     δ-Pu      4.64
Al        4.05            La        5.30            Rh        3.80
Au        4.08            Ne        4.43 (4.2 K)    Sc        4.54
Ca        5.58            Ni        3.52            Sr        6.08
Ce        5.16            Pb        4.95            Th        5.08
β-Co      3.55            Pd        3.89            Xe        6.20 (58 K)
Cu        3.61            Pr        5.16            Yb        5.49

Data in Tables 4.1 to 4.7 are from R. W. G. Wyckoff, Crystal Structures, 2nd ed.,
Interscience, New York, 1963. In most cases, the data are taken at about room temperature
and normal atmospheric pressure. For elements that exist in many forms the
stable room temperature form (or forms) is given. For more detailed information, more
precise lattice constants, and references, the Wyckoff work should be consulted.


Table 4.2

ELEMENTS WITH THE MONATOMIC BODY-CENTERED
CUBIC CRYSTAL STRUCTURE

ELEMENT   a (Å)           ELEMENT   a (Å)           ELEMENT   a (Å)
Ba        5.02            Li        3.49 (78 K)     Ta        3.31
Cr        2.88            Mo        3.15            Tl        3.88
Cs        6.05 (78 K)     Na        4.23 (5 K)      V         3.02
Fe        2.87            Nb        3.30            W         3.16
K         5.23 (5 K)      Rb        5.59 (5 K)

A  NOTE  ON  USAGE 

Although  we  have  defined  the  term  “Bravais  lattice”  to  apply  to  a  set  of  points,  it 
is  also  generally  used  to  refer  to  the  set  of  vectors  joining  any  one  of  these  points 
to  all  the  others.  (Because  the  points  are  a  Bravais  lattice,  this  set  of  vectors  does  not 
depend  on  which  point  is  singled  out  as  the  origin.)  Yet  another  usage  comes  from 
the  fact  that  any  vector  R  determines  a  translation  or  displacement,  in  which  everything 
is  moved  bodily  through  space  by  a  distance  R  in  the  direction  of  R.  The  term  “Bravais 
lattice”  is  also  used  to  refer  to  the  set  of  translations  determined  by  the  vectors,  rather 
than  the  vectors  themselves.  In  practice  it  is  always  clear  from  the  context  whether 
it  is  the  points,  the  vectors,  or  the  translations  that  are  being  referred  to.7 


7 The more general use of the term provides an elegant definition of a Bravais lattice with the precision
of definition (b) and the nonprejudicial nature of definition (a): A Bravais lattice is a discrete set
of vectors not all in a plane, closed under vector addition and subtraction (i.e., the sum and difference of
any two vectors in the set are also in the set).

COORDINATION NUMBER

The  points  in  a  Bravais  lattice  that  are  closest  to  a  given  point  are  called  its  nearest 
neighbors.  Because  of  the  periodic  nature  of  a  Bravais  lattice,  each  point  has  the  same 
number  of  nearest  neighbors.  This  number  is  thus  a  property  of  the  lattice,  and  is 
referred to as the coordination number of the lattice. A simple cubic lattice has coordination
number 6; a body-centered cubic lattice, 8; and a face-centered cubic
lattice,  12.  The  notion  of  a  coordination  number  can  be  extended  in  the  obvious  way 
to  some  simple  arrays  of  points  that  are  not  Bravais  lattices,  provided  that  each  point 
in  the  array  has  the  same  number  of  nearest  neighbors. 
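
The coordination numbers quoted above can be checked by direct counting. The sketch below generates lattice points near the origin from a set of primitive vectors and counts those at the smallest nonzero distance. The function name `coordination_number`, the cutoff `nmax`, and the use of a/2 as the unit of length are assumptions made for this illustration.

```python
from itertools import product

# Primitive vectors in units of a/2 (a is the side of the conventional cube).
LATTICES = {
    "sc":  [(2, 0, 0), (0, 2, 0), (0, 0, 2)],
    "bcc": [(-1, 1, 1), (1, -1, 1), (1, 1, -1)],   # Eq. (4.4)
    "fcc": [(0, 1, 1), (1, 0, 1), (1, 1, 0)],      # Eq. (4.5)
}

def coordination_number(primitive, nmax=2):
    """Count the lattice points at the smallest nonzero distance from the origin."""
    squared = []
    for n in product(range(-nmax, nmax + 1), repeat=3):
        if n == (0, 0, 0):
            continue
        point = [sum(n[i] * primitive[i][k] for i in range(3)) for k in range(3)]
        squared.append(sum(c * c for c in point))
    return squared.count(min(squared))

for name, vectors in LATTICES.items():
    print(name, coordination_number(vectors))
```

Because every point of a Bravais lattice has identical surroundings, it is enough to count the neighbors of the origin.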

PRIMITIVE  UNIT  CELL 

A  volume  of  space  that,  when  translated  through  all  the  vectors  in  a  Bravais  lattice, 
just  fills  all  of  space  without  either  overlapping  itself  or  leaving  voids  is  called  a 
primitive  cell  or  primitive  unit  cell  of  the  lattice.8  There  is  no  unique  way  of  choosing 
a  primitive  cell  for  a  given  Bravais  lattice.  Several  possible  choices  of  primitive  cells 
for  a  two-dimensional  Bravais  lattice  are  illustrated  in  Figure  4.10. 


Figure  4.10 

Several  possible  choices  of  primitive  cell  for  a  single  two-dimensional  Bravais  lattice. 

A primitive cell must contain precisely one lattice point (unless it is so positioned
that there are points on its surface). It follows that if n is the density of points in
the lattice9 and v is the volume of the primitive cell, then nv = 1. Thus v = 1/n. Since
this result holds for any primitive cell, the volume of a primitive cell is independent
of the choice of cell.

8 Translations of the primitive cell may possess common surface points; the nonoverlapping proviso
is only intended to prohibit overlapping regions of nonzero volume.

9 The density n of Bravais lattice points need not, of course, be identical to the density of conduction
electrons in a metal. When the possibility of confusion is present, we shall specify the two densities with
different symbols.

It  also  follows  from  the  definition  of  a  primitive  cell  that,  given  any  two  primitive 
cells  of  arbitrary  shape,  it  is  possible  to  cut  the  first  up  into  pieces,  which,  when 
translated  through  appropriate  lattice  vectors,  can  be  reassembled  to  give  the  second. 
This  is  illustrated  in  Figure  4.11. 


Figure  4.11 

Two possible primitive cells for a two-dimensional
Bravais lattice. The parallelogram cell
(shaded) is obviously primitive; additional
hexagonal cells are indicated to demonstrate
that the hexagonal cell is also primitive. The
parallelogram can be cut into pieces, which,
when translated through lattice vectors, reassemble
to form the hexagon. The translations
for the four regions of the parallelogram are:
Region I: CO; Region II: BO; Region III: AO;
Region IV: no translation.


The obvious primitive cell to associate with a particular set of primitive vectors,
$\mathbf{a}_1$, $\mathbf{a}_2$, $\mathbf{a}_3$, is the set of all points $\mathbf{r}$ of the form

$$\mathbf{r} = x_1\mathbf{a}_1 + x_2\mathbf{a}_2 + x_3\mathbf{a}_3 \tag{4.6}$$

for all $x_i$ ranging continuously between 0 and 1; i.e., the parallelepiped spanned by the
three vectors $\mathbf{a}_1$, $\mathbf{a}_2$, and $\mathbf{a}_3$. This choice has the disadvantage of not displaying the
full symmetry of the Bravais lattice. For example (Figure 4.12), the unit cell (4.6) for the
choice of primitive vectors (4.5) of the fcc Bravais lattice is an oblique parallelepiped,
which does not have the full cubic symmetry of the lattice in which it is embedded.
It is often important to work with cells that do have the full symmetry of their Bravais
lattice. There are two widely used solutions to this problem:


Figure  4.12 

Primitive and conventional unit cells for the face-centered
cubic Bravais lattice. The conventional cell is
the large cube. The primitive cell is the figure with six
parallelogram faces. It has one quarter the volume of
the cube, and rather less symmetry.

UNIT CELL; CONVENTIONAL UNIT CELL

One  can  fill  space  up  with  nonprimitive  unit  cells  (known  simply  as  unit  cells  or 
conventional unit cells). A unit cell is a region that just fills space without any overlapping
when translated through some subset of the vectors of a Bravais lattice. The
conventional  unit  cell  is  generally  chosen  to  be  bigger  than  the  primitive  cell  and  to 
have  the  required  symmetry.  Thus  one  frequently  describes  the  body-centered  cubic 
lattice  in  terms  of  a  cubic  unit  cell  (Figure  4.13)  that  is  twice  as  large  as  a  primitive 
bcc  unit  cell,  and  the  face-centered  cubic  lattice  in  terms  of  a  cubic  unit  cell  (Figure 
4.12) that has four times the volume of a primitive fcc unit cell. (That the conventional
cells  are  two  and  four  times  bigger  than  the  primitive  cells  is  easily  seen  by  asking 
how  many  lattice  points  the  conventional  cubic  cell  must  contain  when  it  is  so  placed 
that  no  points  are  on  its  surface.)  Numbers  specifying  the  size  of  a  unit  cell  (such  as 
the  single  number  a  in  cubic  crystals)  are  called  lattice  constants. 
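
The counting argument in the parenthetical remark can be sketched numerically. Since each primitive cell contains one lattice point, the number of lattice points inside a conventional cube that has been displaced so that no point lies on its surface equals the ratio of the two cell volumes. The function name `points_in_cubic_cell`, the particular displacement of the cube, and the unit of length a/2 are assumptions of this sketch.

```python
from itertools import product

# Primitive vectors in units of a/2 (assumed convention for this sketch).
LATTICES = {
    "sc":  [(2, 0, 0), (0, 2, 0), (0, 0, 2)],
    "bcc": [(-1, 1, 1), (1, -1, 1), (1, 1, -1)],   # Eq. (4.4)
    "fcc": [(0, 1, 1), (1, 0, 1), (1, 1, 0)],      # Eq. (4.5)
}

def points_in_cubic_cell(primitive, nmax=3):
    """Count lattice points inside a cube of side a with one corner at (-a/4, -a/4, -a/4)."""
    count = 0
    for n in product(range(-nmax, nmax + 1), repeat=3):
        point = [sum(n[i] * primitive[i][k] for i in range(3)) for k in range(3)]
        # In units of a/2 the displaced cube spans -0.5 < coordinate < 1.5 on each axis.
        if all(-0.5 < c < 1.5 for c in point):
            count += 1
    return count

for name, vectors in LATTICES.items():
    print(name, points_in_cubic_cell(vectors))
```

The counts are the volume ratios quoted in the text: one for simple cubic, two for bcc, and four for fcc.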

Figure  4.13 

Primitive and conventional unit cells for the body-centered
cubic Bravais lattice. The primitive cell
(shaded)  has  half  the  volume  of  the  conventional 
cubic  cell. 


WIGNER-SEITZ  PRIMITIVE  CELL 

One  can  always  choose  a  primitive  cell  with  the  full  symmetry  of  the  Bravais  lattice. 
By  far  the  most  common  such  choice  is  the  Wigner-Seitz  cell.  The  Wigner-Seitz  cell 
about  a  lattice  point  is  the  region  of  space  that  is  closer  to  that  point  than  to  any 
other  lattice  point.10  Because  of  the  translational  symmetry  of  the  Bravais  lattice, 
the  Wigner-Seitz  cell  about  any  one  lattice  point  must  be  taken  into  the  Wigner-Seitz 
cell  about  any  other,  when  translated  through  the  lattice  vector  that  joins  the  two 
points. Since any point in space has a unique lattice point as its nearest neighbor,11
it will belong to the Wigner-Seitz cell of precisely one lattice point. It follows that a
Wigner-Seitz cell, when translated through all lattice vectors, will just fill space
without overlapping; i.e., the Wigner-Seitz cell is a primitive cell.

10 Such a cell can be defined for any set of discrete points that do not necessarily form a Bravais
lattice. In this broader context the cell is known as a Voronoy polyhedron. In contrast to the Wigner-Seitz
cell, the structure and orientation of a general Voronoy polyhedron will depend on which point of the
array it encloses.

11 Except for points on the common surface of two or more Wigner-Seitz cells.

Since  there  is  nothing  in  the  definition  of  the  Wigner-Seitz  cell  that  refers  to  any 
particular  choice  of  primitive  vectors,  the  Wigner-Seitz  cell  will  be  as  symmetrical 
as  the  Bravais  lattice.12 

The  Wigner-Seitz  unit  cell  is  illustrated  for  a  two-dimensional  Bravais  lattice  in 
Figure  4.14  and  for  the  three-dimensional  body-centered  cubic  and  face-centered 
cubic  Bravais  lattices  in  Figures  4.15  and  4.16. 

Note that the Wigner-Seitz unit cell about a lattice point can be constructed by
drawing lines connecting the point to all others13 in the lattice, bisecting each line
with a plane, and taking the smallest polyhedron containing the point bounded by
these planes.

Figure 4.14

The Wigner-Seitz cell for a two-dimensional
Bravais lattice. The six sides of the cell bisect
the lines joining the central point to its six
nearest neighboring points (shown as dashed
lines). In two dimensions the Wigner-Seitz
cell is always a hexagon unless the lattice is
rectangular (see Problem 4a).

Figure 4.15

The Wigner-Seitz cell for the body-centered cubic Bravais
lattice (a “truncated octahedron”). The surrounding cube is a
conventional body-centered cubic cell with a lattice point at
its center and on each vertex. The hexagonal faces bisect the
lines joining the central point to the points on the vertices
(drawn as solid lines). The square faces bisect the lines joining
the central point to the central points in each of the six neighboring
cubic cells (not drawn). The hexagons are regular (see
Problem 4d).

Figure 4.16

Wigner-Seitz cell for the face-centered cubic Bravais
lattice (a “rhombic dodecahedron”). The surrounding
cube is not the conventional cubic cell of Figure 4.12,
but one in which lattice points are at the center of the
cube and at the center of the 12 edges. Each of the 12
(congruent) faces is perpendicular to a line joining the
central point to a point on the center of an edge.

12 A precise definition of “as symmetrical as” is given in Chapter 7.

13 In practice only a fairly small number of nearby points actually yield planes that bound the cell.
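
The bisecting-plane construction translates directly into a membership test: a point r lies in the Wigner-Seitz cell about the origin when r·R ≤ R·R/2 for every lattice vector R whose plane can bound the cell. The sketch below applies this to the bcc lattice, using only the 8 nearest and 6 next-nearest neighbors (an assumption justified by Figure 4.15), and samples the conventional cube on a regular grid. The names `in_wigner_seitz` and `fraction_of_cube` and the grid size are illustrative choices.

```python
from itertools import product

# bcc lattice vectors near the origin, in units of the cube side a:
# 8 nearest neighbors (cube vertices) and 6 next-nearest (centers of adjacent cubes).
NEIGHBORS = [(0.5 * sx, 0.5 * sy, 0.5 * sz) for sx, sy, sz in product((-1, 1), repeat=3)]
NEIGHBORS += [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]

def in_wigner_seitz(r, neighbors=NEIGHBORS):
    """True if r is at least as close to the origin as to every listed lattice point."""
    # |r| <= |r - R| is equivalent to the bisecting-plane condition r.R <= R.R / 2.
    return all(
        sum(ri * Ri for ri, Ri in zip(r, R)) <= 0.5 * sum(Ri * Ri for Ri in R)
        for R in neighbors
    )

def fraction_of_cube(n=20):
    """Fraction of midpoint-grid samples of the cube [-1/2, 1/2]^3 that lie in the cell."""
    coords = [-0.5 + (i + 0.5) / n for i in range(n)]
    inside = sum(in_wigner_seitz(r) for r in product(coords, repeat=3))
    return inside / n**3

print(fraction_of_cube())
```

With this grid no sample falls on a bounding plane, and half of the samples in the conventional cube land inside the cell, in agreement with the statement that a primitive bcc cell has half the volume of the cubic cell.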


CRYSTAL  STRUCTURE;  LATTICE  WITH  A  BASIS 

A  physical  crystal  can  be  described  by  giving  its  underlying  Bravais  lattice,  together 
with  a  description  of  the  arrangement  of  atoms,  molecules,  ions,  etc.,  within  a 
particular  primitive  cell.  When  emphasizing  the  difference  between  the  abstract 
pattern  of  points  composing  the  Bravais  lattice  and  an  actual  physical  crystal14 
embodying  the  lattice,  the  technical  term  “crystal  structure”  is  used.  A  crystal  structure 
consists  of  identical  copies  of  the  same  physical  unit,  called  the  basis,  located  at  all 
the  points  of  a  Bravais  lattice  (or,  equivalently,  translated  through  all  the  vectors 
of  a  Bravais  lattice).  Sometimes  the  term  lattice  with  a  basis  is  used  instead.  However, 
“lattice  with  a  basis”  is  also  used  in  a  more  general  sense  to  refer  to  what  results 
even  when  the  basic  unit  is  not  a  physical  object  or  objects,  but  another  set  of  points. 
For  example,  the  vertices  of  a  two-dimensional  honeycomb,  though  not  a  Bravais 
lattice, can be represented as a two-dimensional triangular Bravais lattice15 with a
two-point  basis  (Figure  4.17).  A  crystal  structure  with  a  basis  consisting  of  a  single 
atom  or  ion  is  often  called  a  monatomic  Bravais  lattice. 

Figure  4.17 

The  honeycomb  net,  drawn  so  as 
to  emphasize  that  it  is  a  Bravais 
lattice  with  a  two-point  basis.  The 
pairs  of  points  joined  by  heavy 
solid  lines  are  identically  placed  in 
the  primitive  cells  (parallelograms) 
of  the  underlying  Bravais  lattice. 


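One also can describe a Bravais lattice as a lattice with a basis by choosing a nonprimitive
conventional unit cell. This is often done to emphasize the cubic symmetry
of the bcc and fcc Bravais lattices, which are then described, respectively, as simple
cubic lattices spanned by $a\hat{\mathbf{x}}$, $a\hat{\mathbf{y}}$, and $a\hat{\mathbf{z}}$, with a two-point basis

$$\mathbf{0}, \quad \frac{a}{2}(\hat{\mathbf{x}} + \hat{\mathbf{y}} + \hat{\mathbf{z}}) \qquad \text{(bcc)} \tag{4.7}$$

or a four-point basis

$$\mathbf{0}, \quad \frac{a}{2}(\hat{\mathbf{x}} + \hat{\mathbf{y}}), \quad \frac{a}{2}(\hat{\mathbf{y}} + \hat{\mathbf{z}}), \quad \frac{a}{2}(\hat{\mathbf{z}} + \hat{\mathbf{x}}) \qquad \text{(fcc)}. \tag{4.8}$$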
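
That the two descriptions of the fcc lattice agree can be checked by generating the points both ways and comparing them inside a finite box. The following sketch does this with integer coordinates in units of a/2. The function names, the box size, and the index ranges are assumptions chosen so that every point in the box is produced by both constructions.

```python
from itertools import product

PRIMITIVE_FCC = [(0, 1, 1), (1, 0, 1), (1, 1, 0)]          # Eq. (4.5), units of a/2
BASIS_FCC = [(0, 0, 0), (1, 1, 0), (0, 1, 1), (1, 0, 1)]   # Eq. (4.8), units of a/2

def in_box(p, half_width=2):
    """Keep only points with every coordinate between -a and a."""
    return all(abs(c) <= half_width for c in p)

def from_primitive_vectors(nmax=4):
    """Integral combinations of the primitive vectors (4.5)."""
    points = set()
    for n in product(range(-nmax, nmax + 1), repeat=3):
        p = tuple(sum(n[i] * PRIMITIVE_FCC[i][k] for i in range(3)) for k in range(3))
        if in_box(p):
            points.add(p)
    return points

def from_cubic_lattice_with_basis(mmax=2):
    """Simple cubic lattice points a*m, each decorated with the four-point basis (4.8)."""
    points = set()
    for m in product(range(-mmax, mmax + 1), repeat=3):
        for d in BASIS_FCC:
            p = tuple(2 * m[k] + d[k] for k in range(3))
            if in_box(p):
                points.add(p)
    return points

print(from_primitive_vectors() == from_cubic_lattice_with_basis())
```

The same comparison can be repeated for bcc by substituting the vectors (4.4) and the two-point basis (4.7).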


14  But  still  idealized  in  being  infinite  in  extent. 

15 Spanned by two primitive vectors of equal length, making an angle of 60°.

SOME IMPORTANT EXAMPLES OF CRYSTAL STRUCTURES AND
LATTICES WITH BASES

Diamond  Structure 

The diamond lattice16 (formed by the carbon atoms in a diamond crystal) consists of
two interpenetrating face-centered cubic Bravais lattices, displaced along the body
diagonal of the cubic cell by one quarter the length of the diagonal. It can be regarded
as a face-centered cubic lattice with the two-point basis $\mathbf{0}$ and $(a/4)(\hat{\mathbf{x}} + \hat{\mathbf{y}} + \hat{\mathbf{z}})$. The
coordination number is 4 (Figure 4.18). The diamond lattice is not a Bravais lattice,
because the environment of any point differs in orientation from the environments
of its nearest neighbors. Elements crystallizing in the diamond structure are given
in Table 4.3.

Figure 4.18

Conventional cubic cell of the diamond lattice.
For clarity, sites corresponding to one of the
two interpenetrating face-centered cubic lattices
are unshaded. (In the zincblende structure the
shaded sites are occupied by one kind of ion,
and the unshaded by another.) Nearest-neighbor
bonds have been drawn in. The four nearest
neighbors of each point form the vertices of a
regular tetrahedron.


Table 4.3

ELEMENTS WITH THE DIAMOND CRYSTAL
STRUCTURE

ELEMENT        CUBE SIDE a (Å)
C (diamond)    3.57
Si             5.43
Ge             5.66
α-Sn (grey)    6.49
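
The coordination number and the tetrahedral arrangement of the diamond structure can be illustrated by building the sites from the fcc primitive vectors (4.5) and the two-point basis given above. In the sketch below all lengths are in units of a/4 so that coordinates are integers. The function names and the cutoff `nmax` are assumptions of this illustration.

```python
import math
from itertools import product

PRIMITIVE_FCC = [(0, 2, 2), (2, 0, 2), (2, 2, 0)]   # Eq. (4.5) in units of a/4
BASIS = [(0, 0, 0), (1, 1, 1)]                       # two-point basis in units of a/4

def diamond_sites(nmax=2):
    """Sites of the diamond structure near the origin."""
    sites = set()
    for n in product(range(-nmax, nmax + 1), repeat=3):
        for d in BASIS:
            sites.add(tuple(
                sum(n[i] * PRIMITIVE_FCC[i][k] for i in range(3)) + d[k] for k in range(3)
            ))
    return sites

def nearest_neighbors(sites, origin=(0, 0, 0)):
    """Vectors from origin to the sites at the smallest nonzero distance."""
    vectors = [tuple(s[k] - origin[k] for k in range(3)) for s in sites if s != origin]
    dmin2 = min(sum(c * c for c in v) for v in vectors)
    return [v for v in vectors if sum(c * c for c in v) == dmin2]

neighbors = nearest_neighbors(diamond_sites())
u, v = neighbors[0], neighbors[1]
cos_angle = sum(x * y for x, y in zip(u, v)) / (math.hypot(*u) * math.hypot(*v))
print(len(neighbors), round(math.degrees(math.acos(cos_angle)), 2))
```

The four bond vectors found this way point to alternate corners of a cube centered on the site, which is the regular tetrahedron described in the caption of Figure 4.18.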

Hexagonal  Close-Packed  Structure 

Though not a Bravais lattice, the hexagonal close-packed (hcp) structure ranks in
importance with the body-centered cubic and face-centered cubic Bravais lattices;
about 30 elements crystallize in the hexagonal close-packed form (Table 4.4).

16 We use the word “lattice,” without qualifications, to refer either to a Bravais lattice or a lattice
with a basis.

Table 4.4

ELEMENTS WITH THE HEXAGONAL CLOSE-PACKED CRYSTAL
STRUCTURE

ELEMENT    a (Å)   c      c/a     ELEMENT    a (Å)   c      c/a
Be         2.29    3.58   1.56    Os         2.74    4.32   1.58
Cd         2.98    5.62   1.89    Pr         3.67    5.92   1.61
Ce         3.65    5.96   1.63    Re         2.76    4.46   1.62
α-Co       2.51    4.07   1.62    Ru         2.70    4.28   1.59
Dy         3.59    5.65   1.57    Sc         3.31    5.27   1.59
Er         3.56    5.59   1.57    Tb         3.60    5.69   1.58
Gd         3.64    5.78   1.59    Ti         2.95    4.69   1.59
He (2 K)   3.57    5.83   1.63    Tl         3.46    5.53   1.60
Hf         3.20    5.06   1.58    Tm         3.54    5.55   1.57
Ho         3.58    5.62   1.57    Y          3.65    5.73   1.57
La         3.75    6.07   1.62    Zn         2.66    4.95   1.86
Lu         3.50    5.55   1.59    Zr         3.23    5.15   1.59
Mg         3.21    5.21   1.62
Nd         3.66    5.90   1.61    “Ideal”                   1.63

Underlying the hcp structure is a simple hexagonal Bravais lattice, given by stacking
two-dimensional triangular nets15 directly above each other (Figure 4.19). The direction
of stacking ($\mathbf{a}_3$, below) is known as the c-axis. Three primitive vectors are

$$\mathbf{a}_1 = a\hat{\mathbf{x}}, \quad \mathbf{a}_2 = \frac{a}{2}\hat{\mathbf{x}} + \frac{\sqrt{3}\,a}{2}\hat{\mathbf{y}}, \quad \mathbf{a}_3 = c\hat{\mathbf{z}}. \tag{4.9}$$

The first two generate a triangular lattice in the x-y plane, and the third stacks the
planes a distance c above one another.

The hexagonal close-packed structure consists of two interpenetrating simple hexagonal
Bravais lattices, displaced from one another by $\mathbf{a}_1/3 + \mathbf{a}_2/3 + \mathbf{a}_3/2$ (Figure
4.20). The name reflects the fact that close-packed hard spheres can be arranged in
such a structure. Consider stacking cannonballs (Figure 4.21), starting with a close-packed
triangular lattice as the first layer. The next layer is formed by placing a ball
in the depressions left in the center of every other triangle in the first layer, thereby
forming a second triangular layer, shifted with respect to the first. The third layer is
formed by placing balls in alternate depressions in the second layer, so that they lie
directly over the balls in the first layer. The fourth layer lies directly over the second,
and so on. The resulting lattice is hexagonal close-packed with the particular value
(see Problem 5):

$$c = \sqrt{\frac{8}{3}}\,a = 1.63299a. \tag{4.10}$$

Figure 4.19

The simple hexagonal Bravais lattice. Two-dimensional triangular nets (shown in inset) are
stacked directly above one another, a distance c apart.

Figure 4.20

The hexagonal close-packed crystal structure. It can
be viewed as two interpenetrating simple hexagonal
Bravais lattices, displaced vertically by a distance c/2
along the common c-axis, and displaced horizontally
so that the points of one lie directly above the centers
of the triangles formed by the points of the other.


Figure  4.21 

View from above of the first two layers in a stack
of  cannonballs.  The  first  layer  is  arranged  in  a 
plane  triangular  lattice.  Balls  in  the  second  layer 
are  placed  above  alternate  interstices  in  the  first. 
If  balls  in  the  third  layer  are  placed  directly 
above  those  in  the  first,  at  sites  of  the  type 
shown  in  inset  (a),  balls  in  the  fourth  directly 
above  those  in  the  second,  etc.,  the  resulting 
structure  will  be  close-packed  hexagonal.  If, 
however,  balls  in  the  third  layer  are  placed 
directly  above  those  interstices  in  the  first  that 
were  not  covered  by  balls  in  the  second,  at  sites 
of  the  type  shown  in  inset  (b),  balls  in  the  fourth 
layer  placed  directly  above  those  in  the  first, 
balls  in  the  fifth  directly  above  those  in  the 
second, etc., the resulting structure will be face-centered
cubic (with the body diagonal of the
cube oriented vertically).


Because, however, the symmetry of the hexagonal close-packed lattice is independent
of the c/a ratio, the name is not restricted to this case. The value $c/a = \sqrt{8/3}$ is
sometimes called “ideal,” and the truly close-packed structure, with the ideal value
of c/a, is known as an ideal hcp structure. Unless, however, the physical units in the
hcp structure are actually close-packed spheres, there is no reason why c/a should
be  ideal  (see  Table  4.4). 
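
The ideal value follows from the cannonball picture: a ball in the second layer sits above the center of a triangle of three touching balls in the first layer and touches all three. The sketch below evaluates that geometry numerically and compares the result with a few c/a ratios copied from Table 4.4. The variable names are illustrative, and the short dictionary is only a sample of the table, not a complete transcription.

```python
import math

a = 1.0                          # nearest-neighbor distance within a layer (sphere diameter)
rho = a / math.sqrt(3)           # in-plane distance from a sphere center to the center of a triangle
h = math.sqrt(a**2 - rho**2)     # layer spacing: touching spheres satisfy rho^2 + h^2 = a^2
ideal = 2 * h / a                # c spans two layers

# A few c/a values copied from Table 4.4.
TABLE_4_4 = {"Be": 1.56, "Mg": 1.62, "Zn": 1.86, "Cd": 1.89, "He (2 K)": 1.63}

print(f"ideal c/a = {ideal:.6f}")
for element, ratio in TABLE_4_4.items():
    print(f"{element:9s} c/a = {ratio:.2f}  deviation = {ratio - ideal:+.2f}")
```

The deviations make the point of the paragraph above: most entries lie within a few percent of the ideal ratio, while zinc and cadmium depart from it noticeably.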


Note, as in the case of the diamond structure, that the hcp lattice is not a Bravais
lattice, because the orientation of the environment of a point varies from layer to
layer along the c-axis. Note also that, when viewed along the c-axis, the two types
of planes merge to form the two-dimensional honeycomb array of Figure 4.3, which
is not a Bravais lattice.
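
An equivalent test uses the definition in footnote 7: a set of points containing the origin is a Bravais lattice only if sums and differences of its points are again points of the set. For a lattice with a basis it suffices to test the basis points, since the underlying lattice translations are closed by construction. The sketch below works in components along the primitive vectors of the underlying lattice, using exact fractions. The names `in_structure` and `is_bravais` are assumptions, and the test presumes that the origin is one of the basis points.

```python
from fractions import Fraction as F
from itertools import product

def in_structure(p, basis):
    """True if p differs from some basis point by an integral combination of primitive vectors."""
    return any(all((pk - dk).denominator == 1 for pk, dk in zip(p, d)) for d in basis)

def is_bravais(basis):
    """Closure test: sums and differences of basis points must again belong to the structure."""
    for d1, d2 in product(basis, repeat=2):
        total = tuple(x + y for x, y in zip(d1, d2))
        diff = tuple(x - y for x, y in zip(d1, d2))
        if not (in_structure(total, basis) and in_structure(diff, basis)):
            return False
    return True

# Components along the primitive vectors of the underlying Bravais lattice.
HCP = [(F(0), F(0), F(0)), (F(1, 3), F(1, 3), F(1, 2))]   # simple hexagonal, two-point basis
HONEYCOMB = [(F(0), F(0)), (F(1, 3), F(1, 3))]            # triangular net, two-point basis
BCC = [(F(0), F(0), F(0)), (F(1, 2), F(1, 2), F(1, 2))]   # simple cubic, basis (4.7)

for name, basis in [("hcp", HCP), ("honeycomb", HONEYCOMB), ("bcc", BCC)]:
    print(name, is_bravais(basis))
```

The hcp and honeycomb arrays fail the closure test, while the bcc lattice described as simple cubic with a two-point basis passes it.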

Other  Close-Packing  Possibilities 

Note that the hcp structure is not the only way to close-pack spheres. If the first two
layers  are  laid  down  as  described  above,  but  the  third  is  placed  in  the  other  set  of 
depressions  in  the  second — i.e.,  those  lying  above  unused  depressions  in  both  the  first 
and  second  layers  (see  Figure  4.21) — and  then  the  fourth  layer  is  placed  in  depressions 
in  the  third  directly  above  the  balls  in  the  first,  the  fifth  above  the  second,  and  so  on, 
one  generates  a  Bravais  lattice.  This  Bravais  lattice  turns  out  to  be  nothing  but  the 
face-centered  cubic  lattice,  with  the  cube  diagonal  perpendicular  to  the  triangular 
planes  (Figures  4.22  and  4.23). 


Figure  4.22 

How  to  section  the  face-centered  cubic  Bravais  lattice  to  get 
the  layers  pictured  in  Figure  4.21. 


Figure  4.23 

A  cubic  section  of  some  face-centered  cubic  close-packed 
spheres. 


There are infinitely many other close-packing arrangements, since each successive
layer can be placed in either of two positions. Only fcc close-packing gives a Bravais
lattice, and the fcc (...ABCABCABC...) and hcp (...ABABAB...) structures are by
far the most commonly encountered. Other close-packed structures are observed,
however. Certain rare earth metals, for example, take on a structure of the form
(...ABACABACABAC...).
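
Whatever the stacking sequence, every sphere in a close-packed arrangement touches 12 others: 6 in its own layer and 3 in each adjacent layer. The sketch below builds a few layers for a given periodic sequence of the letters A, B, and C and counts the touching neighbors of one sphere. The layer offsets, the function names, and the patch size `n` are assumptions of this illustration; the in-plane vectors are those of Eq. (4.9) with a = 1, and adjacent letters in the sequence must differ.

```python
import math

A1 = (1.0, 0.0)                       # in-plane primitive vectors, Eq. (4.9) with a = 1
A2 = (0.5, math.sqrt(3) / 2)
SHIFT = {"A": 0.0, "B": 1 / 3, "C": 2 / 3}   # layer offset in units of (a1 + a2)
H = math.sqrt(2 / 3)                  # spacing between close-packed layers (c/2 at ideal c/a)

def layer_points(letter, z_index, n=3):
    """Sphere centers of one triangular layer of type A, B, or C at height z_index * H."""
    s = SHIFT[letter]
    return [
        ((i + s) * A1[0] + (j + s) * A2[0], (i + s) * A1[1] + (j + s) * A2[1], z_index * H)
        for i in range(-n, n + 1)
        for j in range(-n, n + 1)
    ]

def coordination(sequence, layer):
    """Number of spheres touching one sphere in the given layer of a periodic stacking."""
    period = len(sequence)
    center = layer_points(sequence[layer % period], layer, n=0)[0]
    count = 0
    for k in (layer - 1, layer, layer + 1):
        for p in layer_points(sequence[k % period], k):
            if abs(math.dist(center, p) - 1.0) < 1e-9:
                count += 1
    return count

for seq in ("AB", "ABC", "ABAC"):
    print(seq, [coordination(seq, k) for k in range(len(seq))])
```

The three sequences shown correspond to the hcp, fcc, and rare earth stackings named above, and each layer of each sequence gives the same count.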

The Sodium Chloride Structure

We  are  forced  to  describe  the  hexagonal  close-packed  and  diamond  lattices  as  lattices 
with  bases  by  the  intrinsic  geometrical  arrangement  of  the  lattice  points.  A  lattice 
with  a  basis  is  also  necessary,  however,  in  describing  crystal  structures  in  which  the 
atoms  or  ions  are  located  only  at  the  points  of  a  Bravais  lattice,  but  in  which  the  crystal 
structure  nevertheless  lacks  the  full  translational  symmetry  of  the  Bravais  lattice 
because  more  than  one  kind  of  atom  or  ion  is  present.  For  example,  sodium  chloride 
(Figure  4.24)  consists  of  equal  numbers  of  sodium  and  chlorine  ions  placed  at  alternate 
points  of  a  simple  cubic  lattice,  in  such  a  way  that  each  ion  has  six  of  the  other  kind 
of  ions  as  its  nearest  neighbors.17  This  structure  can  be  described  as  a  face-centered 
cubic  Bravais  lattice  with  a  basis  consisting  of  a  sodium  ion  at  0  and  a  chlorine  ion 
at the center of the conventional cubic cell, $(a/2)(\hat{\mathbf{x}} + \hat{\mathbf{y}} + \hat{\mathbf{z}})$.
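
This description can be checked by building the structure from the fcc primitive vectors (4.5) and the two-ion basis, then listing the nearest neighbors of a sodium ion. In the sketch below coordinates are integers in units of a/2. The function names and the cutoff `n` are assumptions of this illustration.

```python
from itertools import product

PRIMITIVE_FCC = [(0, 1, 1), (1, 0, 1), (1, 1, 0)]   # Eq. (4.5) in units of a/2
BASIS = {"Na": (0, 0, 0), "Cl": (1, 1, 1)}           # basis in units of a/2

def build(n=2):
    """Map each occupied site near the origin to the species found there."""
    ions = {}
    for n1, n2, n3 in product(range(-n, n + 1), repeat=3):
        R = tuple(
            n1 * PRIMITIVE_FCC[0][k] + n2 * PRIMITIVE_FCC[1][k] + n3 * PRIMITIVE_FCC[2][k]
            for k in range(3)
        )
        for species, d in BASIS.items():
            ions[tuple(R[k] + d[k] for k in range(3))] = species
    return ions

def nearest_neighbors(ions, site=(0, 0, 0)):
    """Smallest squared distance from site, and the species found at that distance."""
    d2 = {p: sum((p[k] - site[k]) ** 2 for k in range(3)) for p in ions if p != site}
    dmin = min(d2.values())
    return dmin, [ions[p] for p, value in d2.items() if value == dmin]

ions = build()
print(ions[(0, 0, 0)], nearest_neighbors(ions))
```

The sodium ion at the origin has six chlorine neighbors at squared distance 1 in these units, i.e. at distance a/2, and together the ions occupy the sites of a simple cubic lattice of side a/2.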


Figure  4.24 

The  sodium  chloride  structure.  One  type  of  ion  is  repre¬ 
sented  by  black  balls,  the  other  type  by  white.  The  black 
and  white  balls  form  interpenetrating  fee  lattices. 


Table 4.5

SOME COMPOUNDS WITH THE SODIUM CHLORIDE STRUCTURE

CRYSTAL   a (Å)    CRYSTAL   a (Å)    CRYSTAL   a (Å)
LiF       4.02     RbF       5.64     CaS       5.69
LiCl      5.13     RbCl      6.58     CaSe      5.91
LiBr      5.50     RbBr      6.85     CaTe      6.34
LiI       6.00     RbI       7.34     SrO       5.16
NaF       4.62     CsF       6.01     SrS       6.02
NaCl      5.64     AgF       4.92     SrSe      6.23
NaBr      5.97     AgCl      5.55     SrTe      6.47
NaI       6.47     AgBr      5.77     BaO       5.52
KF        5.35     MgO       4.21     BaS       6.39
KCl       6.29     MgS       5.20     BaSe      6.60
KBr       6.60     MgSe      5.45     BaTe      6.99
KI        7.07     CaO       4.81

The  Cesium  Chloride  Structure 

Similarly, cesium chloride (Figure 4.25) consists of equal numbers of cesium and
chlorine ions, placed at the points of a body-centered cubic lattice so that each ion
has eight of the other kind as its nearest neighbors.18 The translational symmetry
of this structure is that of the simple cubic Bravais lattice, and it is described as a
simple cubic lattice with a basis consisting of a cesium ion at the origin $\mathbf{0}$ and a chlorine
ion at the cube center $(a/2)(\hat{\mathbf{x}} + \hat{\mathbf{y}} + \hat{\mathbf{z}})$.

17 For examples see Table 4.5.


Figure  4.25 

The  cesium  chloride  structure.  One  type  of  ion  is  repre¬ 
sented  by  black  balls,  the  other  type  by  white.  The  black 
and  white  balls  form  interpenetrating  simple  cubic  lattices. 


Table 4.6

SOME COMPOUNDS WITH THE CESIUM CHLORIDE
STRUCTURE

CRYSTAL   a (Å)    CRYSTAL   a (Å)
CsCl      4.12     TlCl      3.83
CsBr      4.29     TlBr      3.97
CsI       4.57     TlI       4.20

The  Zincblende  Structure 

Zincblende  has  equal  numbers  of  zinc  and  sulfur  ions  distributed  on  a  diamond  lattice 
so  that  each  has  four  of  the  opposite  kind  as  nearest  neighbors  (Figure  4.18).  This 
structure19  is  an  example  of  a  lattice  with  a  basis,  which  must  be  so  described  both 
because  of  the  geometrical  position  of  the  ions  and  because  two  types  of  ions  occur. 


Table 4.7

SOME COMPOUNDS WITH THE ZINCBLENDE STRUCTURE

CRYSTAL      a (Å)    CRYSTAL   a (Å)    CRYSTAL   a (Å)
CuF          4.26     ZnS       5.41     AlSb      6.13
CuCl         5.41     ZnSe      5.67     GaP       5.45
CuBr         5.69     ZnTe      6.09     GaAs      5.65
CuI          6.04     CdS       5.82     GaSb      6.12
AgI          6.47     CdTe      6.48     InP       5.87
BeS          4.85     HgS       5.85     InAs      6.04
BeSe         5.07     HgSe      6.08     InSb      6.48
BeTe         5.54     HgTe      6.43     SiC       4.35
MnS (red)    5.60     AlP       5.45
MnSe         5.82     AlAs      5.62

18 For examples see Table 4.6.

19 For examples see Table 4.7.

OTHER ASPECTS OF CRYSTAL LATTICES

This  chapter  has  concentrated  on  the  description  of  the  translational  symmetry  of 
crystal  lattices  in  real  physical  space.  Two  other  aspects  of  periodic  arrays  will  be 
dealt  with  in  subsequent  chapters:  in  Chapter  5  we  examine  the  consequences  of 
translational  symmetry  not  in  real  space,  but  in  the  so-called  reciprocal  (or  wave 
vector)  space,  and  in  Chapter  7  we  describe  some  features  of  the  rotational  symmetry 
of  crystal  lattices. 


PROBLEMS 


1.  In  each  of  the  following  cases  indicate  whether  the  structure  is  a  Bravais  lattice.  If  it  is,  give 
three  primitive  vectors;  if  it  is  not,  describe  it  as  a  Bravais  lattice  with  as  small  as  possible  a  basis. 

(a)  Base-centered  cubic  (simple  cubic  with  additional  points  in  the  centers  of  the  horizontal 
faces  of  the  cubic  cell). 

(b)  Side-centered  cubic  (simple  cubic  with  additional  points  in  the  centers  of  the  vertical 
faces  of  the  cubic  cell). 

(c)  Edge-centered  cubic  (simple  cubic  with  additional  points  at  the  midpoints  of  the  lines 
joining  nearest  neighbors). 

2. What is the Bravais lattice formed by all points with Cartesian coordinates $(n_1, n_2, n_3)$ if:

(a) The $n_i$ are either all even or all odd?

(b) The sum of the $n_i$ is required to be even?

3. Show that the angle between any two of the lines (bonds) joining a site of the diamond lattice
to its four nearest neighbors is $\cos^{-1}(-1/3) = 109°28'$.

4.  (a)  Prove  that  the  Wigner-Seitz  cell  for  any  two-dimensional  Bravais  lattice  is  either  a 
hexagon  or  a  rectangle. 

(b) Show that the ratio of the lengths of the diagonals of each parallelogram face of the
Wigner-Seitz cell for the face-centered cubic lattice (Figure 4.16) is $\sqrt{2}:1$.

(c)  Show  that  every  edge  of  the  polyhedron  bounding  the  Wigner-Seitz  cell  of  the  body- 
centered  cubic  lattice  (Figure  4.15)  is  v/2/4  times  the  length  of  the  conventional  cubic  cell. 

(d)  Prove  that  the  hexagonal  faces  of  the  bcc  Wigner-Seitz  cell  are  all  regular  hexagons. 
(Note  that  the  axis  perpendicular  to  a  hexagonal  face  passing  through  its  center  has  only  threefold 
symmetry,  so  this  symmetry  alone  is  not  enough.) 

5. (a) Prove that the ideal c/a ratio for the hexagonal close-packed structure is $\sqrt{8/3} = 1.633$.

(b) Sodium transforms from bcc to hcp at about 23 K (the "martensitic" transformation).
Assuming that the density remains fixed through this transition, find the lattice constant a of
the hexagonal phase, given that a = 4.23 Å in the cubic phase and that the c/a ratio is
indistinguishable from its ideal value.

6. The face-centered cubic is the most dense and the simple cubic is the least dense of the three
cubic Bravais lattices. The diamond structure is less dense than any of these. One measure of
this is that the coordination numbers are: fcc, 12; bcc, 8; sc, 6; diamond, 4. Another is the following:
Suppose identical solid spheres are distributed through space in such a way that their centers
lie on the points of each of these four structures, and spheres on neighboring points just touch,
without overlapping. (Such an arrangement of spheres is called a close-packing arrangement.)
Assuming that the spheres have unit density, show that the density of a set of close-packed spheres
on each of the four structures (the "packing fraction") is:

fcc: $\sqrt{2}\pi/6 = 0.74$

bcc: $\sqrt{3}\pi/8 = 0.68$

sc: $\pi/6 = 0.52$

diamond: $\sqrt{3}\pi/16 = 0.34$.
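
The quoted values can be checked numerically before attempting the proof. The sketch below is only a sanity check, not a derivation: it assumes the standard number of sites per conventional cubic cell (fcc 4, bcc 2, sc 1, diamond 8) and the standard nearest-neighbor distances in units of the cube side, and the function name `packing_fraction` is a hypothetical helper introduced here.

```python
import math

def packing_fraction(points_per_cell, nn_distance_over_a):
    """Volume fraction of a unit cube (side a = 1) filled by touching spheres.

    The sphere radius is half the nearest-neighbor distance, so that
    spheres on neighboring sites just touch.
    """
    radius = nn_distance_over_a / 2.0
    return points_per_cell * (4.0 / 3.0) * math.pi * radius ** 3

# (sites per conventional cubic cell, nearest-neighbor distance / a);
# these are assumed standard values, not taken from the problem text.
structures = {
    "fcc": (4, math.sqrt(2) / 2),
    "bcc": (2, math.sqrt(3) / 2),
    "sc": (1, 1.0),
    "diamond": (8, math.sqrt(3) / 4),
}

fractions = {name: packing_fraction(n, d) for name, (n, d) in structures.items()}

for name, value in fractions.items():
    print(f"{name:8s} {value:.2f}")
```

The printed values round to the four numbers listed in the problem; the analytic forms still have to be shown by hand.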

7. Let $N_n$ be the number of nth nearest neighbors of a given Bravais lattice point (e.g., in a simple
cubic Bravais lattice $N_1 = 6$, $N_2 = 12$, etc.). Let $r_n$ be the distance to the nth nearest neighbor
expressed as a multiple of the nearest neighbor distance (e.g., in a simple cubic Bravais lattice
$r_1 = 1$, $r_2 = \sqrt{2} = 1.414$). Make a table of $N_n$ and $r_n$ for $n = 1, \ldots, 6$ for the fcc, bcc, and sc
Bravais lattices.

8. (a) Given a Bravais lattice, let $\mathbf{a}_1$ be a vector joining a particular point P to one of its nearest
neighbors. Let P' be a lattice point not on the line through P in the direction of $\mathbf{a}_1$ that is as
close to the line as any other lattice point, and let $\mathbf{a}_2$ join P to P'. Let P" be a lattice point not
on the plane through P determined by $\mathbf{a}_1$ and $\mathbf{a}_2$ that is as close to the plane as any other lattice
point, and let $\mathbf{a}_3$ join P to P". Prove that $\mathbf{a}_1$, $\mathbf{a}_2$, and $\mathbf{a}_3$ are a set of primitive vectors for the
Bravais lattice.

(b)  Prove  that  a  Bravais  lattice  can  be  defined  as  a  discrete  set  of  vectors,  not  all  in  a  plane, 
closed  under  addition  and  subtraction  (as  described  on  page  70). 


The  Reciprocal 
Lattice 


Definitions  and  Examples 

First  Brillouin  Zone 

Lattice  Planes  and  Miller  Indices 
The reciprocal lattice plays a fundamental role in most analytic studies of periodic
structures. One is led to it from such diverse avenues as the theory of crystal diffraction,
the abstract study of functions with the periodicity of a Bravais lattice, or the question
of what can be salvaged of the law of momentum conservation when the full translational
symmetry of free space is reduced to that of a periodic potential. In this brief
chapter we shall describe some important elementary features of the reciprocal lattice
from a general point of view not tied to any particular application.

DEFINITION  OF  RECIPROCAL  LATTICE 

Consider a set of points R constituting a Bravais lattice, and a plane wave, $e^{i\mathbf{k}\cdot\mathbf{r}}$. For
general  k,  such  a  plane  wave  will  not,  of  course,  have  the  periodicity  of  the  Bravais 
lattice,  but  for  certain  special  choices  of  wave  vector  it  will.  The  set  of  all  wave  vectors 
K  that  yield  plane  waves  with  the  periodicity  of  a  given  Bravais  lattice  is  known  as  its 
reciprocal  lattice.  Analytically,  K  belongs  to  the  reciprocal  lattice  of  a  Bravais  lattice 
of  points  R,  provided  that  the  relation 

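$e^{i\mathbf{K}\cdot(\mathbf{r}+\mathbf{R})} = e^{i\mathbf{K}\cdot\mathbf{r}}$  (5.1)

holds for any r, and for all R in the Bravais lattice. Factoring out $e^{i\mathbf{K}\cdot\mathbf{r}}$, we can characterize
the reciprocal lattice as the set of wave vectors K satisfying

$e^{i\mathbf{K}\cdot\mathbf{R}} = 1$  (5.2)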

for  all  R  in  the  Bravais  lattice. 

Note  that  a  reciprocal  lattice  is  defined  with  reference  to  a  particular  Bravais 
lattice.  The  Bravais  lattice  that  determines  a  given  reciprocal  lattice  is  often  referred 
to  as  the  direct  lattice,  when  viewed  in  relation  to  its  reciprocal.  Note  also  that 
although  one  could  define  a  set  of  vectors  K  satisfying  (5.2)  for  an  arbitrary  set  of 
vectors  R,  such  a  set  of  K  is  called  a  reciprocal  lattice  only  if  the  set  of  vectors  R  is 
a  Bravais  lattice.1 


THE  RECIPROCAL  LATTICE  IS  A  BRAVAIS  LATTICE 


That  the  reciprocal  lattice  is  itself  a  Bravais  lattice  follows  most  simply  from  the 
definition  of  a  Bravais  lattice  given  in  footnote  7  of  Chapter  4,  along  with  the  fact 
that if $\mathbf{K}_1$ and $\mathbf{K}_2$ satisfy (5.2), so, obviously, will their sum and difference.

It is worth considering a more clumsy proof of this fact, which provides an explicit
algorithm for constructing the reciprocal lattice. Let $\mathbf{a}_1$, $\mathbf{a}_2$, and $\mathbf{a}_3$ be a set of primitive
vectors for the direct lattice. Then the reciprocal lattice can be generated by the three
primitive vectors

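$\mathbf{b}_1 = 2\pi \dfrac{\mathbf{a}_2 \times \mathbf{a}_3}{\mathbf{a}_1 \cdot (\mathbf{a}_2 \times \mathbf{a}_3)},$

$\mathbf{b}_2 = 2\pi \dfrac{\mathbf{a}_3 \times \mathbf{a}_1}{\mathbf{a}_1 \cdot (\mathbf{a}_2 \times \mathbf{a}_3)},$  (5.3)

$\mathbf{b}_3 = 2\pi \dfrac{\mathbf{a}_1 \times \mathbf{a}_2}{\mathbf{a}_1 \cdot (\mathbf{a}_2 \times \mathbf{a}_3)}.$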


1 In particular, in working with a lattice with a basis one uses the reciprocal lattice determined by
the underlying Bravais lattice, rather than a set of K satisfying (5.2) for vectors R describing both the
Bravais lattice and the basis points.
To verify that (5.3) gives a set of primitive vectors for the reciprocal lattice, one
first notes that the $\mathbf{b}_i$ satisfy2

$\mathbf{b}_i \cdot \mathbf{a}_j = 2\pi\delta_{ij},$  (5.4)

where $\delta_{ij}$ is the Kronecker delta symbol:

$\delta_{ij} = 0, \quad i \neq j;$
$\delta_{ij} = 1, \quad i = j.$  (5.5)

Now any vector k can be written as a linear combination3 of the $\mathbf{b}_i$:

$\mathbf{k} = k_1\mathbf{b}_1 + k_2\mathbf{b}_2 + k_3\mathbf{b}_3.$  (5.6)

If R is any direct lattice vector, then

$\mathbf{R} = n_1\mathbf{a}_1 + n_2\mathbf{a}_2 + n_3\mathbf{a}_3,$  (5.7)

where the $n_i$ are integers. It follows from (5.4) that

$\mathbf{k} \cdot \mathbf{R} = 2\pi(k_1 n_1 + k_2 n_2 + k_3 n_3).$  (5.8)

For $e^{i\mathbf{k}\cdot\mathbf{R}}$ to be unity for all R (Eq. (5.2)), $\mathbf{k}\cdot\mathbf{R}$ must be $2\pi$ times an integer for any
choices of the integers $n_i$. This requires the coefficients $k_i$ to be integers. Thus the
condition (5.2) that K be a reciprocal lattice vector is satisfied by just those vectors
that are linear combinations (5.6) of the $\mathbf{b}_i$ with integral coefficients. Thus (compare
Eq. (4.1)) the reciprocal lattice is a Bravais lattice and the $\mathbf{b}_i$ can be taken as primitive
vectors.
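
The construction (5.3) and the orthogonality relation (5.4) are easy to check numerically. The following is a minimal sketch in plain Python; the helper names (`cross`, `dot`, `reciprocal_vectors`) and the particular direct-lattice vectors are illustrative assumptions, chosen only to be non-coplanar and with no special symmetry.

```python
import math

def cross(u, v):
    """Cross product of two 3-vectors given as tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def dot(u, v):
    """Dot product of two 3-vectors."""
    return sum(x * y for x, y in zip(u, v))

def reciprocal_vectors(a1, a2, a3):
    """Primitive reciprocal vectors b1, b2, b3 built as in Eq. (5.3)."""
    volume = dot(a1, cross(a2, a3))          # a1 . (a2 x a3), must be nonzero
    scale = 2.0 * math.pi / volume
    b1 = tuple(scale * c for c in cross(a2, a3))
    b2 = tuple(scale * c for c in cross(a3, a1))
    b3 = tuple(scale * c for c in cross(a1, a2))
    return b1, b2, b3

# Arbitrary (hypothetical) primitive vectors, not all in one plane.
a = ((1.0, 0.0, 0.0), (0.3, 1.2, 0.0), (0.1, 0.4, 0.9))
b = reciprocal_vectors(*a)

# b_i . a_j / (2 pi) should reproduce the Kronecker delta of Eq. (5.4).
gram = [[dot(b[i], a[j]) / (2.0 * math.pi) for j in range(3)] for i in range(3)]

# An integer combination K and a lattice vector R: K . R / (2 pi) is an integer,
# here 2*4 + (-1)*5 + 3*(-7) = -18, as Eq. (5.8) requires.
K = tuple(2 * b[0][c] - b[1][c] + 3 * b[2][c] for c in range(3))
R = tuple(4 * a[0][c] + 5 * a[1][c] - 7 * a[2][c] for c in range(3))
phase_over_2pi = dot(K, R) / (2.0 * math.pi)
```

Replacing the integer coefficients of K by nonintegers makes `phase_over_2pi` noninteger for suitable R, which is the content of the argument above.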


THE  RECIPROCAL  OF  THE  RECIPROCAL  LATTICE 

Since  the  reciprocal  lattice  is  itself  a  Bravais  lattice,  one  can  construct  its  reciprocal 
lattice.  This  turns  out  to  be  nothing  but  the  original  direct  lattice. 

One way to prove this is by constructing $\mathbf{c}_1$, $\mathbf{c}_2$, and $\mathbf{c}_3$ out of the $\mathbf{b}_i$ according to
the same formula (5.3) by which the $\mathbf{b}_i$ were constructed from the $\mathbf{a}_i$. It then follows
from simple vector identities (Problem 1) that $\mathbf{c}_i = \mathbf{a}_i$, $i = 1, 2, 3$.

A simpler proof follows from the observation that according to the basic definition
(5.2), the reciprocal of the reciprocal lattice is the set of all vectors G satisfying

$e^{i\mathbf{G}\cdot\mathbf{K}} = 1$  (5.9)

for all K in the reciprocal lattice. Since any direct lattice vector R has this property
(again by (5.2)), all direct lattice vectors are in the lattice reciprocal to the reciprocal
lattice. Furthermore, no other vectors can be, for a vector not in the direct lattice has
the form $\mathbf{r} = x_1\mathbf{a}_1 + x_2\mathbf{a}_2 + x_3\mathbf{a}_3$ with at least one nonintegral $x_i$. For that value
of i, $e^{i\mathbf{b}_i\cdot\mathbf{r}} = e^{2\pi i x_i} \neq 1$, and condition (5.9) is violated for the reciprocal lattice vector
$\mathbf{K} = \mathbf{b}_i$.


2 When $i \neq j$, Eq. (5.4) follows because the cross product of two vectors is normal to both. When
$i = j$, it follows because of the vector identity

$\mathbf{a}_1 \cdot (\mathbf{a}_2 \times \mathbf{a}_3) = \mathbf{a}_2 \cdot (\mathbf{a}_3 \times \mathbf{a}_1) = \mathbf{a}_3 \cdot (\mathbf{a}_1 \times \mathbf{a}_2).$

3 This is true for any three vectors not all in one plane. It is easy to verify that the $\mathbf{b}_i$ are not all in a
plane as long as the $\mathbf{a}_i$ are not.
IMPORTANT EXAMPLES

The simple cubic Bravais lattice, with cubic primitive cell of side a, has as its reciprocal
a simple cubic lattice with cubic primitive cell of side $2\pi/a$. This can be seen, for
example, from the construction (5.3), for if

$\mathbf{a}_1 = a\hat{\mathbf{x}}, \quad \mathbf{a}_2 = a\hat{\mathbf{y}}, \quad \mathbf{a}_3 = a\hat{\mathbf{z}},$  (5.10)

then

$\mathbf{b}_1 = \dfrac{2\pi}{a}\hat{\mathbf{x}}, \quad \mathbf{b}_2 = \dfrac{2\pi}{a}\hat{\mathbf{y}}, \quad \mathbf{b}_3 = \dfrac{2\pi}{a}\hat{\mathbf{z}}.$  (5.11)


The face-centered cubic Bravais lattice with conventional cubic cell of side a has
as its reciprocal a body-centered cubic lattice with conventional cubic cell of side
$4\pi/a$. This can be seen by applying the construction (5.3) to the fcc primitive vectors
(4.5). The result is

$\mathbf{b}_1 = \dfrac{4\pi}{a}\,\dfrac{1}{2}(\hat{\mathbf{y}} + \hat{\mathbf{z}} - \hat{\mathbf{x}}), \quad \mathbf{b}_2 = \dfrac{4\pi}{a}\,\dfrac{1}{2}(\hat{\mathbf{z}} + \hat{\mathbf{x}} - \hat{\mathbf{y}}), \quad \mathbf{b}_3 = \dfrac{4\pi}{a}\,\dfrac{1}{2}(\hat{\mathbf{x}} + \hat{\mathbf{y}} - \hat{\mathbf{z}}).$  (5.12)

This has precisely the form of the bcc primitive vectors (4.4), provided that the side
of the cubic cell is taken to be $4\pi/a$.

The body-centered cubic lattice with conventional cubic cell of side a has as its
reciprocal a face-centered cubic lattice with conventional cubic cell of side $4\pi/a$. This
can again be proved from the construction (5.3), but it also follows from the above
result for the reciprocal of the fcc lattice, along with the theorem that the reciprocal
of the reciprocal is the original lattice.
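
The fcc/bcc pairing can also be confirmed directly from the orthogonality relation (5.4), using only integer arithmetic. This sketch assumes the usual symmetric choice of fcc primitive vectors, (a/2)(0,1,1), (a/2)(1,0,1), (a/2)(1,1,0), and writes the vectors of (5.12) in units of $2\pi/a$; the variable names are hypothetical.

```python
# fcc primitive vectors in units of a/2 (assumed symmetric choice).
fcc_direct = [(0, 1, 1), (1, 0, 1), (1, 1, 0)]

# Candidate reciprocal vectors of Eq. (5.12) in units of 2*pi/a,
# i.e. bcc primitive vectors for a cubic cell of side 4*pi/a.
bcc_reciprocal = [(-1, 1, 1), (1, -1, 1), (1, 1, -1)]

def dot(u, v):
    """Dot product of two integer 3-vectors."""
    return sum(x * y for x, y in zip(u, v))

# b_i . a_j = (2*pi/a) * (a/2) * overlap[i][j] = pi * overlap[i][j],
# so Eq. (5.4) holds exactly when overlap[i][j] == 2 * delta_ij.
overlap = [[dot(b, a) for a in fcc_direct] for b in bcc_reciprocal]

for row in overlap:
    print(row)
```

Because the check is symmetric in the two sets of vectors apart from the overall scale, the same table shows that the reciprocal of a bcc lattice is fcc.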

It is left as an exercise for the reader to verify (Problem 2) that the reciprocal of a
simple hexagonal Bravais lattice with lattice constants c and a (Figure 5.1a) is another
simple hexagonal lattice with lattice constants $2\pi/c$ and $4\pi/\sqrt{3}a$ (Figure 5.1b), rotated
through 30° about the c-axis with respect to the direct lattice.4

Figure 5.1
(a) Primitive vectors for the simple hexagonal Bravais lattice. (b) Primitive vectors for
the lattice reciprocal to that generated by the primitive vectors in (a). The c and c* axes
are parallel. The a* axes are rotated by 30° with respect to the a axes in the plane
perpendicular to the c or c* axes. The reciprocal lattice is also simple hexagonal.


VOLUME  OF  THE  RECIPROCAL  LATTICE  PRIMITIVE  CELL 

If v is the volume5 of a primitive cell in the direct lattice, then the primitive cell of the
reciprocal lattice has a volume $(2\pi)^3/v$. This is proved in Problem 1.
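
As a consistency check (not the general proof, which is the subject of Problem 1), the result can be worked by hand for the cubic examples above. The symbol $v^*$ for the reciprocal primitive cell volume is notation introduced here, and the check uses the standard facts that a conventional fcc cell contains four lattice points and a conventional bcc cell contains two.

```latex
% Simple cubic, side a; the reciprocal lattice is simple cubic, side 2\pi/a:
v = a^3, \qquad v^* = \left(\frac{2\pi}{a}\right)^3 = \frac{(2\pi)^3}{v}.

% fcc, conventional side a (4 points per cell); the reciprocal lattice is bcc,
% conventional side 4\pi/a (2 points per cell):
v = \frac{a^3}{4}, \qquad
v^* = \frac{1}{2}\left(\frac{4\pi}{a}\right)^3 = \frac{32\pi^3}{a^3}
    = \frac{(2\pi)^3}{a^3/4} = \frac{(2\pi)^3}{v}.
```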


FIRST  BRILLOUIN  ZONE 

The  Wigner-Seitz  primitive  cell  (page  73)  of  the  reciprocal  lattice  is  known  as  the 
first  Brillouin  zone.  As  the  name  suggests,  one  also  defines  higher  Brillouin  zones, 
which  are  primitive  cells  of  a  different  type  that  arise  in  the  theory  of  electronic  levels 
in  a  periodic  potential.  They  are  described  in  Chapter  9. 

Although the terms "Wigner-Seitz cell" and "first Brillouin zone" refer to identical
geometrical constructions, in practice the latter term is applied only to the k-space
cell. In particular, when reference is made to the first Brillouin zone of a particular
r-space Bravais lattice (associated with a particular crystal structure), what is always
meant is the Wigner-Seitz cell of the associated reciprocal lattice. Thus, because the
reciprocal of the body-centered cubic lattice is face-centered cubic, the first Brillouin
zone of the bcc lattice (Figure 5.2a) is just the fcc Wigner-Seitz cell (Figure 4.16).
Conversely, the first Brillouin zone of the fcc lattice (Figure 5.2b) is just the bcc
Wigner-Seitz cell (Figure 4.15).

Figure 5.2
(a) The first Brillouin zone for the body-centered cubic lattice.
(b) The first Brillouin zone for the face-centered cubic lattice.


LATTICE  PLANES 

There  is  an  intimate  relation  between  vectors  in  the  reciprocal  lattice  and  planes  of 
points  in  the  direct  lattice.  This  relation  is  of  some  importance  in  understanding  the 
fundamental  role  the  reciprocal  lattice  plays  in  the  theory  of  diffraction,  and  will  be 
applied  to  that  problem  in  the  next  chapter.  Here  we  shall  describe  the  relation  in 
general  geometrical  terms. 


4 The hexagonal close-packed structure is not a Bravais lattice, and therefore the reciprocal lattice
used in the analysis of hcp solids is that of the simple hexagonal lattice (see footnote 1).

5  The  primitive  cell  volume  is  independent  of  the  choice  of  cell,  as  proved  in  Chapter  4. 
Given a particular Bravais lattice, a lattice plane is defined to be any plane
containing at least three noncollinear Bravais lattice points. Because of the translational
symmetry  of  the  Bravais  lattice,  any  such  plane  will  actually  contain  infinitely  many 
lattice  points,  which  form  a  two-dimensional  Bravais  lattice  within  the  plane.  Some 
lattice  planes  in  a  simple  cubic  Bravais  lattice  are  pictured  in  Figure  5.3. 


Figure  5.3 

Some  lattice  planes  (shaded)  in  a  simple  cubic  Bravais  lattice:  (a)  and  (b) 
show  two  different  ways  of  representing  the  lattice  as  a  family  of  lattice  planes. 

By  a  family  of  lattice  planes  we  mean  a  set  of  parallel,  equally  spaced  lattice  planes, 
which  together  contain  all  the  points  of  the  three-dimensional  Bravais  lattice.  Any 
lattice  plane  is  a  member  of  such  a  family.  Evidently  the  resolution  of  a  Bravais  lattice 
into  a  family  of  lattice  planes  is  far  from  unique  (Figure  5.3).  The  reciprocal  lattice 
provides  a  very  simple  way  to  classify  all  possible  families  of  lattice  planes,  which  is 
embodied  in  the  following  theorem: 

For any family of lattice planes separated by a distance d, there are reciprocal
lattice vectors perpendicular to the planes, the shortest of which have a length
of $2\pi/d$. Conversely, for any reciprocal lattice vector K, there is a family of lattice
planes normal to K and separated by a distance d, where $2\pi/d$ is the length of
the shortest reciprocal lattice vector parallel to K.

The theorem is a straightforward consequence of (a) the definition (5.2) of reciprocal
lattice vectors as the wave vectors of plane waves that are unity at all Bravais
lattice  sites  and  (b)  the  fact  that  a  plane  wave  has  the  same  value  at  all  points  lying  in 
a  family  of  planes  that  are  perpendicular  to  its  wave  vector  and  separated  by  an 
integral  number  of  wavelengths. 

To prove the first part of the theorem, given a family of lattice planes, let $\hat{\mathbf{n}}$ be a
unit vector normal to the planes. That $\mathbf{K} = 2\pi\hat{\mathbf{n}}/d$ is a reciprocal lattice vector follows
from the fact that the plane wave $e^{i\mathbf{K}\cdot\mathbf{r}}$ is constant in planes perpendicular to K and
has the same value in planes separated by $\lambda = 2\pi/K = d$. Since one of the lattice
planes contains the Bravais lattice point r = 0, $e^{i\mathbf{K}\cdot\mathbf{r}}$ must be unity for any point r in
any of the planes. Since the planes contain all Bravais lattice points, $e^{i\mathbf{K}\cdot\mathbf{R}} = 1$ for all
R, so that K is indeed a reciprocal lattice vector. Furthermore, K is the shortest
reciprocal lattice vector normal to the planes, for any wave vector shorter than K
will give a plane wave with wavelength greater than $2\pi/K = d$. Such a plane wave
cannot have the same value on all planes in the family, and therefore cannot give a
plane wave that is unity at all Bravais lattice points.

To prove the converse of the theorem, given a reciprocal lattice vector, let K be
the shortest parallel reciprocal lattice vector. Consider the set of real space planes on
which the plane wave $e^{i\mathbf{K}\cdot\mathbf{r}}$ has the value unity. These planes (one of which contains
the point r = 0) are perpendicular to K and separated by a distance $d = 2\pi/K$. Since
the Bravais lattice vectors R all satisfy $e^{i\mathbf{K}\cdot\mathbf{R}} = 1$ for any reciprocal lattice vector K,
they must all lie within these planes; i.e., the family of planes must contain within it
a family of lattice planes. Furthermore the spacing between the lattice planes is also
d (rather than some integral multiple of d), for if only every nth plane in the family
contained Bravais lattice points, then according to the first part of the theorem, the
vector normal to the planes of length $2\pi/nd$, i.e., the vector $\mathbf{K}/n$, would be a reciprocal
lattice vector. This would contradict our original assumption that no reciprocal
lattice vector parallel to K is shorter than K.
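
For a simple cubic lattice the theorem gives the plane spacing in closed form, which the following sketch evaluates. It assumes a simple cubic lattice of side a, so that reciprocal lattice vectors are $(2\pi/a)(h, k, l)$ with integer components; the function name `plane_spacing_sc` is a hypothetical helper. The common factor is divided out first because d is fixed by the shortest reciprocal lattice vector along the given direction.

```python
import math
from functools import reduce

def plane_spacing_sc(h, k, l, a=1.0):
    """Spacing d = 2*pi/|K| of the lattice planes normal to (h, k, l)
    in a simple cubic lattice of side a (assumed geometry)."""
    g = reduce(math.gcd, (abs(h), abs(k), abs(l)))
    if g == 0:
        raise ValueError("(h, k, l) must not be the zero vector")
    # Reduce to the shortest reciprocal lattice vector in this direction.
    h, k, l = h // g, k // g, l // g
    k_length = (2.0 * math.pi / a) * math.sqrt(h * h + k * k + l * l)
    return 2.0 * math.pi / k_length

d_100 = plane_spacing_sc(1, 0, 0)   # cube faces
d_110 = plane_spacing_sc(1, 1, 0)
d_220 = plane_spacing_sc(2, 2, 0)   # same direction as (1, 1, 0)
d_111 = plane_spacing_sc(1, 1, 1)
```

Here `d_220` equals `d_110`: a longer parallel reciprocal lattice vector does not define a new family of lattice planes.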

MILLER  INDICES  OF  LATTICE  PLANES 

The  correspondence  between  reciprocal  lattice  vectors  and  families  of  lattice  planes 
provides  a  convenient  way  to  specify  the  orientation  of  a  lattice  plane.  Quite  generally 
one  describes  the  orientation  of  a  plane  by  giving  a  vector  normal  to  the  plane.  Since 
we know there are reciprocal lattice vectors normal to any family of lattice planes, it
is  natural  to  pick  a  reciprocal  lattice  vector  to  represent  the  normal.  To  make  the 
choice  unique,  one  uses  the  shortest  such  reciprocal  lattice  vector.  In  this  way  one 
arrives  at  the  Miller  indices  of  the  plane: 

The Miller indices of a lattice plane are the coordinates of the shortest reciprocal
lattice vector normal to that plane, with respect to a specified set of primitive reciprocal
lattice vectors. Thus a plane with Miller indices h, k, l is normal to the reciprocal
lattice vector $h\mathbf{b}_1 + k\mathbf{b}_2 + l\mathbf{b}_3$.

As  so  defined,  the  Miller  indices  are  integers,  since  any  reciprocal  lattice  vector  is 
a  linear  combination  of  three  primitive  vectors  with  integral  coefficients.  Since  the 
normal  to  the  plane  is  specified  by  the  shortest  perpendicular  reciprocal  lattice  vector, 
the  integers  h,  k,  l  can  have  no  common  factor.  Note  also  that  the  Miller  indices 
depend  on  the  particular  choice  of  primitive  vectors. 

In  simple  cubic  Bravais  lattices  the  reciprocal  lattice  is  also  simple  cubic  and  the 
Miller  indices  are  the  coordinates  of  a  vector  normal  to  the  plane  in  the  obvious 
cubic coordinate system. As a general rule, face-centered and body-centered cubic
Bravais lattices are described in terms of a conventional cubic cell, i.e., as simple cubic
lattices with bases. Since any lattice plane in an fcc or bcc lattice is also a lattice plane
in  the  underlying  simple  cubic  lattice,  the  same  elementary  cubic  indexing  can  be 
used  to  specify  lattice  planes.  In  practice,  it  is  only  in  the  description  of  noncubic 
crystals  that  one  must  remember  that  the  Miller  indices  are  the  coordinates  of  the 
normal  in  a  system  given  by  the  reciprocal  lattice,  rather  than  the  direct  lattice. 

The Miller indices of a plane have a geometrical interpretation in the direct lattice,
which is sometimes offered as an alternative way of defining them. Because a lattice
plane with Miller indices h, k, l is perpendicular to the reciprocal lattice vector $\mathbf{K} =
h\mathbf{b}_1 + k\mathbf{b}_2 + l\mathbf{b}_3$, it will be contained in the continuous plane $\mathbf{K}\cdot\mathbf{r} = A$, for
suitable choice of the constant A. This plane intersects the axes determined by the
direct lattice primitive vectors $\mathbf{a}_i$ at the points $x_1\mathbf{a}_1$, $x_2\mathbf{a}_2$, and $x_3\mathbf{a}_3$ (Figure 5.4),
where the $x_i$ are determined by the condition that $x_i\mathbf{a}_i$ indeed satisfy the equation of
the plane: $\mathbf{K}\cdot(x_i\mathbf{a}_i) = A$. Since $\mathbf{K}\cdot\mathbf{a}_1 = 2\pi h$, $\mathbf{K}\cdot\mathbf{a}_2 = 2\pi k$, and $\mathbf{K}\cdot\mathbf{a}_3 = 2\pi l$, it
follows that

$x_1 = \dfrac{A}{2\pi h}, \quad x_2 = \dfrac{A}{2\pi k}, \quad x_3 = \dfrac{A}{2\pi l}.$  (5.13)

Thus the intercepts with the crystal axes of a lattice plane are inversely proportional
to the Miller indices of the plane.


Figure  5.4 

An  illustration  of  the  crystallographic  definition  of  the  Miller  indices  of 
a  lattice  plane.  The  shaded  plane  can  be  a  portion  of  the  continuous  plane 
in  which  the  points  of  the  lattice  plane  lie,  or  any  plane  parallel  to  the 
lattice plane. The Miller indices are inversely proportional to the $x_i$.


Crystallographers  put  the  cart  before  the  horse,  defining  the  Miller  indices  to  be  a 
set  of  integers  with  no  common  factors,  inversely  proportional  to  the  intercepts  of 
the  crystal  plane  along  the  crystal  axes: 


$h : k : l = \dfrac{1}{x_1} : \dfrac{1}{x_2} : \dfrac{1}{x_3}.$  (5.14)
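
The crystallographic recipe translates into a few lines of exact arithmetic. This is a sketch under stated assumptions: intercepts are given in units of the corresponding primitive vectors, a plane parallel to an axis is encoded as `None` (an intercept at infinity, a convention introduced here), and `miller_from_intercepts` is a hypothetical helper name. The overall sign of the result is a matter of convention.

```python
from fractions import Fraction
from functools import reduce
from math import gcd

def miller_from_intercepts(x1, x2, x3):
    """Integers with no common factor, proportional to 1/x1 : 1/x2 : 1/x3.

    Each x_i is the intercept along a_i in units of a_i; None means the
    plane never cuts that axis, which gives a zero index.
    """
    reciprocals = [Fraction(0) if x is None else 1 / Fraction(x)
                   for x in (x1, x2, x3)]
    # Least common multiple of the denominators clears all fractions.
    common = reduce(lambda m, f: m * f.denominator // gcd(m, f.denominator),
                    reciprocals, 1)
    integers = [int(f * common) for f in reciprocals]
    g = reduce(gcd, (abs(n) for n in integers))
    if g == 0:
        raise ValueError("at least one intercept must be finite")
    return tuple(n // g for n in integers)

print(miller_from_intercepts(1, -2, 4))        # intercepts 1, -2, 4
print(miller_from_intercepts(1, None, None))   # plane normal to a_1
```

With intercepts (1, -2, 4) the reciprocals 1, -1/2, 1/4 scale to the integers 4, -2, 1.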


SOME  CONVENTIONS  FOR  SPECIFYING  DIRECTIONS 

Lattice planes are usually specified by giving their Miller indices in parentheses:
(h, k, l). Thus, in a cubic system, a plane with a normal (4, -2, 1) (or, from the
crystallographic viewpoint, a plane with intercepts (1, -2, 4) along cubic axes) is called a
(4, -2, 1) plane. The commas are eliminated without confusion by writing $\bar{n}$ instead
of $-n$, simplifying the description to $(4\bar{2}1)$. One must know what set of axes is being
used to interpret these symbols unambiguously. Simple cubic axes are invariably used
when the crystal has cubic symmetry. Some examples of planes in cubic crystals are
shown in Figure 5.5.

A similar convention is used to specify directions in the direct lattice, but to avoid
confusion with the Miller indices (directions in the reciprocal lattice) square brackets
are used instead of parentheses. Thus the body diagonal of a simple cubic lattice lies
in the [111] direction and, in general, the lattice point $n_1\mathbf{a}_1 + n_2\mathbf{a}_2 + n_3\mathbf{a}_3$ lies in
the direction $[n_1 n_2 n_3]$ from the origin.

There is also a notation specifying both a family of lattice planes and all those
other families that are equivalent to it by virtue of the symmetry of the crystal. Thus
the (100), (010), and (001) planes are all equivalent in a cubic crystal. One refers to
them collectively as the {100} planes, and in general one uses {hkl} to refer to the
(hkl) planes and all those that are equivalent to them by virtue of the crystal symmetry.
A similar convention is used with directions: the [100], [010], [001], $[\bar{1}00]$, $[0\bar{1}0]$,
and $[00\bar{1}]$ directions in a cubic crystal are referred to, collectively, as the $\langle 100 \rangle$
directions.

Figure 5.5
Three lattice planes and their Miller indices in a simple cubic Bravais lattice.

This  concludes  our  general  geometrical  discussion  of  the  reciprocal  lattice.  In 
Chapter  6  we  shall  see  an  important  example  of  the  utility  and  the  power  of  the 
concept  in  the  theory  of  the  diffraction  of  X  rays  by  a  crystal. 


PROBLEMS 


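1. (a) Prove that the reciprocal lattice primitive vectors defined in (5.3) satisfy

$\mathbf{b}_1 \cdot (\mathbf{b}_2 \times \mathbf{b}_3) = \dfrac{(2\pi)^3}{\mathbf{a}_1 \cdot (\mathbf{a}_2 \times \mathbf{a}_3)}.$  (5.15)

(Hint: Write $\mathbf{b}_1$ (but not $\mathbf{b}_2$ or $\mathbf{b}_3$) in terms of the $\mathbf{a}_i$, and use the orthogonality relations (5.4).)

(b) Suppose primitive vectors are constructed from the $\mathbf{b}_i$ in the same manner (Eq. (5.3)) as
the $\mathbf{b}_i$ are constructed from the $\mathbf{a}_i$. Prove that these vectors are just the $\mathbf{a}_i$ themselves; i.e., show that

$2\pi \dfrac{\mathbf{b}_2 \times \mathbf{b}_3}{\mathbf{b}_1 \cdot (\mathbf{b}_2 \times \mathbf{b}_3)} = \mathbf{a}_1$, etc.  (5.16)

(Hint: Write $\mathbf{b}_3$ in the numerator (but not $\mathbf{b}_2$) in terms of the $\mathbf{a}_i$, use the vector identity
$\mathbf{A} \times (\mathbf{B} \times \mathbf{C}) = \mathbf{B}(\mathbf{A} \cdot \mathbf{C}) - \mathbf{C}(\mathbf{A} \cdot \mathbf{B})$, and appeal to the orthogonality relations (5.4) and the result
(5.15) above.)

(c) Prove that the volume of a Bravais lattice primitive cell is

$v = |\mathbf{a}_1 \cdot (\mathbf{a}_2 \times \mathbf{a}_3)|,$  (5.17)

where the $\mathbf{a}_i$ are three primitive vectors. (In conjunction with (5.15) this establishes that the volume
of the reciprocal lattice primitive cell is $(2\pi)^3/v$.)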
2. (a) Using the primitive vectors given in Eq. (4.9) and the construction (5.3) (or by any other
method) show that the reciprocal of the simple hexagonal Bravais lattice is also simple hexagonal,
with lattice constants $2\pi/c$ and $4\pi/\sqrt{3}a$, rotated through 30° about the c-axis with respect to the
direct lattice.

(b) For what value of c/a does the ratio have the same value in both direct and reciprocal
lattices? If c/a is ideal in the direct lattice, what is its value in the reciprocal lattice?

(c) The Bravais lattice generated by three primitive vectors of equal length a, making equal
angles $\theta$ with one another, is known as the trigonal Bravais lattice (see Chapter 7). Show that the
reciprocal of a trigonal Bravais lattice is also trigonal, with an angle $\theta^*$ given by $-\cos\theta^* =
\cos\theta/[1 + \cos\theta]$, and a primitive vector length $a^*$, given by $a^* = (2\pi/a)(1 + 2\cos\theta\cos\theta^*)^{-1/2}$.

3. (a) Show that the density of lattice points (per unit area) in a lattice plane is d/v, where v is
the  primitive  cell  volume  and  d  the  spacing  between  neighboring  planes  in  the  family  to  which  the 
given  plane  belongs. 

(b)  Prove  that  the  lattice  planes  with  the  greatest  densities  of  points  are  the  {111}  planes  in 
a face-centered cubic Bravais lattice and the {110} planes in a body-centered cubic Bravais lattice.
(Hint:  This  is  most  easily  done  by  exploiting  the  relation  between  families  of  lattice  planes  and 
reciprocal  lattice  vectors.) 

4.  Prove  that  any  reciprocal  lattice  vector  K  is  an  integral  multiple  of  the  shortest  parallel 
reciprocal  lattice  vector  K0.  (Hint:  Assume  the  contrary,  and  deduce  that  since  the  reciprocal 
lattice  is  a  Bravais  lattice,  there  must  be  a  reciprocal  lattice  vector  parallel  to  K  shorter  than  K0.) 
Typical interatomic distances in a solid are on the order of an angstrom ($10^{-8}$ cm).
An electromagnetic probe of the microscopic structure of a solid must therefore have
a wavelength at least this short, corresponding to an energy of order

$\hbar\omega = \dfrac{hc}{\lambda} = \dfrac{hc}{10^{-8}\ \text{cm}} \approx 12.3 \times 10^3$ eV.
(6.1) 


Energies  like  this,  on  the  order  of  several  thousands  of  electron  volts  (kilovolts  or 
keV),  are  characteristic  X-ray  energies. 

In  this  chapter  we  shall  describe  how  the  distribution  of  X  rays  scattered  by  a 
rigid,1  periodic2  array  of  ions  reveals  the  locations  of  the  ions  within  that  structure. 
There  are  two  equivalent  ways  to  view  the  scattering  of  X  rays  by  a  perfect  periodic 
structure,  due  to  Bragg  and  to  von  Laue.  Both  viewpoints  are  still  widely  used.  The 
von Laue approach, which exploits the reciprocal lattice, is closer to the spirit of
modern solid state physics, but the Bragg approach is still in wide use by X-ray
crystallographers. Both are described below, together with a proof of their equivalence.


BRAGG  FORMULATION  OF  X-RAY  DIFFRACTION  BY  A  CRYSTAL 

In 1913 W. H. and W. L. Bragg found that substances whose macroscopic forms
were crystalline gave remarkably characteristic patterns of reflected X-radiation,
quite unlike those produced by liquids. In crystalline materials, for certain sharply
defined wavelengths and incident directions, intense peaks of scattered radiation
(now known as Bragg peaks) were observed.

W. L. Bragg accounted for this by regarding a crystal as made out of parallel planes
of ions spaced a distance d apart (i.e., the lattice planes described in Chapter 5). The
conditions for a sharp peak in the intensity of the scattered radiation were: (1) that
the X rays should be specularly reflected3 by the ions in any one plane and (2) that the
reflected rays from successive planes should interfere constructively. Rays specularly
reflected from adjoining planes are shown in Figure 6.1. The path difference between
the two rays is just $2d \sin\theta$, where $\theta$ is the angle of incidence.4 For the rays to interfere
constructively, this path difference must be an integral number of wavelengths, leading
to the celebrated Bragg condition:

$n\lambda = 2d \sin\theta.$  (6.2)
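
To make the condition concrete, the sketch below lists the angles at which a fixed wavelength is reflected from planes of spacing d, for each integer n that admits a solution. The function name `bragg_angles`, the cutoff `max_order`, and the numerical values are illustrative assumptions; the wavelength and d must be in the same units, and the angle is measured from the plane, as in the text.

```python
import math

def bragg_angles(wavelength, d, max_order=10):
    """Return (n, theta in degrees) pairs satisfying n*wavelength = 2*d*sin(theta).

    Only integers n with n*wavelength <= 2*d have a solution, so the
    loop stops at the first n for which sin(theta) would exceed 1.
    """
    solutions = []
    for n in range(1, max_order + 1):
        sin_theta = n * wavelength / (2.0 * d)
        if sin_theta > 1.0:
            break
        solutions.append((n, math.degrees(math.asin(sin_theta))))
    return solutions

# Hypothetical example: wavelength equal to the plane spacing.
peaks = bragg_angles(wavelength=1.0, d=1.0)
for n, theta in peaks:
    print(n, round(theta, 2))
```

A wavelength longer than 2d gives an empty list, which is the quantitative form of the requirement that the probe wavelength be comparable to or shorter than interatomic distances.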


The  integer  n  is  known  as  the  order  of  the  corresponding  reflection.  For  a  beam 
of  X  rays  containing  a  range  of  different  wavelengths  (“white  radiation”)  many 
different  reflections  are  observed.  Not  only  can  one  have  higher-order  reflections 
from  a  given  set  of  lattice  planes,  but  in  addition  one  must  recognize  that  there  are 

1  Actually  the  ions  vibrate  about  their  ideal  equilibrium  sites  (Chapters  21  -26).  This  does  not  affect 
the  conclusions  reached  in  this  chapter  (though  in  the  early  days  of  X-ray  diffraction  it  was  not  clear  why 
such  vibrations  did  not  obliterate  the  pattern  characteristic  of  a  periodic  structure).  It  turns  out  that 
the  vibrations  have  two  main  consequences  (see  Appendix  N):  (a)  the  intensity  in  the  characteristic  peaks 
that reveal the crystal structure is diminished, but not eliminated; and (b) a much weaker continuous
background of radiation (the "diffuse background") is produced.

2  Amorphous  solids  and  liquids  have  about  the  same  density  as  crystalline  solids,  and  are  therefore 
also susceptible to probing with X rays. However, the discrete, sharp peaks of scattered radiation
characteristic of crystals are not found.

3 In specular reflection the angle of incidence equals the angle of reflection.

4 The angle of incidence in X-ray crystallography is conventionally measured from the plane of
reflection rather than from the normal to that plane (as in classical optics). Note that $\theta$ is just half the
angle of deflection of the incident beam (Figure 6.2).
Figure 6.1

A  Bragg  reflection  from  a  particular 
family  of  lattice  planes,  separated  by  a 
distance  d.  Incident  and  reflected  rays  are 
shown  for  the  two  neighboring  planes. 
The path difference is $2d \sin\theta$.



Figure  6.2 

The Bragg angle $\theta$ is just half the total angle by which the incident
beam  is  deflected. 


Figure  6.3 

The  same  portion  of  Bravais  lattice  shown 
in  Figure  6.1,  with  a  different  resolution 
into  lattice  planes  indicated.  The  incident 
ray  is  the  same  as  in  Figure  6.1,  but  both 
the  direction  (shown  in  the  figure)  and 
wavelength  (determined  by  the  Bragg 
condition  (6.2)  with  d  replaced  by  d')  of 
the  reflected  ray  are  different  from  the 
reflected  ray  in  Figure  6.1.  Reflections 
are  possible,  in  general,  for  any  of  the 
infinitely  many  ways  of  resolving  the 
lattice  into  planes. 

many  different  ways  of  sectioning  the  crystal  into  planes,  each  of  which  will  itself 
produce  further  reflections  (see,  for  example,  Figure  5.3  or  Figure  6.3). 

VON  LAUE  FORMULATION  OF  X-RAY  DIFFRACTION 
BY  A  CRYSTAL 

The von Laue approach differs from the Bragg approach in that no particular sectioning
of the crystal into lattice planes is singled out, and no ad hoc assumption of
specular  reflection  is  imposed.5  Instead  one  regards  the  crystal  as  composed  of 


5 The Bragg assumption of specular reflection is, however, equivalent to the assumption that rays
scattered from individual ions within each lattice plane interfere constructively. Thus both the Bragg and
the von Laue approaches are based on the same physical assumptions, and their precise equivalence (see
page 99) is to be expected.
Figure 6.4

Illustrating  that  the  path  difference  for  rays 
scattered  from  two  points  separated  by  d  is 
given  by  Eq.  (6.3)  or  (6.4). 


identical  microscopic  objects  (sets  of  ions  or  atoms)  placed  at  the  sites  R  of  a  Bravais 
lattice,  each  of  which  can  reradiate  the  incident  radiation  in  all  directions.  Sharp 
peaks  will  be  observed  only  in  directions  and  at  wavelengths  for  which  the  rays 
scattered  from  all  lattice  points  interfere  constructively. 

To  find  the  condition  for  constructive  interference,  consider  first  just  two  scatterers, 
separated  by  a  displacement  vector  d  (Figure  6.4).  Let  an  X  ray  be  incident  from 
very far away, along a direction $\hat{\mathbf{n}}$, with wavelength $\lambda$, and wave vector $\mathbf{k} = 2\pi\hat{\mathbf{n}}/\lambda$.
A scattered ray will be observed in a direction $\hat{\mathbf{n}}'$ with wavelength6 $\lambda$ and wave vector
$\mathbf{k}' = 2\pi\hat{\mathbf{n}}'/\lambda$, provided that the path difference between the rays scattered by each of
the two ions is an integral number of wavelengths. From Figure 6.4 it can be seen
that this path difference is just

$$d \cos\theta + d \cos\theta' = \mathbf{d} \cdot (\hat{\mathbf{n}} - \hat{\mathbf{n}}'). \tag{6.3}$$

The condition for constructive interference is thus

$$\mathbf{d} \cdot (\hat{\mathbf{n}} - \hat{\mathbf{n}}') = m\lambda, \tag{6.4}$$

for integral m. Multiplying both sides of (6.4) by $2\pi/\lambda$ yields a condition on the incident
and scattered wave vectors:

$$\mathbf{d} \cdot (\mathbf{k} - \mathbf{k}') = 2\pi m, \tag{6.5}$$

for integral m.

Next,  we  consider  not  just  two  scatterers,  but  an  array  of  scatterers,  at  the  sites  of 
a  Bravais  lattice.  Since  the  lattice  sites  are  displaced  from  one  another  by  the  Bravais 
lattice  vectors  R,  the  condition  that  all  scattered  rays  interfere  constructively  is  that 
condition  (6.5)  hold  simultaneously  for  all  values  of  d  that  are  Bravais  lattice  vectors: 

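$$\mathbf{R} \cdot (\mathbf{k} - \mathbf{k}') = 2\pi m, \quad \text{for integral } m \text{ and all Bravais lattice vectors } \mathbf{R}. \tag{6.6}$$

This can be written in the equivalent form

$$e^{i(\mathbf{k}' - \mathbf{k}) \cdot \mathbf{R}} = 1, \quad \text{for all Bravais lattice vectors } \mathbf{R}. \tag{6.7}$$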


6 Here (and in the Bragg picture) we assume that the incident and scattered radiation has the same
wavelength. In terms of photons this means that no energy has been lost in the scattering, i.e., that the
scattering is elastic. To a good approximation the bulk of the scattered radiation is elastically scattered,
though there is much to be learned from the study of that small component of the radiation that is
inelastically scattered (Chapter 24 and Appendix N).
Comparing this condition with the definition (5.2) of the reciprocal lattice, we
arrive  at  the  Laue  condition  that  constructive  interference  will  occur  provided  that 
the  change  in  wave  vector,  K  =  k'  —  k,  is  a  vector  of  the  reciprocal  lattice. 

It is sometimes convenient to have an alternative formulation of the Laue condition,
stated entirely in terms of the incident wave vector k. First note that because
the reciprocal lattice is a Bravais lattice, if k' - k is a reciprocal lattice vector, so is
k - k'. Calling the latter vector K, the condition that k and k' have the same magnitude
is

$$k = |\mathbf{k} - \mathbf{K}|. \tag{6.8}$$

Squaring both sides of (6.8) yields the condition

$$\mathbf{k} \cdot \hat{\mathbf{K}} = \tfrac{1}{2} K; \tag{6.9}$$

i.e.,  the  component  of  the  incident  wave  vector  k  along  the  reciprocal  lattice  vector 
K  must  be  half  the  length  of  K. 

Thus  an  incident  wave  vector  k  will  satisfy  the  Laue  condition  if  and  only  if  the 
tip  of  the  vector  lies  in  a  plane  that  is  the  perpendicular  bisector  of  a  line  joining  the 
origin of k-space to a reciprocal lattice point K (Figure 6.5). Such k-space planes
are  called  Bragg  planes. 
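The Bragg-plane form of the Laue condition can be checked numerically. The following is a minimal Python sketch, assuming a simple cubic lattice of side a = 1 (arbitrary units) and a hand-picked incident wave vector; the helper names `dot`, `norm`, and `on_bragg_plane` are hypothetical and chosen only for this illustration. It tests Eq. (6.9), then confirms that k' = k - K has the same length as k and that Eq. (6.7) holds for a few lattice vectors R.

```python
import cmath
import math

def dot(u, v):
    """Euclidean dot product of two 3-vectors."""
    return sum(a * b for a, b in zip(u, v))

def norm(u):
    """Euclidean length of a 3-vector."""
    return math.sqrt(dot(u, u))

def on_bragg_plane(k, K, tol=1e-9):
    """True if the component of k along K is half the length of K, Eq. (6.9)."""
    K_len = norm(K)
    return abs(dot(k, K) / K_len - K_len / 2) <= tol * K_len

a = 1.0                                   # assumed cubic cell side
K = (2 * math.pi / a, 0.0, 0.0)           # a simple cubic reciprocal lattice vector
k = (math.pi / a, 2.0, 0.5)               # tip lies on the Bragg plane of K
k_prime = tuple(ki - Ki for ki, Ki in zip(k, K))   # convention of (6.8): K = k - k'

# Elastic scattering: k and k' have the same magnitude.
elastic = math.isclose(norm(k), norm(k_prime))

# Eq. (6.7): exp[i (k' - k) . R] = 1 for Bravais lattice vectors R.
change = tuple(kp - ki for kp, ki in zip(k_prime, k))
lattice_vectors = [(a, 0.0, 0.0), (0.0, a, 0.0), (a, 2 * a, -3 * a)]
phases_ok = all(abs(cmath.exp(1j * dot(change, R)) - 1) < 1e-9
                for R in lattice_vectors)
```

A wave vector chosen at random, such as (1.0, 0.3, 0.2), fails `on_bragg_plane` for this K, in line with the remark that a general k satisfies no Laue condition.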


Figure  6.5 

The Laue condition. If the sum of k and -k'
is a vector K, and if k and k' have the same
length, then the tip of the vector k is equidistant
from the origin O and the tip of the
vector  K,  and  therefore  it  lies  in  the  plane 
bisecting  the  line  joining  the  origin  to  the  tip 
of  K. 


It  is  a  consequence  of  the  equivalence  of  the  Bragg  and  von  Laue  points  of  view, 
demonstrated in the following section, that the k-space Bragg plane associated with
a  particular  diffraction  peak  in  the  Laue  formulation  is  parallel  to  the  family  of  direct 
lattice  planes  responsible  for  the  peak  in  the  Bragg  formulation. 

EQUIVALENCE  OF  THE  BRAGG  AND  VON  LAUE  FORMULATIONS 

The  equivalence  of  these  two  criteria  for  constructive  interference  of  X  rays  by  a 
crystal  follows  from  the  relation  between  vectors  of  the  reciprocal  lattice  and  families 
of  direct  lattice  planes  (see  Chapter  5).  Suppose  the  incident  and  scattered  wave 
vectors,  k  and  k',  satisfy  the  Laue  condition  that  K  =  k'  —  k  be  a  reciprocal  lattice 
vector.  Because  the  incident  and  scattered  waves  have  the  same  wavelength,6  k'  and 
k  have  the  same  magnitudes.  It  follows  (see  Figure  6.6)  that  k'  and  k  make  the  same 
angle $\theta$ with the plane perpendicular to K. Therefore the scattering can be viewed
as a Bragg reflection, with Bragg angle $\theta$, from the family of direct lattice planes
perpendicular to the reciprocal lattice vector K.

Figure  6.6 

The plane of the paper contains the incident wave
vector k, the reflected wave vector k', and their difference
K satisfying the Laue condition. Since the scattering
is elastic (k' = k), the direction of K bisects the
angle between k and k'. The dashed line is the intersection
of the plane perpendicular to K with the plane
of the paper.


To  demonstrate  that  this  reflection  satisfies  the  Bragg  condition  (6.2),  note  that 
the vector K is an integral multiple7 of the shortest reciprocal lattice vector $\mathbf{K}_0$ parallel
to K. According to the theorem on page 90, the magnitude of $\mathbf{K}_0$ is just $2\pi/d$, where
d is the distance between successive planes in the family perpendicular to $\mathbf{K}_0$ or to
K. Thus

$$K = \frac{2\pi n}{d}. \tag{6.10}$$

On the other hand, it follows from Figure 6.6 that $K = 2k \sin\theta$, and thus

$$k \sin\theta = \frac{\pi n}{d}. \tag{6.11}$$

Since $k = 2\pi/\lambda$, Eq. (6.11) implies that the wavelength satisfies the Bragg condition
(6.2).

Thus a Laue diffraction peak corresponding to a change in wave vector given by the
reciprocal lattice vector K corresponds to a Bragg reflection from the family of direct
lattice  planes  perpendicular  to  K.  The  order,  n,  of  the  Bragg  reflection  is  just  the  length 
of  K  divided  by  the  length  of  the  shortest  reciprocal  lattice  vector  parallel  to  K. 
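The chain of steps (6.10) to (6.11) can be followed numerically. The sketch below is an illustration only, assuming a simple cubic lattice so that every integer triple (h, k, l) labels a reciprocal lattice vector (2π/a)(h, k, l); the function name `laue_to_bragg` and the sample numbers are hypothetical. It takes an incident wave vector and a change in wave vector K = k' - k, checks the Laue condition, and returns the order n, the plane spacing d, sin θ, and the wavelength.

```python
import math
from functools import reduce

def laue_to_bragg(k, hkl, a):
    """Translate a Laue peak into Bragg language for a simple cubic lattice.

    k   : incident wave vector (3-tuple)
    hkl : integer indices of K = k' - k = (2*pi/a) * (h, k, l)
    a   : side of the cubic cell
    Returns (n, d, sin_theta, wavelength).
    """
    K = [2 * math.pi / a * m for m in hkl]
    K_sq = sum(c * c for c in K)
    k_sq = sum(c * c for c in k)
    # Elastic scattering |k + K| = |k| requires k . K = -K^2 / 2.
    if not math.isclose(sum(x * y for x, y in zip(k, K)), -K_sq / 2, rel_tol=1e-9):
        raise ValueError("k and K do not satisfy the Laue condition")
    n = reduce(math.gcd, (abs(m) for m in hkl))   # K = n * K0, K0 shortest parallel vector
    K_len, k_len = math.sqrt(K_sq), math.sqrt(k_sq)
    d = 2 * math.pi / (K_len / n)                 # |K0| = 2*pi/d
    sin_theta = K_len / (2 * k_len)               # K = 2 k sin(theta)
    wavelength = 2 * math.pi / k_len
    return n, d, sin_theta, wavelength

# Assumed example: a = 1, K = (2*pi/a)(2, 0, 0), k chosen so that k . K = -K^2/2.
n, d, sin_theta, wavelength = laue_to_bragg((-2 * math.pi, 3.0, 0.0), (2, 0, 0), 1.0)
bragg_lhs = n * wavelength          # n * lambda
bragg_rhs = 2 * d * sin_theta       # 2 d sin(theta)
```

For this input the two sides of the Bragg condition (6.2) agree to rounding error, with n = 2 and d = a.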

Since  the  reciprocal  lattice  associated  with  a  given  Bravais  lattice  is  far  more  easily 
visualized  than  the  set  of  all  possible  planes  into  which  the  Bravais  lattice  can  be 
resolved, the Laue condition for diffraction peaks is far simpler to work with
than  the  Bragg  condition.  In  the  rest  of  this  chapter  we  shall  apply  the  Laue  condition 
to  a  description  of  three  of  the  most  important  ways  in  which  X-ray  crystallographic 
analyses  of  real  samples  are  performed,  and  to  a  discussion  of  how  one  can  extract 
information not only about the underlying Bravais lattice, but also about the arrangement
of ions within the primitive cell.


EXPERIMENTAL  GEOMETRIES  SUGGESTED  BY  THE 
LAUE  CONDITION 

An incident wave vector k will lead to a diffraction peak (or "Bragg reflection") if
and  only  if  the  tip  of  the  wave  vector  lies  on  a  k-space  Bragg  plane.  Since  the  set  of  all 


7  This  is  an  elementary  consequence  of  the  fact  that  the  reciprocal  lattice  is  a  Bravais  lattice.  See 
Chapter  5,  Problem  4. 


Bragg  planes  is  a  discrete  family  of  planes,  it  cannot  begin  to  fill  up  three-dimensional 
k-space, and in general the tip of k will not lie on a Bragg plane. Thus for a fixed
incident  wave  vector — i.e.,  for  a  fixed  X-ray  wavelength  and  fixed  incident  direction 
relative  to  the  crystal  axes — there  will  be  in  general  no  diffraction  peaks  at  all. 

If  one  wishes  to  search  experimentally  for  Bragg  peaks  one  must  therefore  relax 
the  constraint  of  fixed  k,  either  varying  the  magnitude  of  k  (i.e.,  varying  the  wavelength 
of  the  incident  beam)  or  varying  its  direction  (in  practice,  varying  the  orientation  of 
the  crystal  with  respect  to  the  incident  direction). 

The  Ewald  Construction 

A  simple  geometric  construction  due  to  Ewald  is  of  great  help  in  visualizing  these 
various  methods  and  in  deducing  the  crystal  structure  from  the  peaks  so  observed. 
We draw in k-space a sphere centered on the tip of the incident wave vector k of radius
k  (so  that  it  passes  through  the  origin).  Evidently  (see  Figure  6.7)  there  will  be  some 
wave  vector  k'  satisfying  the  Laue  condition  if  and  only  if  some  reciprocal  lattice 
point  (in  addition  to  the  origin)  lies  on  the  surface  of  the  sphere,  in  which  case  there 
will  be  a  Bragg  reflection  from  the  family  of  direct  lattice  planes  perpendicular  to 
that  reciprocal  lattice  vector. 
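The Ewald construction translates directly into a search over reciprocal lattice points. The following is a minimal sketch, assuming a simple cubic reciprocal lattice of side 2π/a and a finite index range; the function name `ewald_hits`, the cutoff `max_index`, and the two sample wave vectors are assumptions made for illustration. A point G lies on the sphere when its distance from the tip of k equals k.

```python
import math
from itertools import product

def ewald_hits(k, a, max_index=3, tol=1e-9):
    """Indices (h, k, l) of reciprocal lattice points on the Ewald sphere.

    The sphere is centered on the tip of the incident wave vector k and has
    radius |k|, so it passes through the origin (which is excluded here).
    """
    k_len = math.sqrt(sum(c * c for c in k))
    hits = []
    for hkl in product(range(-max_index, max_index + 1), repeat=3):
        if hkl == (0, 0, 0):
            continue
        G = [2 * math.pi / a * m for m in hkl]
        distance = math.sqrt(sum((g - c) ** 2 for g, c in zip(G, k)))
        if abs(distance - k_len) <= tol * k_len:
            hits.append(hkl)
    return hits

# A specially chosen k (assumed): three reciprocal lattice points lie on the sphere.
special = ewald_hits((math.pi, math.pi, 0.0), a=1.0)
# A generic k (assumed): no point other than the origin lies on the sphere.
generic = ewald_hits((1.0, 0.3, 0.2), a=1.0)
```

Since no point farther than 2k from the origin can lie on the sphere, a modest `max_index` suffices whenever 2k is smaller than (2π/a) times that cutoff.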

Figure  6.7 

The  Ewald  construction.  Given  the 
incident  wave  vector  k,  a  sphere  of 
radius k is drawn about the point k.
Diffraction peaks corresponding to reciprocal
lattice vectors K will be observed
only if K gives a reciprocal lattice
point on the surface of the sphere. Such
a  reciprocal  lattice  vector  is  indicated  in 
the  figure,  together  with  the  wave  vector 
k'  of  the  Bragg  reflected  ray. 


In general, a sphere in k-space with the origin on its surface will have no other
reciprocal lattice points on its surface, and therefore the Ewald construction confirms
our observation that for a general incident wave vector there will be no Bragg
peaks.  One  can,  however,  ensure  that  some  Bragg  peaks  will  be  produced  by  several 
techniques: 

1. The Laue Method. One can continue to scatter from a single crystal of fixed
orientation from a fixed incident direction $\hat{\mathbf{n}}$, but can search for Bragg peaks by
using not a monochromatic X-ray beam, but one containing wavelengths from
$\lambda_1$ up to $\lambda_0$. The Ewald sphere will then expand into the region contained between
the two spheres determined by $\mathbf{k}_0 = 2\pi\hat{\mathbf{n}}/\lambda_0$ and $\mathbf{k}_1 = 2\pi\hat{\mathbf{n}}/\lambda_1$, and Bragg peaks
will  be  observed  corresponding  to  any  reciprocal  lattice  vectors  lying  within  this 
region  (Figure  6.8).  By  making  the  spread  in  wavelengths  sufficiently  large,  one 


Figure  6.8 

The  Ewald  construction  for 
the  Laue  method.  The  crystal 
and  incident  X-ray  direction 
are  fixed,  and  a  continuous 
range of wavelengths, corresponding
to wave vectors
between $k_0$ and $k_1$ in magnitude,
is present. The Ewald
spheres for all incident wave
vectors fill the shaded region
between the sphere centered
on the tip of the vector $\mathbf{k}_0$
and that centered on the tip
of $\mathbf{k}_1$. Bragg peaks will be
observed corresponding to
all reciprocal lattice points
lying within the shaded region.
(For simplicity in illustration,
the incident direction
has been taken to lie in a
lattice plane, and only reciprocal
lattice points lying in
that  plane  are  shown.) 


can  be  sure  of  finding  some  reciprocal  lattice  points  within  the  region;  whereas 
by  keeping  it  from  getting  too  large,  one  can  avoid  too  many  Bragg  reflections, 
thereby  keeping  the  picture  fairly  simple. 

The  Laue  method  is  probably  best  suited  for  determining  the  orientation  of 
a  single  crystal  specimen  whose  structure  is  known,  since,  for  example,  if  the 
incident  direction  lies  along  a  symmetry  axis  of  the  crystal,  the  pattern  of  spots 
produced  by  the  Bragg  reflected  rays  will  have  the  same  symmetry.  Since  solid 
state  physicists  generally  do  study  substances  of  known  crystal  structure,  the 
Laue  method  is  probably  the  one  of  greatest  practical  interest. 

2. The Rotating-Crystal Method. This method uses monochromatic X rays, but
allows  the  angle  of  incidence  to  vary.  In  practice  the  direction  of  the  X-ray  beam 
is  kept  fixed,  and  the  orientation  of  the  crystal  varied  instead.  In  the  rotating 
crystal  method  the  crystal  is  rotated  about  some  fixed  axis,  and  all  Bragg  peaks 
that  occur  during  the  rotation  are  recorded  on  a  film.  As  the  crystal  rotates,  the 
reciprocal  lattice  it  determines  will  rotate  by  the  same  amount  about  the  same 
axis.  Thus  the  Ewald  sphere  (which  is  determined  by  the  fixed  incident  wave 
vector k) is fixed in k-space, while the entire reciprocal lattice rotates about the
axis  of  rotation  of  the  crystal.  During  this  rotation  each  reciprocal  lattice  point 
traverses  a  circle  about  the  rotation  axis,  and  a  Bragg  reflection  occurs  whenever 
this  circle  intersects  the  Ewald  sphere.  This  is  illustrated  in  Figure  6.9  for  a 
particularly  simple  geometry. 

3. The Powder or Debye-Scherrer Method. This is equivalent to a rotating-crystal
experiment  in  which,  in  addition,  the  axis  of  rotation  is  varied  over  all  possible 
orientations.  In  practice  this  isotropic  averaging  of  the  incident  direction  is 
Figure 6.9

The  Ewald  construction  for  the  rotating-crystal  method.  For 
simplicity  a  case  is  shown  in  which  the  incident  wave  vector  lies 
in  a  lattice  plane,  and  the  axis  of  rotation  is  perpendicular  to  that 
plane.  The  concentric  circles  are  the  orbits  swept  out  under  the 
rotation by the reciprocal lattice vectors lying in the plane perpendicular
to the axis containing k. Each intersection of such a
circle  with  the  Ewald  sphere  gives  the  wave  vector  of  a  Bragg 
reflected  ray.  (Additional  Bragg  reflected  wave  vectors  associated 
with  reciprocal  lattice  vectors  in  other  planes  are  not  shown.) 

achieved  by  using  a  polycrystalline  sample  or  a  powder,  grains  of  which  are  still 
enormous  on  the  atomic  scale  and  therefore  capable  of  diffracting  X  rays.  Because 
the  crystal  axes  of  the  individual  grains  are  randomly  oriented,  the  diffraction 
pattern  produced  by  such  a  powder  is  what  one  would  produce  by  combining 
the  diffraction  patterns  for  all  possible  orientations  of  a  single  crystal. 

The Bragg reflections are now determined by fixing the incident k vector, and
with  it  the  Ewald  sphere,  and  allowing  the  reciprocal  lattice  to  rotate  through  all 
possible  angles  about  the  origin,  so  that  each  reciprocal  lattice  vector  K  generates 
a  sphere  of  radius  K  about  the  origin.  Such  a  sphere  will  intersect  the  Ewald 
sphere  in  a  circle  (Figure  6.10a)  provided  that  K  is  less  than  2k.  The  vector 
joining  any  point  on  such  a  circle  with  the  tip  of  the  incident  vector  k  is  a  wave 
vector  k',  for  which  scattered  radiation  will  be  observed.  Thus  each  reciprocal 
lattice  vector  of  length  less  than  2k  generates  a  cone  of  scattered  radiation  at  an 
angle $\phi$ to the forward direction, where (Figure 6.10b)

$$K = 2k \sin \tfrac{1}{2}\phi. \tag{6.12}$$
Figure 6.10

The  Ewald  construction  for  the  powder  method,  (a)  The  Ewald  sphere  is  the  smaller  sphere. 
It  is  centered  on  the  tip  of  the  incident  wave  vector  k  with  radius  k,  so  that  the  origin  O  is  on  its 
surface.  The  larger  sphere  is  centered  on  the  origin  and  has  a  radius  K.  The  two  spheres  intersect 
in a circle (foreshortened to an ellipse). Bragg reflections will occur for any wave vector k' connecting
any point on the circle of intersection to the tip of the vector k. The scattered rays
therefore lie on the cone that opens in the direction opposite to k. (b) A plane section of (a),
containing the incident wave vector. The triangle is isosceles, and thus $K = 2k \sin \tfrac{1}{2}\phi$.

By measuring the angles $\phi$ at which Bragg reflections are observed, one therefore
learns the lengths of all reciprocal lattice vectors shorter than 2k. Armed with this
information, some facts about the macroscopic crystal symmetry, and the fact that
the reciprocal lattice is a Bravais lattice, one can usually construct the reciprocal
lattice itself (see, for example, Problem 1).
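Relation (6.12) can be turned around to predict where the powder rings fall for a known lattice. The sketch below is an illustration under stated assumptions: a simple cubic Bravais lattice (reciprocal lattice simple cubic with side 2π/a), and made-up values a = 3.0 Å and λ = 1.5 Å; the function name `powder_ring_angles` is hypothetical. It collects the distinct lengths K < 2k and converts each to an angle φ.

```python
import math

def powder_ring_angles(a, wavelength):
    """Ring angles phi (degrees) for a simple cubic lattice of side a.

    Every reciprocal lattice vector K = (2*pi/a)(h, k, l) with K < 2k
    contributes a cone at angle phi, where K = 2 k sin(phi/2), Eq. (6.12).
    """
    # K < 2k is equivalent to h^2 + k^2 + l^2 < (2a/wavelength)^2,
    # so no index can exceed 2a/wavelength in magnitude.
    max_index = int(2 * a / wavelength)
    squares = set()
    for h in range(-max_index, max_index + 1):
        for k_ in range(-max_index, max_index + 1):
            for l in range(-max_index, max_index + 1):
                N = h * h + k_ * k_ + l * l
                if N > 0 and N * wavelength ** 2 < 4 * a * a:
                    squares.add(N)
    # sin(phi/2) = K / (2k) = sqrt(N) * wavelength / (2a)
    return [math.degrees(2 * math.asin(math.sqrt(N) * wavelength / (2 * a)))
            for N in sorted(squares)]

rings = powder_ring_angles(a=3.0, wavelength=1.5)   # assumed sample values
```

For a lattice described with a basis, some of these rings are absent because the structure factor vanishes; this sketch does not account for that.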


DIFFRACTION BY A MONATOMIC LATTICE WITH A BASIS;
THE GEOMETRICAL STRUCTURE FACTOR

The  preceding  discussion  was  based  on  the  condition  (6.7)  that  rays  scattered  from 
each  primitive  cell  should  interfere  constructively.  If  the  crystal  structure  is  that  of 
a  monatomic  lattice  with  an  n-atom  basis  (for  example,  carbon  in  the  diamond 
structure  or  hexagonal  close-packed  beryllium,  both  of  which  have  n  =  2),  then  the 
contents  of  each  primitive  cell  can  be  further  analyzed  into  a  set  of  identical  scatterers 
at positions $\mathbf{d}_1, \ldots, \mathbf{d}_n$ within the cell. The intensity of radiation in a given Bragg peak
will  depend  on  the  extent  to  which  the  rays  scattered  from  these  basis  sites  interfere 
with  one  another,  being  greatest  when  there  is  complete  constructive  interference  and 
vanishing  altogether  should  there  happen  to  be  complete  destructive  interference. 

If the Bragg peak is associated with a change in wave vector k' - k = K, then the
phase difference (Figure 6.4) between the rays scattered at $\mathbf{d}_i$ and $\mathbf{d}_j$ will be $\mathbf{K} \cdot (\mathbf{d}_i - \mathbf{d}_j)$
and the amplitudes of the two rays will differ by a factor $e^{i\mathbf{K} \cdot (\mathbf{d}_i - \mathbf{d}_j)}$. Thus the
amplitudes of the rays scattered at $\mathbf{d}_1, \ldots, \mathbf{d}_n$ are in the ratios $e^{i\mathbf{K} \cdot \mathbf{d}_1}, \ldots, e^{i\mathbf{K} \cdot \mathbf{d}_n}$. The net
ray scattered by the entire primitive cell is the sum of the individual rays, and will
therefore have an amplitude containing the factor

$$S_{\mathbf{K}} = \sum_{j=1}^{n} e^{i\mathbf{K} \cdot \mathbf{d}_j}. \tag{6.13}$$

The quantity $S_{\mathbf{K}}$, known as the geometrical structure factor, expresses the extent
to which interference of the waves scattered from identical ions within the basis can
diminish the intensity of the Bragg peak associated with the reciprocal lattice vector
K. The intensity in the Bragg peak, being proportional to the square of the absolute
value of the amplitude, will contain a factor $|S_{\mathbf{K}}|^2$. It is important to note that this
is not the only source of K dependence to the intensity. Further dependence on the
change in wave vector comes both from the ordinary angular dependence of any
electromagnetic scattering and from the influence on the scattering of the detailed
internal structure of each individual ion in the basis. Therefore the structure factor
alone cannot be used to predict the absolute intensity in a Bragg peak.8 It can,
however, lead to a characteristic dependence on K that is easily discerned even though
other less distinctive K dependences have been superimposed upon it. The one case
in which the structure factor can be used with assurance is when it vanishes. This
occurs  when  the  elements  of  the  basis  are  so  arranged  that  there  is  complete  destructive 
interference  for  the  K  in  question;  in  that  case  no  features  of  the  rays  scattered  by 
the  individual  basis  elements  can  prevent  the  net  ray  from  vanishing. 

We  illustrate  the  importance  of  a  vanishing  structure  factor  in  two  cases9: 

1. Body-Centered Cubic Considered as Simple Cubic with a Basis. Since the body-centered
cubic lattice is a Bravais lattice, we know that Bragg reflections will occur
when the change in wave vector K is a vector of the reciprocal lattice, which is face-centered
cubic. Sometimes, however, it is convenient to regard the bcc lattice as a
simple cubic lattice generated by primitive vectors $a\hat{\mathbf{x}}$, $a\hat{\mathbf{y}}$, and $a\hat{\mathbf{z}}$ with a two-point
basis consisting of $\mathbf{d}_1 = 0$ and $\mathbf{d}_2 = (a/2)(\hat{\mathbf{x}} + \hat{\mathbf{y}} + \hat{\mathbf{z}})$. From this point of view the
reciprocal lattice is also simple cubic, with a cubic cell of side $2\pi/a$. However, there
will now be a structure factor $S_{\mathbf{K}}$ associated with each Bragg reflection. In the present
case, (6.13) gives

$$S_{\mathbf{K}} = 1 + \exp[i\mathbf{K} \cdot \tfrac{1}{2}a(\hat{\mathbf{x}} + \hat{\mathbf{y}} + \hat{\mathbf{z}})]. \tag{6.14}$$

A general vector in the simple cubic reciprocal lattice has the form

$$\mathbf{K} = \frac{2\pi}{a}(n_1\hat{\mathbf{x}} + n_2\hat{\mathbf{y}} + n_3\hat{\mathbf{z}}). \tag{6.15}$$

Substituting this into (6.14), we find a structure factor

$$S_{\mathbf{K}} = 1 + e^{i\pi(n_1 + n_2 + n_3)} = 1 + (-1)^{n_1 + n_2 + n_3} = \begin{cases} 2, & n_1 + n_2 + n_3 \text{ even}, \\ 0, & n_1 + n_2 + n_3 \text{ odd}. \end{cases} \tag{6.16}$$


8  A  brief  but  thorough  discussion  of  the  scattering  of  electromagnetic  radiation  by  crystals,  including 
the  derivation  of  detailed  intensity  formulas  for  the  various  experimental  geometries  described  above,  is 
given  by  Landau  and  Lifshitz,  Electrodynamics  of  Continuous  Media,  Chapter  15,  Addison-Wesley, 
Reading,  Mass.,  1966. 

9 Further examples are given in Problems 2 and 3.
Thus those points in the simple cubic reciprocal lattice, the sum of whose coordinates
with respect to the cubic primitive vectors is odd, will actually have no
Bragg  reflection  associated  with  them.  This  converts  the  simple  cubic  reciprocal  lattice 
into  the  face-centered  cubic  structure  that  we  would  have  had  if  we  had  treated  the 
body-centered  cubic  direct  lattice  as  a  Bravais  lattice  rather  than  as  a  lattice  with 
a  basis  (see  Figure  6.11). 


Figure  6.11 

Points in the simple cubic reciprocal lattice of side $2\pi/a$,
for  which  the  structure  factor  (6.16)  vanishes,  are  those 
(white  circles)  that  can  be  reached  from  the  origin  by 
moving  along  an  odd  number  of  nearest-neighbor  bonds. 
When  such  sites  are  eliminated,  the  remaining  sites 
(black  circles)  constitute  a  face-centered  cubic  lattice 
with cubic cell of side $4\pi/a$.


Thus  if,  either  inadvertently  or  for  reasons  of  greater  symmetry  in  description,  one 
chooses  to  describe  a  Bravais  lattice  as  a  lattice  with  a  basis,  one  still  recovers  the 
correct  description  of  X-ray  diffraction,  provided  that  the  vanishing  of  the  structure 
factor  is  taken  into  account. 
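The selection rule (6.16) can be reproduced by evaluating the sum (6.13) directly. The following minimal sketch assumes a cell side a = 1 (arbitrary units); the function names `structure_factor` and `S_bcc_as_sc` are hypothetical and not taken from the text.

```python
import cmath
import math

def structure_factor(K, basis):
    """Geometrical structure factor S_K = sum_j exp(i K . d_j), Eq. (6.13)."""
    return sum(cmath.exp(1j * sum(Ki * di for Ki, di in zip(K, d)))
               for d in basis)

a = 1.0                                          # assumed cubic cell side
basis = [(0.0, 0.0, 0.0), (a / 2, a / 2, a / 2)]  # bcc viewed as simple cubic with a basis

def S_bcc_as_sc(n1, n2, n3):
    """S_K for K = (2*pi/a)(n1, n2, n3), the simple cubic reciprocal lattice (6.15)."""
    K = tuple(2 * math.pi / a * n for n in (n1, n2, n3))
    return structure_factor(K, basis)

# Indices with an odd sum give (numerically) zero; an even sum gives 2.
odd_sum = S_bcc_as_sc(1, 0, 0)
even_sum = S_bcc_as_sc(1, 1, 0)
```

The surviving points, those with n1 + n2 + n3 even, are the face-centered cubic reciprocal lattice of the bcc Bravais lattice.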

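2. Monatomic Diamond Lattice. The monatomic diamond lattice (carbon, silicon,
germanium, or grey tin) is not a Bravais lattice and must be described as a lattice
with a basis. The underlying Bravais lattice is face-centered cubic, and the basis can
be taken to be $\mathbf{d}_1 = 0$, $\mathbf{d}_2 = (a/4)(\hat{\mathbf{x}} + \hat{\mathbf{y}} + \hat{\mathbf{z}})$, where $\hat{\mathbf{x}}$, $\hat{\mathbf{y}}$, and $\hat{\mathbf{z}}$ are along the cubic
axes and a is the side of the conventional cubic cell. The reciprocal lattice is body-centered
cubic with conventional cubic cell of side $4\pi/a$. If we take as primitive
vectors

$$\mathbf{b}_1 = \frac{2\pi}{a}(\hat{\mathbf{y}} + \hat{\mathbf{z}} - \hat{\mathbf{x}}), \quad \mathbf{b}_2 = \frac{2\pi}{a}(\hat{\mathbf{z}} + \hat{\mathbf{x}} - \hat{\mathbf{y}}), \quad \mathbf{b}_3 = \frac{2\pi}{a}(\hat{\mathbf{x}} + \hat{\mathbf{y}} - \hat{\mathbf{z}}), \tag{6.17}$$

then the structure factor (6.13) for $\mathbf{K} = \sum n_i \mathbf{b}_i$ is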

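$$S_{\mathbf{K}} = 1 + \exp[\tfrac{1}{2} i\pi(n_1 + n_2 + n_3)] = \begin{cases} 2, & n_1 + n_2 + n_3 \text{ twice an even number}, \\ 1 \pm i, & n_1 + n_2 + n_3 \text{ odd}, \\ 0, & n_1 + n_2 + n_3 \text{ twice an odd number}. \end{cases} \tag{6.18}$$

To interpret these conditions on $\sum n_i$ geometrically, note that if we substitute (6.17)
into $\mathbf{K} = \sum n_i \mathbf{b}_i$, we can write the general reciprocal lattice vector in the form

$$\mathbf{K} = \frac{4\pi}{a}(\nu_1\hat{\mathbf{x}} + \nu_2\hat{\mathbf{y}} + \nu_3\hat{\mathbf{z}}), \tag{6.19}$$

where

$$\nu_j = \tfrac{1}{2}(n_1 + n_2 + n_3) - n_j, \qquad \sum_{j=1}^{3} \nu_j = \tfrac{1}{2}(n_1 + n_2 + n_3). \tag{6.20}$$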

We know (see Chapter 5) that the reciprocal to the fcc lattice with cubic cell of side
a is a bcc lattice with cubic cell of side $4\pi/a$. Let us regard this as composed of two
simple cubic lattices of side $4\pi/a$. The first, containing the origin (K = 0), must
have all $\nu_i$ integers (according to (6.19)) and must therefore be given by K with
$n_1 + n_2 + n_3$ even (according to (6.20)). The second, containing the "body-centered
point" $(4\pi/a)\tfrac{1}{2}(\hat{\mathbf{x}} + \hat{\mathbf{y}} + \hat{\mathbf{z}})$, must have all $\nu_i$ integers $+ \tfrac{1}{2}$ (according to (6.19)) and
must therefore be given by K with $n_1 + n_2 + n_3$ odd (according to (6.20)).

Comparing this with (6.18), we find that the points with structure factor $1 \pm i$
are those in the simple cubic sublattice of "body-centered" points. Those whose
structure factor S is 2 or 0 are in the simple cubic sublattice containing the origin,
where $\sum \nu_i$ is even when S = 2 and odd when S = 0. Thus the points with zero structure
factor are again removed by applying the construction illustrated in Figure 6.11
to the simple cubic sublattice containing the origin, converting it to a face-centered


Figure  6.12 

The body-centered cubic lattice with cubic cell side
$4\pi/a$ that is reciprocal to a face-centered cubic lattice
with cubic cell side a. When the fcc lattice is that underlying
the diamond structure, then the white circles
indicate sites with zero structure factor. (The black
circles are sites with structure factor 2, and the gray ones
are sites with structure factor $1 \pm i$.)


cubic  structure  (Figure  6.12). 
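The three cases of (6.18) can be verified by building K from the primitive vectors (6.17) and summing over the two-point basis. This is a minimal sketch, assuming a conventional cell side a = 1 (arbitrary units); the function name `diamond_structure_factor` is hypothetical.

```python
import cmath
import math

def diamond_structure_factor(n1, n2, n3, a=1.0):
    """S_K of Eq. (6.13) for K = n1*b1 + n2*b2 + n3*b3, with b_i from Eq. (6.17)."""
    c = 2 * math.pi / a
    b1 = (-c, c, c)     # (2*pi/a)(y + z - x)
    b2 = (c, -c, c)     # (2*pi/a)(z + x - y)
    b3 = (c, c, -c)     # (2*pi/a)(x + y - z)
    K = tuple(n1 * u + n2 * v + n3 * w for u, v, w in zip(b1, b2, b3))
    basis = [(0.0, 0.0, 0.0), (a / 4, a / 4, a / 4)]   # d1 = 0, d2 = (a/4)(x + y + z)
    return sum(cmath.exp(1j * sum(Ki * di for Ki, di in zip(K, d)))
               for d in basis)

S_twice_even = diamond_structure_factor(1, 1, 2)   # n1 + n2 + n3 = 4
S_odd = diamond_structure_factor(1, 0, 0)          # n1 + n2 + n3 = 1
S_twice_odd = diamond_structure_factor(1, 1, 0)    # n1 + n2 + n3 = 2
```

Up to rounding error the three values are 2, 1 + i, and 0, matching the three branches of (6.18); an index sum of 3 gives 1 - i.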


DIFFRACTION BY A POLYATOMIC CRYSTAL;
THE ATOMIC FORM FACTOR

If  the  ions  in  the  basis  are  not  identical,  the  structure  factor  (6.13)  assumes  the  form 

$$S_{\mathbf{K}} = \sum_{j=1}^{n} f_j(\mathbf{K}) e^{i\mathbf{K} \cdot \mathbf{d}_j}, \tag{6.21}$$

where $f_j$, known as the atomic form factor, is entirely determined by the internal
structure of the ion that occupies position $\mathbf{d}_j$ in the basis. Identical ions have identical
form  factors  (regardless  of  where  they  are  placed),  so  (6.21)  reduces  back  to  (6.13), 
multiplied  by  the  common  value  of  the  form  factors,  in  the  monatomic  case. 

In  elementary  treatments  the  atomic  form  factor  associated  with  a  Bragg  reflection 
given by the reciprocal lattice vector K is taken to be proportional to the Fourier
transform of the electronic charge distribution of the corresponding ion10:

$$f_j(\mathbf{K}) = -\frac{1}{e} \int d\mathbf{r}\, e^{i\mathbf{K} \cdot \mathbf{r}} \rho_j(\mathbf{r}). \tag{6.22}$$

Thus the atomic form factor $f_j$ depends on K and on the detailed features of the
charge distribution of the ion that occupies position $\mathbf{d}_j$ in the basis. As a result, one
would not expect the structure factor to vanish for any K unless there is some fortuitous
relation between form factors of different types. By making reasonable
assumptions about the K dependence of the different form factors, one can often
distinguish quite conclusively between various possible crystal structures on the basis
of the variation with K of the Bragg peak intensities (see, for example, Problem 5).
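The effect of unequal form factors in (6.21) can be seen in a small numerical sketch. It assumes a simple cubic lattice with two different ions at 0 and (a/2)(x + y + z), and it treats the form factors as K-independent constants with made-up values 3.0 and 1.0; real form factors vary with K as in (6.22). The function name `polyatomic_structure_factor` is hypothetical.

```python
import cmath
import math

def polyatomic_structure_factor(K, basis, form_factors):
    """S_K = sum_j f_j exp(i K . d_j), Eq. (6.21), with constant f_j (an assumption)."""
    return sum(f * cmath.exp(1j * sum(Ki * di for Ki, di in zip(K, d)))
               for f, d in zip(form_factors, basis))

a = 1.0                                            # assumed cubic cell side
basis = [(0.0, 0.0, 0.0), (a / 2, a / 2, a / 2)]
K_odd = (2 * math.pi / a, 0.0, 0.0)                # n1 + n2 + n3 odd
K_even = (2 * math.pi / a, 2 * math.pi / a, 0.0)   # n1 + n2 + n3 even

# Different ions: the odd-sum peak is weakened but does not vanish.
S_odd_unequal = polyatomic_structure_factor(K_odd, basis, (3.0, 1.0))
S_even_unequal = polyatomic_structure_factor(K_even, basis, (3.0, 1.0))
# Identical ions: (6.21) reduces to (6.13) times the common form factor.
S_odd_equal = polyatomic_structure_factor(K_odd, basis, (3.0, 3.0))
```

With these assumed values the odd-sum and even-sum peaks have structure factors 2 and 4, while equal form factors restore the exact cancellation of (6.16).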

This concludes our discussion of the Bragg reflection of X rays. Our analysis has
exploited no properties of the X rays other than their wave nature.¹¹ Consequently
we shall find many of the concepts and results of this chapter reappearing in subsequent
discussions of other wave phenomena in solids, such as electrons (Chapter 9)
and neutrons (Chapter 24).¹²

PROBLEMS 


1.  Powder  specimens  of  three  different  monatomic  cubic  crystals  are  analyzed  with  a  Debye- 
Scherrer  camera.  It  is  known  that  one  sample  is  face-centered  cubic,  one  is  body-centered  cubic, 
and  one  has  the  diamond  structure.  The  approximate  positions  of  the  first  four  diffraction  rings 
in  each  case  are  (see  Figure  6.13): 

VALUES OF φ FOR SAMPLES

      A          B          C
    42.2°      28.8°      42.8°
    49.2°      41.0°      73.2°
    72.0°      50.8°      89.0°
    87.3°      59.6°     115.0°

(a)  Identify  the  crystal  structures  of  A,  B,  and  C. 

(b) If the wavelength of the incident X-ray beam is 1.5 Å, what is the length of the side of the
conventional cubic cell in each case?

(c)  If  the  diamond  structure  were  replaced  by  a  zincblende  structure  with  a  cubic  unit  cell 
of  the  same  side,  at  what  angles  would  the  first  four  rings  now  occur? 


10  The electronic charge density $\rho_j(\mathbf{r})$ is that of an ion of type j placed at r = 0; thus the contribution
of the ion at $\mathbf{R} + \mathbf{d}_j$ to the electronic charge density of the crystal is $\rho_j(\mathbf{r} - [\mathbf{R} + \mathbf{d}_j])$. (The electronic
charge is usually factored out of the atomic form factor to make it dimensionless.)

11  As  a  result  we  have  been  unable  to  make  precise  statements  about  the  absolute  intensity  of  the 
Bragg  peaks,  or  about  the  diffuse  background  of  radiation  in  directions  not  allowed  by  the  Bragg  condition. 

12  Considered quantum mechanically, a particle of momentum p can be viewed as a wave of
wavelength λ = h/p.

Figure 6.13

Schematic view of a Debye-Scherrer camera.
Diffraction peaks are recorded on the film strip.


2.  It  is  often  convenient  to  represent  a  face-centered  cubic  Bravais  lattice  as  simple  cubic,  with 
a  cubic  primitive  cell  of  side  a  and  a  four-point  basis. 

(a) Show that the structure factor (6.13) is then either 4 or 0 at all points of the simple cubic
reciprocal lattice.

(b) Show that when points with zero structure factor are removed, the remaining points of
the reciprocal lattice make up a body-centered cubic lattice with conventional cell of side 4π/a.
Why is this to be expected?

3.  (a) Show that the structure factor for a monatomic hexagonal close-packed crystal structure
can take on any of the six values $1 + e^{in\pi/3}$, n = 1, ..., 6, as K ranges through the points of the
simple hexagonal reciprocal lattice.

(b)  Show  that  all  reciprocal  lattice  points  have  nonvanishing  structure  factor  in  the  plane 
perpendicular  to  the  c-axis  containing  K  =  0. 

(c) Show that points of zero structure factor are found in alternate planes in the family of
reciprocal lattice planes perpendicular to the c-axis.

(d)  Show  that  in  such  a  plane  the  point  that  is  displaced  from  K  =  0  by  a  vector  parallel  to 
the  c-axis  has  zero  structure  factor. 

(e)  Show  that  the  removal  of  all  points  of  zero  structure  factor  from  such  a  plane  reduces  the 
triangular  network  of  reciprocal  lattice  points  to  a  honeycomb  array  (Figure  4.3). 

4.  Consider a lattice with an n-ion basis. Suppose that the ith ion in the basis, when translated
to r = 0, can be regarded as composed of $m_i$ point particles of charge $-z_{ij}e$, located at positions
$\mathbf{b}_{ij}$, $j = 1, \ldots, m_i$.

(a) Show that the atomic form factor $f_i$ is given by

$$f_i = \sum_{j=1}^{m_i} z_{ij}\, e^{i\mathbf{K}\cdot\mathbf{b}_{ij}}. \qquad (6.23)$$

(b) Show that the total structure factor (6.21) implied by (6.23) is identical to the structure
factor one would have found if the lattice were equivalently described as having a basis of
$m_1 + \cdots + m_n$ point ions.

5.  (a) The sodium chloride structure (Figure 4.24) can be regarded as an fcc Bravais
lattice of cube side a, with a basis consisting of a positively charged ion at the origin and a negatively
charged ion at $(a/2)\hat{\mathbf{x}}$. The reciprocal lattice is body-centered cubic, and the general reciprocal
lattice vector has the form (6.19), with all the coefficients $\nu_i$ either integers or integers + 1/2. If the
atomic form factors for the two ions are $f_+$ and $f_-$, show that the structure factor is $S_{\mathbf{K}} = f_+ + f_-$
if the $\nu_i$ are integers, and $f_+ - f_-$ if the $\nu_i$ are integers + 1/2. (Why does S vanish in the latter
case when $f_+ = f_-$?)

(b) The zincblende structure (Figure 4.18) is also a face-centered cubic Bravais lattice of cube
side a, with a basis consisting of a positively charged ion at the origin and a negatively charged
ion at $(a/4)(\hat{\mathbf{x}} + \hat{\mathbf{y}} + \hat{\mathbf{z}})$. Show that the structure factor $S_{\mathbf{K}}$ is $f_+ \pm i f_-$ if the $\nu_i$ are integers + 1/2,
$f_+ + f_-$ if the $\nu_i$ are integers and $\sum \nu_i$ is even, and $f_+ - f_-$ if the $\nu_i$ are integers and $\sum \nu_i$ is odd.

(c) Suppose that a cubic crystal is known to be composed of closed-shell (and hence spherically
symmetric) ions, so that $f_\pm(\mathbf{K})$ depends only on the magnitude of K. The positions of the Bragg
peaks reveal that the Bravais lattice is face-centered cubic. Discuss how one might determine,
from the structure factors associated with the Bragg peaks, whether the crystal structure was
likely to be of the sodium chloride or zincblende type.


In Chapters 4 and 5, only the translational symmetries of Bravais lattices were
described and exploited. For example, the existence and basic properties of the
reciprocal lattice depend only on the existence of three primitive direct lattice vectors
$\mathbf{a}_i$, and not on any special relations that may hold among them.¹ The translational
symmetries are by far the most important for the general theory of solids. It is nevertheless
clear from examples already described that Bravais lattices do fall naturally
into categories on the basis of symmetries other than translational. Simple hexagonal
Bravais lattices, for example, regardless of the c/a ratio, bear a closer resemblance
to one another than they do to any of the three types of cubic Bravais lattice we
have described.

It is the subject of crystallography to make such distinctions systematic and
precise.² Here we shall only indicate the basis for the rather elaborate crystallographic
classifications,  giving  some  of  the  major  categories  and  the  language  by  which  they 
are  described.  In  most  applications  what  matters  are  the  features  of  particular  cases, 
rather  than  a  systematic  general  theory,  so  few  solid  state  physicists  need  master  the 
full  analysis  of  crystallography.  Indeed,  the  reader  with  little  taste  for  the  subject  can 
skip  this  chapter  entirely  with  little  loss  in  understanding  what  follows,  referring 
back  to  it  on  occasion  for  the  elucidation  of  arcane  terms. 


THE  CLASSIFICATION  OF  BRAVAIS  LATTICES 

The problem of classifying all possible crystal structures is too complex to approach
directly, and we first consider only the classification of Bravais lattices.³ From the
point of view of symmetry, a Bravais lattice is characterized by the specification of
all rigid operations⁴ that take the lattice into itself. This set of operations is known
as the symmetry group or space group of the Bravais lattice.⁵

The operations in the symmetry group of a Bravais lattice include all translations
through lattice vectors. In addition, however, there will in general be rotations,
reflections, and inversions⁶ that take the lattice into itself. A cubic Bravais lattice,
for example, is taken into itself by a rotation through 90° about a line of lattice points
in a ⟨100⟩ direction, a rotation through 120° about a line of lattice points in a ⟨111⟩
direction, reflection of all points in a {100} lattice plane, etc.; a simple hexagonal
Bravais lattice is taken into itself by a rotation through 60° about a line of lattice
points parallel to the c-axis, reflection in a lattice plane perpendicular to the c-axis, etc.
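
The cubic examples can be checked directly. The sketch below is an illustration rather than part of the original text: it writes three of the operations as integer matrices acting on simple cubic lattice points (in units of the lattice constant) and tests them on a finite block of points. The names `ROT90_Z`, `ROT120_111`, `MIRROR_X`, `apply_op`, and `maps_lattice_into_itself` are hypothetical.

```python
import itertools

# Point operations of a simple cubic lattice, written as integer matrices
# acting on coordinates measured in units of the lattice constant.
ROT90_Z = ((0, -1, 0), (1, 0, 0), (0, 0, 1))    # 90 degrees about [001]
ROT120_111 = ((0, 0, 1), (1, 0, 0), (0, 1, 0))  # 120 degrees about [111]
MIRROR_X = ((-1, 0, 0), (0, 1, 0), (0, 0, 1))   # reflection in the plane x = 0

def apply_op(op, r):
    """Apply a 3x3 matrix (given as a tuple of rows) to the point r."""
    return tuple(sum(row[k] * r[k] for k in range(3)) for row in op)

def maps_lattice_into_itself(op, extent=3):
    """Check, on a finite cube of integer points centered on the origin, that
    op permutes the points and preserves their distance from the origin."""
    points = set(itertools.product(range(-extent, extent + 1), repeat=3))
    images = {apply_op(op, r) for r in points}
    same_set = images == points
    same_length = all(
        sum(x * x for x in apply_op(op, r)) == sum(x * x for x in r)
        for r in points
    )
    return same_set and same_length

for name, op in [("90 deg about [001]", ROT90_Z),
                 ("120 deg about [111]", ROT120_111),
                 ("mirror in x = 0", MIRROR_X)]:
    print(name, maps_lattice_into_itself(op))
```

The 120° rotation about [111] is simply a cyclic permutation of the three coordinates, which is why it can be written with integer entries. A shear such as ((1, 1, 0), (0, 1, 0), (0, 0, 1)) also sends lattice points to lattice points but fails the test, because it is not a rigid operation.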


1  An example of such a relation is the orthonormality condition $\mathbf{a}_i \cdot \mathbf{a}_j = a^2 \delta_{ij}$, holding for the
appropriate primitive vectors in a simple cubic Bravais lattice.

2  A  detailed  view  of  the  subject  can  be  found  in  M.  J.  Buerger,  Elementary  Crystallography,  Wiley, 
New  York,  1963. 

3  In  this  chapter  a  Bravais  lattice  is  viewed  as  the  crystal  structure  formed  by  placing  at  each  point 
of  an  abstract  Bravais  lattice  a  basis  of  maximum  possible  symmetry  (such  as  a  sphere,  centered  on  the 
lattice  point)  so  that  no  symmetries  of  the  point  Bravais  lattice  are  lost  because  of  the  insertion  of  the  basis. 

4  Operations  that  preserve  the  distance  between  all  lattice  points. 

5  We  shall  avoid  the  language  of  mathematical  group  theory,  since  we  shall  make  no  use  of  the 
analytical  conclusions  to  which  it  leads. 

6  Reflection in a plane replaces an object by its mirror image in that plane; inversion in a point P
takes the point with coordinates r (with respect to P as origin) into -r. All Bravais lattices have inversion
symmetry in any lattice point (Problem 1).

Figure 7.1

(a) A simple cubic lattice is carried into itself by a rotation through 90°
about an axis that contains no lattice points. The rotation axis is perpendicular
to the page, and only the four lattice points closest to the axis in
a single lattice plane are shown. (b) Illustrating how the same final result
can be compounded out of (at left) a translation through a lattice constant
and (at right) a rotation about the lattice point numbered 1.


Any symmetry operation of a Bravais lattice can be compounded out of a translation
$T_{\mathbf{R}}$ through a lattice vector R and a rigid operation leaving at least one lattice
point fixed.⁷ This is not immediately obvious. A simple cubic Bravais lattice, for
example, is left fixed by a rotation through 90° about a ⟨100⟩ axis that passes through
the center of a cubic primitive cell with lattice points at the eight vertices of the
cube. This is a rigid operation that leaves no lattice point fixed. However, it can be
compounded out of a translation through a Bravais lattice vector and a rotation
about a line of lattice points, as illustrated in Figure 7.1. That such a representation
is always possible can be seen as follows:

Consider a symmetry operation S that leaves no lattice point fixed. Suppose it
takes the origin of the lattice O into the point R. Consider next the operation one gets
by first applying S, and then applying a translation through -R, which we denote
by $T_{-\mathbf{R}}$. The composite operation, which we call $T_{-\mathbf{R}}S$, is also a symmetry of the lattice,
but it leaves the origin fixed, since S transports the origin to R while $T_{-\mathbf{R}}$ carries R
back to the origin. Thus $T_{-\mathbf{R}}S$ is an operation that leaves at least one lattice point
(namely the origin) fixed. If, however, after performing the operation $T_{-\mathbf{R}}S$ we then
perform the operation $T_{\mathbf{R}}$, the result is equivalent to the operation S alone, since the
final application of $T_{\mathbf{R}}$ just undoes the preceding application of $T_{-\mathbf{R}}$. Therefore S can
be compounded out of $T_{-\mathbf{R}}S$, which leaves a point fixed, and $T_{\mathbf{R}}$, which is a pure
translation.
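
The argument can be traced numerically for the simple cubic example of Figure 7.1. The following sketch is an illustration under stated assumptions: lengths are in units of the lattice constant, the rotation axis is taken parallel to z through the cell center (1/2, 1/2, 0), and the names `S`, `translate`, and `point_part` are hypothetical.

```python
import itertools

def S(r):
    """Rotate by 90 degrees about the axis parallel to z through (1/2, 1/2, 0).

    In units of the lattice constant this sends (x, y, z) to (1 - y, x, z).
    The axis threads the cell centers, so no lattice point stays fixed.
    """
    x, y, z = r
    return (1 - y, x, z)

def translate(r, R):
    """The translation T_R applied to the point r."""
    return tuple(a + b for a, b in zip(r, R))

origin = (0, 0, 0)
R = S(origin)                     # S carries the origin to R
minus_R = tuple(-c for c in R)

def point_part(r):
    """The composite T_{-R} S, which should leave the origin fixed."""
    return translate(S(r), minus_R)

points = list(itertools.product(range(-3, 4), repeat=3))

print("R =", R)
print("S fixes no sampled lattice point:", all(S(r) != r for r in points))
print("T_{-R} S fixes the origin:", point_part(origin) == origin)
print("T_{-R} S is a rotation about the z-axis through the origin:",
      all(point_part(r) == (-r[1], r[0], r[2]) for r in points))
print("S = T_R (T_{-R} S) on all samples:",
      all(translate(point_part(r), R) == S(r) for r in points))
```

Here R turns out to be one lattice constant along x, and the point operation left over is the ordinary 90° rotation about a line of lattice points, as in Figure 7.1b.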


7  Note that translation through a lattice vector (other than 0) leaves no point fixed.

Thus the full symmetry group of a Bravais lattice⁸ contains only operations of
the following form:

1.  Translations  through  Bravais  lattice  vectors; 

2.  Operations  that  leave  a  particular  point  of  the  lattice  fixed; 

3.  Operations  that  can  be  constructed  by  successive  applications  of  the  operations 
of  type  (1)  or  (2). 

The  Seven  Crystal  Systems 

When  examining  nontranslational  symmetries,  one  often  considers  not  the  entire 
space  group  of  a  Bravais  lattice,  but  only  those  operations  that  leave  a  particular 
point  fixed  (i.e.,  the  operations  in  category  (2)  above).  This  subset  of  the  full  symmetry 
group  of  the  Bravais  lattice  is  called  the  point  group  of  the  Bravais  lattice. 

There turn out to be only seven distinct point groups that a Bravais lattice can
have.⁹ Any crystal structure belongs to one of seven crystal systems, depending on
which  of  these  seven  point  groups  is  the  point  group  of  its  underlying  Bravais  lattice. 
The  seven  crystal  systems  are  enumerated  in  the  next  section. 


Figure  7.2 

(a)  Every  symmetry  operation  of  a  cube  is  also  a  symmetry  operation 
of  a  regular  octahedron,  and  vice  versa.  Thus  the  cubic  group  is 
identical  to  the  octahedral  group,  (b)  Not  every  symmetry  operation 
of  a  cube  is  a  symmetry  operation  of  a  regular  tetrahedron.  For 
example, rotation through 90° about the indicated vertical axis takes
the cube into itself, but not the tetrahedron.


8  We  shall  see  below  that  a  general  crystal  structure  can  have  additional  symmetry  operations  that 
are not of types (1), (2), or (3). They are known as "screw axes" and "glide planes."

9  Two point groups are identical if they contain precisely the same operations. For example, the set
of all symmetry operations of a cube is identical to the set of all symmetry operations of a regular octahedron,
as can readily be seen by inscribing the octahedron suitably in the cube (Figure 7.2a). On the other
hand, the symmetry group of the cube is not equivalent to the symmetry group of the regular tetrahedron.
The cube has more symmetry operations (Figure 7.2b).

The Fourteen Bravais Lattices

When one relaxes the restriction to point operations and considers the full symmetry
group of the Bravais lattice, there turn out to be fourteen distinct space groups that a
Bravais lattice can have.¹⁰ Thus, from the point of view of symmetry, there are
fourteen different kinds of Bravais lattice. This enumeration was first done by M. L.
Frankenheim (1842). Frankenheim miscounted, however, reporting fifteen possibilities.
A. Bravais (1845) was the first to count the categories correctly.

Enumeration  of  the  Seven  Crystal  Systems  and  Fourteen  Bravais  Lattices 

We  list  below  the  seven  crystal  systems  and  the  Bravais  lattices  belonging  to  each. 
The  number  of  Bravais  lattices  in  a  system  is  given  in  parentheses  after  the  name  of 
the  system: 

Cubic (3)  The cubic system contains those Bravais lattices whose point group is just
the  symmetry  group  of  a  cube  (Figure  7.3a).  Three  Bravais  lattices  with  nonequivalent 
space  groups  all  have  the  cubic  point  group.  They  are  the  simple  cubic,  body-centered 
cubic,  and  face-centered  cubic.  All  three  have  been  described  in  Chapter  4. 

Tetragonal  (2)  One  can  reduce  the  symmetry  of  a  cube  by  pulling  on  two  opposite 
faces  to  stretch  it  into  a  rectangular  prism  with  a  square  base,  but  a  height  not  equal 
to  the  sides  of  the  square  (Figure  7.3b).  The  symmetry  group  of  such  an  object  is  the 
tetragonal  group.  By  so  stretching  the  simple  cubic  Bravais  lattice  one  constructs  the 
simple  tetragonal  Bravais  lattice,  which  can  be  characterized  as  a  Bravais  lattice 
generated  by  three  mutually  perpendicular  primitive  vectors,  only  two  of  which 
are  of  equal  length.  The  third  axis  is  called  the  c-axis.  By  similarly  stretching  the 
body-centered  and  face-centered  cubic  lattices  only  one  more  Bravais  lattice  of  the 
tetragonal  system  is  constructed,  the  centered  tetragonal. 

10  The equivalence of two Bravais lattice space groups is a somewhat more subtle notion than the
equivalence of two point groups (although both reduce to the concept of "isomorphism" in abstract group
theory). It is no longer enough to say that two space groups are equivalent if they have the same operations,
for the operations of identical space groups can differ in inconsequential ways. For example, two simple
cubic Bravais lattices with different lattice constants, a and a′, are considered to have the same space
groups even though the translations in one are in steps of a, whereas the translations in the other are in
steps of a′. Similarly, we would like to regard all simple hexagonal Bravais lattices as having identical space
groups, regardless of the value of c/a, which is clearly irrelevant to the total symmetry of the structure.

We can get around this problem by noting that in such cases one can continuously deform a structure
of a given type into another of the same type without ever losing any of the symmetry operations along
the way. Thus one can uniformly expand the cube axes from a to a′, always maintaining the simple cubic
symmetry, or one can stretch (or shrink) the c-axis (or a-axis), always maintaining the simple hexagonal
symmetry. Therefore two Bravais lattices can be said to have the same space group if it is possible
continuously to transform one into the other in such a way that every symmetry operation of the first is
continuously transformed into a symmetry operation of the second, and there are no additional symmetry
operations of the second not so obtained from symmetry operations of the first.

Figure 7.3

Objects whose symmetries are the point-group symmetries of Bravais lattices
belonging to the seven crystal systems: (a) cubic; (b) tetragonal; (c) orthorhombic;
(d) monoclinic; (e) triclinic; (f) trigonal; (g) hexagonal.


To see why there is no distinction between body-centered and face-centered tetragonal,
consider Figure 7.4a, which is a representation of a centered tetragonal
Bravais lattice viewed along the c-axis. The points 2 lie in a lattice plane a distance
c/2 from the lattice plane containing the points 1. If c = a, the structure is nothing
but a body-centered cubic Bravais lattice, and for general c it can evidently be viewed
as the result of stretching the bcc lattice along the c-axis. However, precisely the same
lattice can also be viewed along the c-axis, as in Figure 7.4b, with the lattice planes
regarded as centered square arrays of side a′ = √2 a. If c = a′ = √2 a, so that successive
planes are a′/2 apart, the structure is nothing but a face-centered cubic Bravais lattice,
and for general c it can therefore be viewed as the result of stretching the fcc lattice
along the c-axis.

Putting it the other way around, face-centered cubic and body-centered cubic are
both special cases of centered tetragonal, in which the particular value of the c/a
ratio introduces extra symmetries that are revealed most clearly when one views the
lattice as in Figure 7.4a (bcc) or Figure 7.4b (fcc).
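
One way to see the two special values of c/a is to count neighbors. The sketch below is an illustration, not part of the original text: it lists the nearest shells of a centered tetragonal lattice for a given c/a, with c measured as in Figure 7.4 (planes of points 1 and 2 are c/2 apart). The function name `neighbor_shells` and its parameters are hypothetical.

```python
import itertools
import math
from collections import Counter

def neighbor_shells(c_over_a, n_shells=2, extent=2):
    """Return the nearest neighbor shells of a centered tetragonal lattice.

    Lengths are in units of a. Lattice points sit at (n1, n2, n3 * c) and at
    the same positions shifted by (1/2, 1/2, c/2), as in Figure 7.4a.
    Each shell is a pair (distance, number of points at that distance).
    """
    counts = Counter()
    for n1, n2, n3 in itertools.product(range(-extent, extent + 1), repeat=3):
        for shift in (0.0, 0.5):
            x, y, z = n1 + shift, n2 + shift, (n3 + shift) * c_over_a
            d = math.sqrt(x * x + y * y + z * z)
            if d > 1e-12:                   # skip the origin itself
                counts[round(d, 9)] += 1    # group equal distances together
    return sorted(counts.items())[:n_shells]

for label, ratio in [("c = a", 1.0),
                     ("c = sqrt(2) a", math.sqrt(2)),
                     ("c = 1.2 a", 1.2)]:
    print(label, neighbor_shells(ratio))
```

For c = a the first two shells hold 8 and 6 points, the body-centered cubic pattern; for c = √2 a the shell of 8 and a shell of 4 merge into 12 equidistant neighbors, the face-centered cubic pattern; for an intermediate value such as c = 1.2 a the shells stay separate, with 8 points followed by 4.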


Orthorhombic (4)  Continuing to still less symmetric deformations of the cube, one
can reduce tetragonal symmetry by deforming the square faces of the object in Figure
7.3b into rectangles, producing an object with mutually perpendicular sides of three
unequal lengths (Figure 7.3c). The orthorhombic group is the symmetry group of
such an object. By stretching a simple tetragonal lattice along one of the a-axes
(Figure 7.5a and b), one produces the simple orthorhombic Bravais lattice. However,
by stretching the simple tetragonal lattice along a square diagonal (Figure 7.5c and
d) one produces a second Bravais lattice of orthorhombic point group symmetry, the
base-centered orthorhombic.

Figure 7.4

Two ways of viewing the same centered tetragonal Bravais lattice. The view is along the
c-axis. The points labeled 1 lie in a lattice plane perpendicular to the c-axis, and the points
labeled 2 lie in a parallel lattice plane a distance c/2 away. In (a) the points 1 are viewed
as a simple square array, stressing that centered tetragonal is a distortion of body-centered
cubic. In (b) the points 1 are viewed as a centered square array, stressing that centered
tetragonal is also a distortion of face-centered cubic.


Figure 7.5

Two ways of deforming the same simple tetragonal Bravais lattice. The view is along
the c-axis, and a single lattice plane is shown. In (a) bonds are drawn to emphasize that
the points in the plane can be viewed as a simple square array. Stretching along a side
of that array leads to the rectangular nets (b), stacked directly above one another. The
resulting Bravais lattice is simple orthorhombic. In (c) lines are drawn to emphasize
that the same array of points as shown in (a) can also be viewed as a centered square
array. Stretching along a side of that array (i.e., along a diagonal of the square array
emphasized in (a)) yields the centered rectangular nets (d), stacked directly above one
another. The resulting Bravais lattice is base-centered orthorhombic.

In the same way, one can reduce the point symmetry of the centered tetragonal
lattice to orthorhombic in two ways, stretching either along one set of parallel lines
drawn in Figure 7.4a to produce body-centered orthorhombic, or along one set of
parallel lines in Figure 7.4b, producing face-centered orthorhombic.

These  four  Bravais  lattices  exhaust  the  orthorhombic  system. 

Monoclinic (2)  One can reduce orthorhombic symmetry by distorting the rectangular
faces perpendicular to the c-axis in Figure 7.3c into general parallelograms.
The symmetry group of the resulting object (Figure 7.3d) is the monoclinic group.
By so distorting the simple orthorhombic Bravais lattice one produces the simple
monoclinic Bravais lattice, which has no symmetries other than those required by the
fact that it can be generated by three primitive vectors, one of which is perpendicular
to the plane of the other two. Similarly, distorting the base-centered orthorhombic
Bravais lattice produces a lattice with the same simple monoclinic space group. However,
so distorting either the face-centered or body-centered orthorhombic Bravais
lattices produces the centered monoclinic Bravais lattice (Figure 7.6).

Figure 7.6

View along the c-axis of a centered monoclinic
Bravais lattice. The points labeled 1 lie in a
lattice plane perpendicular to the c-axis. The
points labeled 2 lie in a parallel lattice plane a
distance c/2 away, and are directly above the
centers of the parallelograms formed by the
points 1.

Note  that  the  two  monoclinic  Bravais  lattices  correspond  to  the  two  tetragonal 
ones.  The  doubling  in  the  orthorhombic  case  reflects  the  fact  that  a  rectangular  net 
and  a  centered  rectangular  net  have  distinct  two-dimensional  symmetry  groups,  while 
a  square  net  and  centered  square  net  are  not  distinct,  nor  are  a  parallelogram  net 
and  centered  parallelogram  net. 

Triclinic (1)  The destruction of the cube is completed by tilting the c-axis in Figure
7.3d so that it is no longer perpendicular to the other two, resulting in the object
pictured in Figure 7.3e, upon which there are no restrictions except that pairs of opposite
faces are parallel. By so distorting either monoclinic Bravais lattice one constructs
the triclinic Bravais lattice. This is the Bravais lattice generated by three primitive
vectors with no special relationships to one another, and is therefore the Bravais
lattice of minimum symmetry. The triclinic point group is not, however, the group
of an object without any symmetry, since any Bravais lattice is invariant under an
inversion in a lattice point. That, however, is the only symmetry required by the
general definition of a Bravais lattice, and therefore the only operation¹¹ in the
triclinic point group.

By  so  torturing  a  cube  we  have  arrived  at  twelve  of  the  fourteen  Bravais  lattices 
and  five  of  the  seven  crystal  systems.  We  can  find  the  thirteenth  and  sixth  by  returning 
to  the  original  cube  and  distorting  it  differently: 


11  Other  than  the  identity  operation  (which  leaves  the  lattice  where  it  is),  which  is  always  counted 
among  the  members  of  a  symmetry  group. 


Trigonal (1)  The trigonal point group describes the symmetry of the object one
produces by stretching a cube along a body diagonal (Figure 7.3f). The lattice made
by so distorting any of the three cubic Bravais lattices is the rhombohedral (or trigonal)
Bravais lattice. It is generated by three primitive vectors of equal length that make
equal angles with one another.¹²

Finally,  unrelated  to  the  cube,  is: 

Hexagonal  (1)  The  hexagonal  point  group  is  the  symmetry  group  of  a  right  prism 
with  a  regular  hexagon  as  base  (Figure  7.3g).  The  simple  hexagonal  Bravais  lattice 
(described in Chapter 4) has the hexagonal point group and is the only Bravais lattice
in the hexagonal system.¹³

The  seven  crystal  systems  and  fourteen  Bravais  lattices  described  above  exhaust 
the  possibilities.  This  is  far  from  obvious  (or  the  lattices  would  have  been  known  as 
Frankenheim  lattices).  However,  it  is  of  no  practical  importance  to  understand  why 
these  are  the  only  distinct  cases.  It  is  enough  to  know  why  the  categories  exist,  and 
what  they  are. 
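
Although the completeness of the list is not proved here, the sizes of the seven point groups can be checked numerically. The sketch below is an illustration under stated assumptions and is not part of the original text. It counts the operations that fix a lattice point by searching for integer matrices M that send each primitive vector to a combination of primitive vectors while preserving every dot product. The function names, the particular cell dimensions, and the restriction of the search to coefficients -1, 0, 1 (adequate for the compact primitive cells chosen here) are all hypothetical choices.

```python
import itertools
import math

def gram_matrix(vectors):
    """Dot products a_i . a_j of the three primitive vectors."""
    return [[sum(x * y for x, y in zip(u, v)) for v in vectors] for u in vectors]

def bilinear(G, m, n):
    """Dot product of the lattice vectors with integer coefficients m and n."""
    return sum(m[i] * G[i][j] * n[j] for i in range(3) for j in range(3))

def point_group_order(vectors, tol=1e-9):
    """Count rigid operations that fix a lattice point and preserve the lattice.

    Each operation sends a_i to sum_j M[i][j] a_j with integer M and must keep
    every dot product a_i . a_j unchanged. Only coefficients -1, 0, 1 are
    searched, an assumption valid for the primitive cells used below.
    """
    G = gram_matrix(vectors)
    coeffs = list(itertools.product((-1, 0, 1), repeat=3))
    # Candidate images of a_i: lattice vectors with the same length as a_i.
    candidates = [
        [n for n in coeffs if abs(bilinear(G, n, n) - G[i][i]) < tol]
        for i in range(3)
    ]
    order = 0
    for rows in itertools.product(*candidates):
        if all(abs(bilinear(G, rows[i], rows[j]) - G[i][j]) < tol
               for i in range(3) for j in range(i + 1, 3)):
            order += 1
    return order

s = 0.2  # hypothetical stretch along the body diagonal for the trigonal example
lattices = {
    "cubic (simple)":        [(1, 0, 0), (0, 1, 0), (0, 0, 1)],
    "cubic (body-centered)": [(-0.5, 0.5, 0.5), (0.5, -0.5, 0.5), (0.5, 0.5, -0.5)],
    "cubic (face-centered)": [(0, 0.5, 0.5), (0.5, 0, 0.5), (0.5, 0.5, 0)],
    "tetragonal":            [(1, 0, 0), (0, 1, 0), (0, 0, 1.5)],
    "orthorhombic":          [(1, 0, 0), (0, 1.3, 0), (0, 0, 1.7)],
    "monoclinic":            [(1, 0, 0), (0.3, 1.2, 0), (0, 0, 1.7)],
    "triclinic":             [(1, 0, 0), (0.3, 1.2, 0), (0.2, 0.4, 1.7)],
    "trigonal":              [(1 + s, s, s), (s, 1 + s, s), (s, s, 1 + s)],
    "hexagonal":             [(1, 0, 0), (0.5, math.sqrt(3) / 2, 0), (0, 0, 1.6)],
}

orders = {name: point_group_order(v) for name, v in lattices.items()}
for name, order in orders.items():
    print(f"{name:22s} {order}")
```

The three cubic lattices all give 48 operations, in line with the statement that they share the cubic point group, while the tetragonal, orthorhombic, monoclinic, triclinic, trigonal, and hexagonal examples give 16, 8, 4, 2, 12, and 24. The count of 2 for the triclinic lattice corresponds to the identity and the inversion.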

THE  CRYSTALLOGRAPHIC  POINT  GROUPS  AND  SPACE  GROUPS 

We  next  describe  the  results  of  a  similar  analysis,  applied  not  to  Bravais  lattices  but 
to  general  crystal  structures.  We  consider  the  structure  obtained  by  translating  an 
arbitrary  object  through  the  vectors  of  any  Bravais  lattice,  and  try  to  classify  the 
symmetry  groups  of  the  arrays  so  obtained.  These  depend  both  on  the  symmetry  of 
the  object  and  the  symmetry  of  the  Bravais  lattice.  Because  the  objects  are  no  longer 
required  to  have  maximum  (e.g.,  spherical)  symmetry,  the  number  of  symmetry  groups 
is  greatly  increased:  there  turn  out  to  be  230  different  symmetry  groups  that  a  lattice 
with  a  basis  can  have,  known  as  the  230  space  groups.  (This  is  to  be  compared  with 
the  fourteen  space  groups  that  result  when  the  basis  is  required  to  be  completely 
symmetric.) 

The  possible  point  groups  of  a  general  crystal  structure  have  also  been  enumerated. 
These  describe  the  symmetry  operations  that  take  the  crystal  structure  into  itself 
while  leaving  one  point  fixed  (i.e.,  the  nontranslational  symmetries).  There  are  thirty- 
two  distinct  point  groups  that  a  crystal  structure  can  have,  known  as  the  thirty-two 
crystallographic  point  groups.  (This  is  to  be  compared  with  the  seven  point  groups 
one  can  have  when  the  basis  is  required  to  have  full  symmetry.) 

These  various  numbers  and  their  relations  to  one  another  are  summarized  in 
Table  7.1. 

The thirty-two crystallographic point groups can be constructed out of the seven
Bravais lattice point groups by systematically considering all possible ways of reducing
the symmetry of the objects (Figure 7.3) characterized by these groups.

Each  of  the  twenty-five  new  groups  constructed  in  this  way  is  associated  with  one 


12  Special  values  of  that  angle  may  introduce  extra  symmetries,  in  which  case  the  lattice  may  actually 
be  one  of  the  three  cubic  types.  See,  for  example,  Problem  2(a). 

13  If  one  tries  to  produce  further  Bravais  lattices  from  distortions  of  the  simple  hexagonal,  one  finds 
that  changing  the  angle  between  the  two  primitive  vectors  of  equal  length  perpendicular  to  the  c-axis  yields  a 
base-centered  orthorhombic  lattice,  changing  their  magnitudes  as  well  leads  to  monoclinic,  and  tilting 
the  c-axis  from  the  perpendicular  leads,  in  general,  to  triclinic. 
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Table  7.1 

POINT  AND  SPACE  GROUPS  OF  BRAVAIS  LATTICES  AND  CRYSTAL  STRUCTURES 


BRAVAIS  LATTICE 

(basis  of  spherical  symmetry) 

CRYSTAL  STRUCTURE 

(basis  of  arbitrary  symmetry) 

Number  of 

7 

32 

point  groups: 

(“the  7  crystal  systems") 

(“the  32  crystallographic  point  groups”) 

Number  of 

14 

230 

space  groups: 

(“the  14  Bravais  lattices") 

(“the  230  space  groups”) 

of  the  seven  crystal  systems  according  to  the  following  rule:  Any  group  constructed 
by  reducing  the  symmetry  of  an  object  characterized  by  a  particular  crystal  system 
continues  to  belong  to  that  system  until  the  symmetry  has  been  reduced  so  far  that 
all of the remaining symmetry operations of the object are also found in a less
symmetrical crystal system; when this happens the symmetry group of the object is
assigned  to  the  less  symmetrical  system.  Thus  the  crystal  system  of  a  crystallographic 
point  group  is  that  of  the  least  symmetric14  of  the  seven  Bravais  lattice  point  groups 
containing  every  symmetry  operation  of  the  crystallographic  group. 

Figure 7.7
The hierarchy of symmetries among the seven crystal systems. Each Bravais lattice
point group contains all those that can be reached from it by moving in the direction
of the arrows. [Diagram not reproduced: cubic at the top; hexagonal and tetragonal
below it; then trigonal and orthorhombic; then monoclinic; triclinic at the bottom.]

Objects  with  the  symmetries  of  the  five  crystallographic  point  groups  in  the  cubic 
system  are  pictured  in  Table  7.2.  Objects  with  the  symmetries  of  the  twenty-seven 
noncubic  crystallographic  groups  are  shown  in  Table  7.3. 

Crystallographic  point  groups  may  contain  the  following  kinds  of  symmetry 
operations : 

1. Rotations through Integral Multiples of 2π/n about Some Axis  The axis is called
an  n-fold  rotation  axis.  It  is  easily  shown  (Problem  6)  that  a  Bravais  lattice  can 


14  The  notion  of  a  hierarchy  of  crystal  system  symmetries  needs  some  elaboration.  In  Figure  7.7  each 
crystal  system  is  more  symmetric  than  any  that  can  be  reached  from  it  by  moving  along  arrows;  i.e.,  the 
corresponding  Bravais  lattice  point  group  has  no  operations  that  are  not  also  in  the  groups  from  which  it 
can  be  so  reached.  There  appears  to  be  some  ambiguity  in  this  scheme  since  the  four  pairs  cubic-hexagonal, 
tetragonal-hexagonal,  tetragonal-trigonal,  and  orthorhombic-trigonal  are  not  ordered  by  the  arrows. 
Thus  one  might  imagine  an  object  all  of  whose  symmetry  operations  belonged  to  both  the  tetragonal  and 
trigonal  groups  but  to  no  group  lower  than  both  of  these.  The  symmetry  group  of  such  an  object  could 
be  said  to  belong  to  either  the  tetragonal  or  trigonal  systems,  since  there  would  be  no  unique  system  of 
lowest  symmetry.  It  turns  out,  however,  both  in  this  and  the  other  three  ambiguous  cases,  that  all  symmetry 
elements  common  to  both  groups  in  a  pair  also  belong  to  a  group  that  is  lower  than  both  in  the  hierarchy. 
(For  example,  any  element  common  to  both  the  tetragonal  and  the  trigonal  groups  also  belongs  to  the 
monoclinic  group.)  There  is  therefore  always  a  unique  group  of  lowest  symmetry. 
Table 7.2
OBJECTS WITH THE SYMMETRY OF THE FIVE CUBIC CRYSTALLOGRAPHIC POINT GROUPSᵃ

[Table of representative objects not reproduced.]

ᵃ To the left of each object is the Schoenflies name of its symmetry group and to the right is the
international name. The unpictured faces may be deduced from the fact that rotation about a
body diagonal through 120° is a symmetry operation for all five objects. (Such an axis is shown
on the undecorated cube.)

contain  only  2-,  3-,  4-,  or  6-fold  axes.  Since  the  crystallographic  point  groups  are 
contained  in  the  Bravais  lattice  point  groups,  they  too  can  only  have  these  axes. 

2. Rotation-Reflections  Even when a rotation through 2π/n is not a symmetry
element, sometimes such a rotation followed by a reflection in a plane perpendicular
to the axis may be. The axis is then called an n-fold rotation-reflection
axis. For example, the groups S6 and S4 (Table 7.3) have 6- and 4-fold
rotation-reflection axes.

3. Rotation-Inversions  Similarly, sometimes a rotation through 2π/n followed by an
inversion in a point lying on the rotation axis is a symmetry element, even though
such a rotation by itself is not. The axis is then called an n-fold rotation-inversion
axis. The axis in S4 (Table 7.3), for example, is also a 4-fold rotation-inversion
axis. However, the axis in S6 is only a 3-fold rotation-inversion axis.

4.  Reflections  A  reflection  takes  every  point  into  its  mirror  image  in  a  plane,  known 
as  a  mirror  plane. 
Table 7.3
THE NONCUBIC CRYSTALLOGRAPHIC POINT GROUPSᵃ

[Table of representative objects not reproduced.]

ᵃ The unpictured faces can be deduced by imagining the representative objects to be rotated about
the n-fold axis, which is always vertical. The Schoenflies name of the group is given to the left
of the representative object, and the international designation to the right. The groups are organized
into vertical columns by crystal system, and into horizontal rows by the Schoenflies or
international type. Note that the Schoenflies categories (given on the extreme left of the table) divide
up the groups somewhat differently from the international categories (given on the extreme right).
In most (but not all) cases the representative objects have been made by simply decorating in the
appropriate symmetry-reducing manner the faces of the objects used to represent the crystal
systems (Bravais lattice point groups) in Figure 7.3. Exceptions are the trigonal groups and two
of the hexagonal groups, where the figures have been changed to emphasize the similarity within
the (horizontal) Schoenflies categories. For a representation of the trigonal groups by decorations
of the object in Figure 7.3f, see Problem 4.


5.  Inversions  An  inversion  has  a  single  fixed  point.  If  that  point  is  taken  as  the 
origin, then every other point r is taken into −r.
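The relations among these operations can be checked numerically. The following is a minimal sketch, not part of the original text: it represents operations about a vertical (z) axis as 3×3 matrices and compares the cyclic groups generated by rotation-reflection and rotation-inversion axes. All function names are illustrative. It also lists the rotation orders compatible with a Bravais lattice, using a standard argument that differs from the one suggested in Problem 6: a lattice symmetry has an integer matrix in a basis of primitive vectors, so the trace 1 + 2cos(2π/n) of an n-fold rotation must be an integer.

```python
import math

def rot_z(n):
    """Proper rotation through 2*pi/n about the z-axis."""
    t = 2.0 * math.pi / n
    c, s = math.cos(t), math.sin(t)
    return ((c, -s, 0.0), (s, c, 0.0), (0.0, 0.0, 1.0))

MIRROR_H = ((1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, -1.0))     # reflection in the xy-plane
INVERSION = ((-1.0, 0.0, 0.0), (0.0, -1.0, 0.0), (0.0, 0.0, -1.0))  # r -> -r

def matmul(a, b):
    """Product of two 3x3 matrices stored as nested tuples."""
    return tuple(tuple(sum(a[i][k] * b[k][j] for k in range(3))
                       for j in range(3)) for i in range(3))

def rotation_reflection(n):
    """Rotation through 2*pi/n followed by reflection in the perpendicular plane."""
    return matmul(MIRROR_H, rot_z(n))

def rotation_inversion(n):
    """Rotation through 2*pi/n followed by inversion in a point on the axis."""
    return matmul(INVERSION, rot_z(n))

def key(m):
    """Rounded, hashable form of a matrix (adding 0.0 turns -0.0 into 0.0)."""
    return tuple(tuple(round(x, 9) + 0.0 for x in row) for row in m)

def generated_group(g):
    """Set of all distinct powers of g (the cyclic group it generates)."""
    elements = set()
    m = g
    while key(m) not in elements:
        elements.add(key(m))
        m = matmul(m, g)
    return elements

def allowed_rotation_orders(max_n=12):
    """Orders n for which 2*cos(2*pi/n) is an integer."""
    orders = []
    for n in range(1, max_n + 1):
        trace_part = 2.0 * math.cos(2.0 * math.pi / n)
        if abs(trace_part - round(trace_part)) < 1e-9:
            orders.append(n)
    return orders
```

In this sketch `generated_group(rotation_reflection(4))` and `generated_group(rotation_inversion(4))` are the same set, while the group generated by the 6-fold rotation-reflection coincides with that of the 3-fold (not the 6-fold) rotation-inversion, in line with items 2 and 3 above.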

Point-Group  Nomenclature 

Two nomenclatural systems, the Schoenflies and the international, are in wide use.
Both designations are given in Tables 7.2 and 7.3.

Schoenflies Notation for the Noncubic Crystallographic Point Groups  The Schoenflies
categories are illustrated by grouping the rows in Table 7.3 according to the labels
given on the left side. They are:15

Cn:  These groups contain only an n-fold rotation axis.

Cnv:  In addition to the n-fold axis, these groups have a mirror plane that contains
the axis of rotation, plus as many additional mirror planes as the existence of
the n-fold axis requires.

Cnh:  These groups contain, in addition to the n-fold axis, a single mirror plane that
is perpendicular to the axis.

Sn:  These groups contain only an n-fold rotation-reflection axis.

Dn:  In addition to an n-fold rotation axis, these groups contain a 2-fold axis
perpendicular to the n-fold axis, plus as many additional 2-fold axes as are
required by the existence of the n-fold axis.

Dnh:  These (the most symmetric of the groups) contain all the elements of Dn plus
a mirror plane perpendicular to the n-fold axis.

Dnd:  These contain the elements of Dn plus mirror planes containing the n-fold
axis, which bisect the angles between the 2-fold axes.

It is instructive to verify that the objects shown in Table 7.3 do indeed have the
symmetries required by their Schoenflies names.
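As a bookkeeping check, the Schoenflies categories can be enumerated. The sketch below is not from the original text. The n values listed for each category are the standard crystallographic ones and should agree with the entries of Table 7.3; they are entered by hand here rather than derived, and the names `NONCUBIC_SCHOENFLIES` and `group_names` are illustrative.

```python
# Allowed values of n in each Schoenflies category (entered by hand; an
# assumption of this sketch, to be compared with the rows of Table 7.3).
NONCUBIC_SCHOENFLIES = {
    "Cn":  [1, 2, 3, 4, 6],
    "Cnv": [2, 3, 4, 6],
    "Cnh": [1, 2, 3, 4, 6],
    "Sn":  [2, 4, 6],
    "Dn":  [2, 3, 4, 6],
    "Dnh": [2, 3, 4, 6],
    "Dnd": [2, 3],
}

def group_names(categories=NONCUBIC_SCHOENFLIES):
    """Expand each category into explicit names such as 'C2v' or 'D3d'."""
    names = []
    for category, orders in categories.items():
        for n in orders:
            # Replace only the first 'n', i.e. the subscript placeholder.
            names.append(category.replace("n", str(n), 1))
    return names

if __name__ == "__main__":
    names = group_names()
    print(len(names), "noncubic groups:", " ".join(names))
```

The list has 27 entries, which together with the five cubic groups gives the thirty-two crystallographic point groups.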

International Notation for the Noncubic Crystallographic Point Groups  The international
categories are illustrated by grouping the rows in Table 7.3 according to


15  C stands for "cyclic," D for "dihedral," and S for "Spiegel" (mirror). The subscripts h, v, and d stand
for "horizontal," "vertical," and "diagonal," and refer to the placement of the mirror planes with respect
to the n-fold axis, considered to be vertical. (The "diagonal" planes in Dnd are vertical and bisect the angles
between the 2-fold axes.)
the labels given on the right side. Three categories are identical to the Schoenflies
categories:

n is the same as Cn.

nmm is the same as Cnv. The two m's refer to two distinct types of mirror planes
containing the n-fold axis. What they are is evident from the objects illustrating 6mm,
4mm, and 2mm. These demonstrate that a 2j-fold axis takes a vertical mirror plane
into j mirror planes, but in addition j others automatically appear, bisecting the
angles between adjacent planes in the first set. However, a (2j + 1)-fold axis takes a
mirror plane into 2j + 1 equivalent ones, and therefore C3v is only called 3m.16

n22 is the same as Dn. The discussion is the same as for nmm, but now perpendicular
2-fold axes are involved instead of vertical mirror planes.

The other international categories and their relation to those of Schoenflies are
as follows:

n/m is the same as Cnh, except that the international system prefers to regard C3h
as containing a 6-fold rotation-inversion axis, making it 6̄ (see the next category).
Note also that C1h becomes simply m, rather than 1/m.

n̄ is a group with an n-fold rotation-inversion axis. This category contains C3h,
disguised as 6̄. It also contains S4, which goes nicely into 4̄. However, S6 becomes
3̄ and S2 becomes 1̄ by virtue of the difference between rotation-reflection and
rotation-inversion axes.

(n/m)(2/m)(2/m), abbreviated n/mmm, is just Dnh, except that the international system prefers to
regard D3h as containing a 6-fold rotation-inversion axis, making it 6̄2m (see the next
category, and note the similarity to the ejection of C3h from n/m into n̄). Note also
that 2/mmm is conventionally abbreviated further into mmm. The full-blown international
title is supposed to remind one that Dnh can be viewed as an n-fold axis with
a perpendicular mirror plane, festooned with two sets of perpendicular 2-fold axes,
each with its own perpendicular mirror planes.

n̄2m is the same as Dnd, except that D3h is included as 6̄2m. The name is intended
to suggest an n-fold rotation-inversion axis with a perpendicular 2-fold axis and a
vertical mirror plane. The n = 3 case is again exceptional, the full name being 3̄(2/m)
(abbreviated 3̄m) to emphasize the fact that in this case the vertical mirror plane is
perpendicular to the 2-fold axis.
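The correspondence just described can be collected into a small lookup routine. This is only a sketch of the rules stated above, not a general crystallographic tool: the function name is illustrative, an overbar is written as a leading "-" (an ASCII convention assumed here, so 6̄ appears as "-6"), and only the 27 noncubic groups are handled.

```python
def international_name(schoenflies):
    """Translate a noncubic Schoenflies symbol such as 'C3h' into the
    international symbol, following the rules of the preceding paragraphs.
    An overbar is written as a leading '-' (assumed ASCII convention)."""
    family = schoenflies[0]      # 'C', 'S', or 'D'
    n = int(schoenflies[1])      # order of the principal axis
    suffix = schoenflies[2:]     # '', 'v', 'h', or 'd'

    if family == "C" and suffix == "":
        return str(n)                                  # n   <-> Cn
    if family == "C" and suffix == "v":
        return f"{n}mm" if n % 2 == 0 else f"{n}m"     # nmm <-> Cnv (3m for n = 3)
    if family == "D" and suffix == "":
        return f"{n}22" if n % 2 == 0 else f"{n}2"     # n22 <-> Dn (32 for n = 3)
    if family == "C" and suffix == "h":
        if n == 1:
            return "m"                                 # C1h is m, not 1/m
        if n == 3:
            return "-6"                                # C3h is moved into n-bar
        return f"{n}/m"
    if family == "S":
        return {2: "-1", 4: "-4", 6: "-3"}[n]          # rotation-inversion names
    if family == "D" and suffix == "h":
        if n == 2:
            return "mmm"                               # 2/mmm abbreviated further
        if n == 3:
            return "-62m"                              # D3h is moved into n-bar 2m
        return f"{n}/mmm"
    if family == "D" and suffix == "d":
        return {2: "-42m", 3: "-3m"}[n]
    raise ValueError(f"not a noncubic Schoenflies symbol: {schoenflies!r}")
```

For example, `international_name("S6")` gives `"-3"`, reflecting the fact that a 6-fold rotation-reflection axis is a 3-fold rotation-inversion axis.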

Nomenclature for the Cubic Crystallographic Point Groups  The Schoenflies and
international names for the five cubic groups are given in Table 7.2. Oh is the full
symmetry group of the cube (or octahedron, whence the O) including improper
operations,17 which the horizontal reflection plane (h) admits. O is the cubic (or
octahedral) group without improper operations. Td is the full symmetry group of the
regular tetrahedron including all improper operations, T is the group of the regular
tetrahedron excluding all improper operations, and Th is what results when an
inversion is added to T.


16  In emphasizing the differences between odd- and even-fold axes, the international system, unlike
the Schoenflies, treats the 3-fold axis as a special case.

17  Any operation that takes a right-handed object into a left-handed one is called improper. All others
are proper. Operations containing an odd number of inversions or mirrorings are improper.
The international names for the cubic groups are conveniently distinguished from
those  of  the  other  crystallographic  point  groups  by  containing  3  as  a  second  number, 
referring  to  the  3-fold  axis  present  in  all  the  cubic  groups. 

The  230  Space  Groups 

We  shall  have  mercifully  little  to  say  about  the  230  space  groups,  except  to  point 
out  that  the  number  is  larger  than  one  might  have  guessed.  For  each  crystal  system 
one  can  construct  a  crystal  structure  with  a  different  space  group  by  placing  an  object 
with  the  symmetries  of  each  of  the  point  groups  of  the  system  into  each  of  the  Bravais 
lattices  of  the  system.  In  this  way,  however,  we  find  only  61  space  groups,  as  shown 
in  Table  7.4. 


Table 7.4
ENUMERATION OF SOME SIMPLE SPACE GROUPS

SYSTEM          NUMBER OF       NUMBER OF           PRODUCT
                POINT GROUPS    BRAVAIS LATTICES

Cubic           5               3                   15
Tetragonal      7               2                   14
Orthorhombic    3               4                   12
Monoclinic      3               2                    6
Triclinic       2               1                    2
Hexagonal       7               1                    7
Trigonal        5               1                    5

Totals          32              14                  61

We  can  eke  out  five  more  by  noting  that  an  object  with  trigonal  symmetry  yields 
a  space  group  not  yet  enumerated,  when  placed  in  a  hexagonal  Bravais  lattice.18 


18  Although  the  trigonal  point  group  is  contained  in  the  hexagonal  point  group,  the  trigonal  Bravais 
lattice  cannot  be  obtained  from  the  simple  hexagonal  by  an  infinitesimal  distortion.  (This  is  in  contrast 
to  all  other  pairs  of  systems  connected  by  arrows  in  the  symmetry  hierarchy  of  Figure  7.7.)  The  trigonal 
point  group  is  contained  in  the  hexagonal  point  group  because  the  trigonal  Bravais  lattice  can  be  viewed 
as  simple  hexagonal  with  a  three-point  basis  consisting  of 

0;  (1/3)a1, (1/3)a2, (1/3)c;  and  (2/3)a1, (2/3)a2, (2/3)c.

As  a  result,  placing  a  basis  with  a  trigonal  point  group  into  a  hexagonal  Bravais  lattice  results  in  a  different 
space  group  from  that  obtained  by  placing  the  same  basis  into  a  trigonal  lattice.  In  no  other  case  is  this 
so. For example, a basis with tetragonal symmetry, when placed in a simple cubic lattice, yields exactly the
same space group as it would if placed in a simple tetragonal lattice (unless there happens to be a special
relation between the dimensions of the object and the length of the c-axis). This is reflected physically in the
fact that there are crystals that have trigonal bases in hexagonal Bravais lattices, but none with tetragonal
bases in cubic Bravais lattices. In the latter case there would be nothing in the structure of such an object
to require the c-axis to have the same length as the a-axes; if the lattice did remain cubic it would be a
mere  coincidence.  In  contrast,  a  simple  hexagonal  Bravais  lattice  cannot  distort  continuously  into  a  trigonal 
one,  and  can  therefore  be  held  in  its  simple  hexagonal  form  even  by  a  basis  with  only  trigonal  symmetry. 

Because  trigonal  point  groups  can  characterize  a  crystal  structure  with  a  hexagonal  Bravais  lattice, 
crystallographers sometimes maintain that there are only six crystal systems. This is because
crystallography emphasizes the point symmetry rather than the translational symmetry. From the point of view
of the Bravais lattice point groups, however, there are unquestionably seven crystal systems: the point
groups D3d and D6h are both the point groups of Bravais lattices, and are not equivalent.
Another seven arise from cases in which an object with the symmetry of a given point
group  can  be  oriented  in  more  than  one  way  in  a  given  Bravais  lattice  so  that  more 
than  one  space  group  arises.  These  73  space  groups  are  called  symmorphic. 
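The counting in the last few paragraphs can be reproduced in a few lines. This is a sketch for bookkeeping only: the per-system numbers are copied from Table 7.4, the figure of seven extra orientations is quoted from the text rather than derived, and the names used are illustrative.

```python
# (number of point groups, number of Bravais lattices) per system, from Table 7.4
SYSTEMS = {
    "cubic":        (5, 3),
    "tetragonal":   (7, 2),
    "orthorhombic": (3, 4),
    "monoclinic":   (3, 2),
    "triclinic":    (2, 1),
    "hexagonal":    (7, 1),
    "trigonal":     (5, 1),
}

TOTAL_SPACE_GROUPS = 230
EXTRA_ORIENTATIONS = 7   # quoted from the text, not derived here

def simple_product_count(systems=SYSTEMS):
    """Point groups of each system placed in each Bravais lattice of that system."""
    return sum(points * lattices for points, lattices in systems.values())

def symmorphic_count(systems=SYSTEMS):
    """Add trigonal objects in the hexagonal lattice and the extra orientations."""
    trigonal_in_hexagonal = systems["trigonal"][0] * systems["hexagonal"][1]
    return simple_product_count(systems) + trigonal_in_hexagonal + EXTRA_ORIENTATIONS

if __name__ == "__main__":
    print("simple products:", simple_product_count())                    # 61
    print("symmorphic:", symmorphic_count())                             # 73
    print("nonsymmorphic:", TOTAL_SPACE_GROUPS - symmorphic_count())     # 157
```

The remaining 230 − 73 = 157 space groups are the nonsymmorphic ones discussed next.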

The majority of the space groups are nonsymmorphic, containing additional
operations that cannot be simply compounded out of Bravais lattice translations and
point-group  operations.  For  there  to  be  such  additional  operations  it  is  essential  that 
there  be  some  special  relation  between  the  dimensions  of  the  basis  and  the  dimensions 
of  the  Bravais  lattice.  When  the  basis  does  have  a  size  suitably  matched  to  the  primitive 
vectors  of  the  lattice,  two  new  types  of  operations  may  arise: 

1.  Screw  Axes  A  crystal  structure  with  a  screw  axis  is  brought  into  coincidence 
with  itself  by  translation  through  a  vector  not  in  the  Bravais  lattice,  followed  by  a 
rotation  about  the  axis  defined  by  the  translation. 

2.  Glide  Planes  A  crystal  structure  with  a  glide  plane  is  brought  into  coincidence 
with  itself  by  translation  through  a  vector  not  in  the  Bravais  lattice,  followed  by 
a  reflection  in  a  plane  containing  that  vector. 

The  hexagonal  close-packed  structure  offers  examples  of  both  types  of  operation, 
as  shown  in  Figure  7.8.  They  occur  only  because  the  separation  of  the  two  basis 
points  along  the  c-axis  is  precisely  half  the  distance  between  lattice  planes. 
Figure 7.8

The  hexagonal  close-packed  structure 
viewed  along  the  c-axis.  Lattice  planes 
perpendicular  to  the  c-axis  are  separated 
by  c/2  and  contain,  alternately,  points  of 
type  1  and  points  of  type  2.  The  line  parallel 
to  the  c-axis  passing  through  the  dot  in  the 
center  of  the  figure  is  a  screw  axis:  the 
structure  is  invariant  under  a  translation 
through  c/2  along  the  axis  followed  by  a 
rotation of 60° (but it is not invariant under
either the translation or rotation alone).
The plane parallel to the c-axis that intersects
the figure in the dashed line is a glide
plane: the structure is invariant under a
translation through c/2 along the c-axis
followed by a reflection in the glide plane
(but is not invariant under either the
translation or reflection alone).
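The screw axis and glide plane of the hexagonal close-packed structure can be checked by direct computation. The sketch below is not from the original text and rests on stated assumptions: primitive vectors a1 = a x, a2 = (a/2) x + (√3 a/2) y, a3 = c z with basis points 0 and a1/3 + a2/3 + a3/2; the screw axis is taken through the point (2/3)(a1 + a2), which in projection is occupied by neither type of point; the glide plane is taken to be the plane through that axis parallel to a1. The figure is not available here, so these choices are assumptions about what it shows. The ideal c/a ratio is used only for definiteness.

```python
import math

A = 1.0                          # in-plane lattice constant (arbitrary units)
C = math.sqrt(8.0 / 3.0) * A     # ideal c/a, assumed only for definiteness
SQRT3 = math.sqrt(3.0)

BASIS = ((0.0, 0.0, 0.0), (A / 2.0, A * SQRT3 / 6.0, C / 2.0))
AXIS_XY = (A, A * SQRT3 / 3.0)   # assumed position of the screw axis, (2/3)(a1 + a2)

def is_hcp_site(p, tol=1e-9):
    """True if p is a point of the (infinite) hcp structure."""
    for b in BASIS:
        dx, dy, dz = p[0] - b[0], p[1] - b[1], p[2] - b[2]
        n2 = dy / (A * SQRT3 / 2.0)      # coefficient of a2
        n1 = dx / A - n2 / 2.0           # coefficient of a1
        n3 = dz / C                      # coefficient of a3
        if all(abs(n - round(n)) < tol for n in (n1, n2, n3)):
            return True
    return False

def sample_sites(m=2):
    """A finite sample of structure points used to test each operation."""
    sites = []
    for n1 in range(-m, m + 1):
        for n2 in range(-m, m + 1):
            for n3 in range(-m, m + 1):
                for b in BASIS:
                    sites.append((n1 * A + n2 * A / 2.0 + b[0],
                                  n2 * A * SQRT3 / 2.0 + b[1],
                                  n3 * C + b[2]))
    return sites

def rotate_60(p):
    """Rotation through 60 degrees about the axis through AXIS_XY parallel to c."""
    c, s = 0.5, SQRT3 / 2.0
    x, y = p[0] - AXIS_XY[0], p[1] - AXIS_XY[1]
    return (AXIS_XY[0] + c * x - s * y, AXIS_XY[1] + s * x + c * y, p[2])

def reflect(p):
    """Reflection in the plane y = AXIS_XY[1], which contains the c-direction."""
    return (p[0], 2.0 * AXIS_XY[1] - p[1], p[2])

def translate_half_c(p):
    """Translation through c/2 along the c-axis (not a Bravais lattice vector)."""
    return (p[0], p[1], p[2] + C / 2.0)

def screw(p):
    return rotate_60(translate_half_c(p))

def glide(p):
    return reflect(translate_half_c(p))

def maps_structure_into_itself(op):
    """Check that op sends every sampled point to some point of the structure."""
    return all(is_hcp_site(op(p)) for p in sample_sites())
```

With these assumptions `maps_structure_into_itself(screw)` and `maps_structure_into_itself(glide)` hold, while the translation, the rotation, and the reflection each fail the same test when applied alone.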


There  are  both  Schoenflies  and  international  systems  of  space-group  nomenclature, 
which  can  be  found,  on  the  few  occasions  they  may  be  needed,  in  the  book  by  Buerger 
cited  in  footnote  2. 
EXAMPLES AMONG THE ELEMENTS

In  Chapter  4  we  listed  those  elements  with  face-centered  cubic,  body-centered  cubic, 
hexagonal close-packed, or diamond crystal structures. Over 70 percent of the
elements fall into these four categories. The remaining ones are scattered among a variety
of crystal structures, most with polyatomic primitive cells that are sometimes quite
complex. We conclude this chapter with a few further examples listed in Tables 7.5,
7.6, and 7.7. Data are from Wyckoff (cited on page 70) and are for room temperature
and  normal  atmospheric  pressure,  unless  stated  otherwise. 


Table 7.5
ELEMENTS WITH RHOMBOHEDRAL (TRIGONAL) BRAVAIS LATTICESᵃ

ELEMENT      a (Å)    θ         ATOMS IN          BASIS
                                PRIMITIVE CELL

Hg (5 K)     2.99     70°45'    1                 x = 0
As           4.13     54°10'    2                 x = ±0.226
Sb           4.51     57°6'     2                 x = ±0.233
Bi           4.75     57°14'    2                 x = ±0.237
Sm           9.00     23°13'    3                 x = 0, ±0.222

ᵃ The common length of the primitive vectors is a, and the angle between any
two of them is θ. In all cases the basis points expressed in terms of these primitive
vectors have the form x(a1 + a2 + a3). Note (Problem 2(b)) that arsenic, antimony,
and bismuth are quite close to a simple cubic lattice, distorted along a body
diagonal.
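The remark that arsenic, antimony, and bismuth are close to simple cubic can be made concrete by computing nearest-neighbor distances from the tabulated a, θ, and x. The sketch below is an illustration, not part of the original text. It assumes the reference case of Problem 2(b), θ = 60° with x = ±1/4, for which the structure is simple cubic, and it orients the 3-fold axis along z; function names are illustrative.

```python
import math

def primitive_vectors(a, theta_deg):
    """Three vectors of length a with mutual angle theta, 3-fold axis along z."""
    cos_t = math.cos(math.radians(theta_deg))
    sin_phi = math.sqrt(2.0 * (1.0 - cos_t) / 3.0)   # tilt of each vector from z
    cos_phi = math.sqrt(1.0 - sin_phi ** 2)
    return [(a * sin_phi * math.cos(2.0 * math.pi * i / 3.0),
             a * sin_phi * math.sin(2.0 * math.pi * i / 3.0),
             a * cos_phi) for i in range(3)]

def six_nearest_distances(a, theta_deg, x, n_max=3):
    """Sorted distances from the atom at +x(a1+a2+a3) to its six nearest atoms,
    for a two-point basis at +x and -x times (a1+a2+a3)."""
    a1, a2, a3 = primitive_vectors(a, theta_deg)
    s = tuple(a1[k] + a2[k] + a3[k] for k in range(3))
    origin = tuple(x * s[k] for k in range(3))
    distances = []
    for n1 in range(-n_max, n_max + 1):
        for n2 in range(-n_max, n_max + 1):
            for n3 in range(-n_max, n_max + 1):
                for sign in (1.0, -1.0):
                    p = tuple(n1 * a1[k] + n2 * a2[k] + n3 * a3[k] + sign * x * s[k]
                              for k in range(3))
                    d = math.sqrt(sum((p[k] - origin[k]) ** 2 for k in range(3)))
                    if d > 1e-9:                      # skip the atom itself
                        distances.append(d)
    return sorted(distances)[:6]

if __name__ == "__main__":
    print("ideal:", six_nearest_distances(1.0, 60.0, 0.25))
    print("Bi   :", six_nearest_distances(4.75, 57.0 + 14.0 / 60.0, 0.237))
```

In the reference case all six distances are equal (a/√2 in units of the rhombohedral a), as in a simple cubic lattice. With the bismuth entries of Table 7.5 the six neighbors split into three nearer and three slightly farther ones, which is the distortion along the body diagonal.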


Table 7.6
ELEMENTS WITH TETRAGONAL BRAVAIS LATTICESᵃ

ELEMENT       a (Å)    c (Å)    BASIS

In            4.59     4.94     At face-centered positions
                                of the conventional cell
Sn (white)    5.82     3.17     At 000, 0 ½ ¼, ½ 0 ¾, ½ ½ ½,
                                with respect to the axes
                                of the conventional cell

ᵃ The common length of two perpendicular primitive vectors is a, and the
length of the third, perpendicular to these, is c. Both examples have centered
tetragonal Bravais lattices, indium with a one-atom and white tin with a
two-atom basis. However, both are more commonly described as simple tetragonal
with bases. The conventional cell for indium is chosen to stress that it is a
slightly distorted (along a cube edge) fcc structure. The white tin structure
can be viewed as a diamond structure compressed along one of the cube axes.
Table 7.7
ELEMENTS WITH ORTHORHOMBIC BRAVAIS LATTICESᵃ

ELEMENT        a (Å)     b (Å)     c (Å)

Ga             4.511     4.517     7.645
P (black)      3.31      4.38      10.50
Cl (113 K)     6.24      8.26      4.48
Br (123 K)     6.67      8.72      4.48
I              7.27      9.79      4.79
S (rhombic)    10.47     12.87     24.49

ᵃ The lengths of the three mutually perpendicular primitive vectors are
a, b, and c. The structure of rhombic sulfur is complex, with 128 atoms
per unit cell. The others can be described in terms of an eight-atom unit
cell. For details the reader is referred to Wyckoff.


PROBLEMS 


1.  (a)  Prove  that  any  Bravais  lattice  has  inversion  symmetry  in  a  lattice  point.  (Hint:  Express 
the  lattice  translations  as  linear  combinations  of  primitive  vectors  with  integral  coefficients.) 

(b)  Prove  that  the  diamond  structure  is  invariant  under  an  inversion  in  the  midpoint  of  any 
nearest  neighbor  bond. 

2.  (a)  If three primitive vectors for a trigonal Bravais lattice are at angles of 90° to one another,
the lattice obviously has more than trigonal symmetry, being simple cubic. Show that if the
angles are 60° or arc cos (−1/3) the lattice again has more than trigonal symmetry, being
face-centered cubic or body-centered cubic.

(b)  Show that the simple cubic lattice can be represented as a trigonal lattice with primitive
vectors ai at 60° angles to one another, with a two-point basis ±(1/4)(a1 + a2 + a3). (Compare
these numbers with the crystal structures in Table 7.5.)

(c)  What structure results if the basis in the same trigonal lattice is taken to be ±(1/8)(a1 +
a2 + a3)?

3.  If  two  systems  are  connected  by  arrows  in  the  symmetry  hierarchy  of  Figure  7.7,  then  a 
Bravais  lattice  in  the  more  symmetric  system  can  be  reduced  to  one  of  lower  symmetry  by  an 
infinitesimal distortion, except for the pair hexagonal-trigonal. The appropriate distortions have
been  fully  described  in  the  text  in  all  cases  except  hexagonal-orthorhombic  and  trigonal- 
monoclinic. 

(a)  Describe  an  infinitesimal  distortion  that  reduces  a  simple  hexagonal  Bravais  lattice  to 
one  in  the  orthorhombic  system. 

(b)  What  kind  of  orthorhombic  Bravais  lattice  can  be  reached  in  this  way? 

(c)  Describe  an  infinitesimal  distortion  that  reduces  a  trigonal  Bravais  lattice  to  one  in  the 
monoclinic  system. 

(d)  What  kind  of  monoclinic  Bravais  lattice  can  be  reached  in  this  way? 
4.  (a)  Which of the trigonal point groups described in Table 7.3 is the point group of the
Bravais lattice? That is, which of the representative objects has the symmetry of the object shown
in Figure 7.3f?

(b)  In Figure 7.9 the faces of the object of Figure 7.3f are decorated in various
symmetry-reducing ways to produce objects with the symmetries of the remaining four trigonal point groups.
Referring to Table 7.3, indicate the point-group symmetry of each object.

Figure  7.9 

Objects  with  the  symmetries 
of  the  trigonal  groups  of 
lower  symmetry.  Which  is 
which? 


5.  Which of the 14 Bravais lattices other than face-centered cubic and body-centered cubic do
not have reciprocal lattices of the same kind?

6.  (a)  Show that there is a family of lattice planes perpendicular to any n-fold rotation axis
of a Bravais lattice, n ≥ 3. (The result is also true when n = 2, but requires a somewhat more
elaborate argument (Problem 7).)

(b)  Deduce from (a) that an n-fold axis cannot exist in any three-dimensional Bravais lattice
unless it can exist in some two-dimensional Bravais lattice.

(c)  Prove that no two-dimensional Bravais lattice can have an n-fold axis with n = 5 or n ≥ 7.
(Hint: First show that the axis can be chosen to pass through a lattice point. Then argue by reductio
ad absurdum, using the set of points into which the nearest neighbor of the fixed point is taken by
the n rotations to find a pair of points closer together than the assumed nearest neighbor distance.
(Note that the case n = 5 requires slightly different treatment from the others.))

7.  (a)  Show that if a Bravais lattice has a mirror plane, then there is a family of lattice planes
parallel to the mirror plane. (Hint: Show from the argument on page 113 that the existence of a
mirror plane implies the existence of a mirror plane containing a lattice point. It is then enough to
prove that that plane contains two other lattice points not collinear with the first.)

(b)  Show that if a Bravais lattice has a 2-fold rotation axis then there is a family of lattice
planes perpendicular to the axis.


Chapter 8
Electron Levels in a Periodic Potential: General Properties


The Periodic Potential and Bloch's Theorem
Born-von Karman Boundary Condition
A Second Proof of Bloch's Theorem
Crystal Momentum, Band Index, and Velocity
The Fermi Surface
Density of Levels and van Hove Singularities


Because  the  ions  in  a  perfect  crystal  are  arranged  in  a  regular  periodic  array,  we  are 
led to consider the problem of an electron in a potential U(r) with the periodicity of
the underlying Bravais lattice; i.e.,

$$U(\mathbf{r} + \mathbf{R}) = U(\mathbf{r}) \tag{8.1}$$

for all Bravais lattice vectors R.

Since the scale of periodicity of the potential U (~10⁻⁸ cm) is the size of a typical
de  Broglie  wavelength  of  an  electron  in  the  Sommerfeld  free  electron  model,  it  is 
essential  to  use  quantum  mechanics  in  accounting  for  the  effect  of  periodicity  on 
electronic  motion.  In  this  chapter  we  shall  discuss  those  properties  of  the  electronic 
levels  that  depend  only  on  the  periodicity  of  the  potential,  without  regard  to  its 
particular  form.  The  discussion  will  be  continued  in  Chapters  9  and  10  in  two  limiting 
cases  of  great  physical  interest  that  provide  more  concrete  illustrations  of  the  general 
results of this chapter. In Chapter 11 some of the more important methods for the
detailed  calculation  of  electronic  levels  are  summarized.  In  Chapters  12  and  13  we 
shall  discuss  the  bearing  of  these  results  on  the  problems  of  electronic  transport 
theory  first  raised  in  Chapters  1  and  2,  indicating  how  many  of  the  anomalies  of 
free  electron  theory  (Chapter  3)  are  thereby  removed.  In  Chapters  14  and  15  we  shall 
examine  the  properties  of  specific  metals  that  illustrate  and  confirm  the  general  theory. 
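The comparison of length scales in the first sentence of this paragraph can be made quantitative with the free electron relation k_F = (3π²n)^(1/3) from the Sommerfeld model. The sketch below is an illustration rather than part of the original text; the conduction electron density used for copper, 8.47 × 10²² cm⁻³, is an illustrative input value and the function name is hypothetical.

```python
import math

def fermi_wavelength_cm(n_per_cm3):
    """de Broglie wavelength 2*pi/k_F of a free electron at the Fermi surface,
    with k_F = (3 * pi**2 * n)**(1/3) and n the conduction electron density."""
    k_f = (3.0 * math.pi ** 2 * n_per_cm3) ** (1.0 / 3.0)   # in cm^-1
    return 2.0 * math.pi / k_f

N_COPPER = 8.47e22   # conduction electrons per cm^3 (illustrative value for copper)

if __name__ == "__main__":
    print(f"Fermi wavelength: {fermi_wavelength_cm(N_COPPER):.2e} cm")
```

The result is a few times 10⁻⁸ cm, the same order as the lattice spacing, which is why the periodicity cannot be treated classically.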

We  emphasize  at  the  outset  that  perfect  periodicity  is  an  idealization.  Real  solids 
are  never  absolutely  pure,  and  in  the  neighborhood  of  the  impurity  atoms  the  solid 
is  not  the  same  as  elsewhere  in  the  crystal.  Furthermore,  there  is  always  a  slight 
temperature-dependent  probability  of  finding  missing  or  misplaced  ions  (Chapter  30) 
that  destroy  the  perfect  translational  symmetry  of  even  an  absolutely  pure  crystal. 
Finally, the ions are not in fact stationary, but continually undergo thermal
vibrations about their equilibrium positions.

These  imperfections  are  all  of  great  importance.  They  are,  for  example,  ultimately 
responsible  for  the  fact  that  the  electrical  conductivity  of  metals  is  not  infinite. 
Progress  is  best  made,  however,  by  artificially  dividing  the  problem  into  two  parts: 

(a)  the  ideal  fictitious  perfect  crystal,  in  which  the  potential  is  genuinely  periodic,  and 

(b)  the  effects  on  the  properties  of  a  hypothetical  perfect  crystal  of  all  deviations  from 
perfect  periodicity,  treated  as  small  perturbations. 

We  also  emphasize  that  the  problem  of  electrons  in  a  periodic  potential  does  not 
arise  only  in  the  context  of  metals.  Most  of  our  general  conclusions  apply  to  all 
crystalline  solids,  and  will  play  an  important  role  in  our  subsequent  discussions  of 
insulators  and  semiconductors. 

THE  PERIODIC  POTENTIAL 

The problem of electrons in a solid is in principle a many-electron problem, for the
full Hamiltonian of the solid contains not only the one-electron potentials describing
the interactions of the electrons with the massive atomic nuclei, but also pair potentials
describing the electron-electron interactions. In the independent electron
approximation these interactions are represented by an effective one-electron potential U(r).
The problem of how best to choose this effective potential is a complicated one,
which we shall return to in Chapters 11 and 17. Here we merely observe that whatever
detailed form the one-electron effective potential may have, if the crystal is perfectly
periodic it must satisfy (8.1). From this fact alone many important conclusions can
already be drawn.

Qualitatively,  however,  a  typical  crystalline  potential  might  be  expected  to  have 
the  form  shown  in  Figure  8.1,  resembling  the  individual  atomic  potentials  as  the  ion 
is  approached  closely  and  flattening  off  in  the  region  between  ions. 


Figure  8.1 

A typical crystalline periodic potential, plotted along a line
of ions and along a line midway between a plane of ions.
(Closed circles are the equilibrium ion sites; the solid
curves give the potential along the line of ions; the
dotted curves give the potential along a line between
planes of ions; the dashed curves give the potential of
single isolated ions.)


We are thus led to examine general properties of the Schrödinger equation for a
single electron,

$$H\psi = \left(-\frac{\hbar^2}{2m}\nabla^2 + U(\mathbf{r})\right)\psi = \varepsilon\psi, \tag{8.2}$$

that follow from the fact that the potential U has the periodicity (8.1). The free electron
Schrödinger equation (2.4) is a special case of (8.2) (although, as we shall see, in some
respects a very pathological one), zero potential being the simplest example of a
periodic one.

Independent electrons, each of which obeys a one-electron Schrödinger equation
with a periodic potential, are known as Bloch electrons (in contrast to "free electrons,"
to which Bloch electrons reduce when the periodic potential is identically zero). The
stationary states of Bloch electrons have the following very important property as
a general consequence of the periodicity of the potential U:


BLOCH’S  THEOREM 


Theorem.1 The eigenstates $\psi$ of the one-electron Hamiltonian $H = -\hbar^2\nabla^2/2m + U(\mathbf{r})$, where $U(\mathbf{r} + \mathbf{R}) = U(\mathbf{r})$ for all $\mathbf{R}$ in a Bravais lattice, can be chosen to have the form of a plane wave times a function with the periodicity of the Bravais lattice:

$$ \psi_{n\mathbf{k}}(\mathbf{r}) = e^{i\mathbf{k}\cdot\mathbf{r}}\, u_{n\mathbf{k}}(\mathbf{r}), \tag{8.3} $$

where

$$ u_{n\mathbf{k}}(\mathbf{r} + \mathbf{R}) = u_{n\mathbf{k}}(\mathbf{r}) \tag{8.4} $$

for all $\mathbf{R}$ in the Bravais lattice.2

Note that Eqs. (8.3) and (8.4) imply that

$$ \psi_{n\mathbf{k}}(\mathbf{r} + \mathbf{R}) = e^{i\mathbf{k}\cdot\mathbf{R}}\, \psi_{n\mathbf{k}}(\mathbf{r}). \tag{8.5} $$

1 The theorem was first proved by Floquet in the one-dimensional case, where it is frequently called Floquet's theorem.

Bloch’s theorem is sometimes stated in this alternative form:3 the eigenstates of $H$ can be chosen so that associated with each $\psi$ is a wave vector $\mathbf{k}$ such that

$$ \psi(\mathbf{r} + \mathbf{R}) = e^{i\mathbf{k}\cdot\mathbf{R}}\, \psi(\mathbf{r}), \tag{8.6} $$

for every $\mathbf{R}$ in the Bravais lattice.

We  offer  two  proofs  of  Bloch’s  theorem,  one  from  general  quantum-mechanical 
considerations  and  one  by  explicit  construction.4 


FIRST  PROOF  OF  BLOCH’S  THEOREM 

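For each Bravais lattice vector $\mathbf{R}$ we define a translation operator $T_{\mathbf{R}}$ which, when operating on any function $f(\mathbf{r})$, shifts the argument by $\mathbf{R}$:

$$ T_{\mathbf{R}} f(\mathbf{r}) = f(\mathbf{r} + \mathbf{R}). \tag{8.7} $$

Since the Hamiltonian is periodic, we have

$$ T_{\mathbf{R}} H\psi = H(\mathbf{r} + \mathbf{R})\,\psi(\mathbf{r} + \mathbf{R}) = H(\mathbf{r})\,\psi(\mathbf{r} + \mathbf{R}) = H T_{\mathbf{R}}\psi. \tag{8.8} $$

Because (8.8) holds identically for any function $\psi$, we have the operator identity

$$ T_{\mathbf{R}} H = H T_{\mathbf{R}}. \tag{8.9} $$

In addition, the result of applying two successive translations does not depend on the order in which they are applied, since for any $\psi(\mathbf{r})$

$$ T_{\mathbf{R}} T_{\mathbf{R}'}\psi(\mathbf{r}) = T_{\mathbf{R}'} T_{\mathbf{R}}\psi(\mathbf{r}) = \psi(\mathbf{r} + \mathbf{R} + \mathbf{R}'). \tag{8.10} $$

Therefore

$$ T_{\mathbf{R}} T_{\mathbf{R}'} = T_{\mathbf{R}'} T_{\mathbf{R}} = T_{\mathbf{R} + \mathbf{R}'}. \tag{8.11} $$

Equations (8.9) and (8.11) assert that the $T_{\mathbf{R}}$ for all Bravais lattice vectors $\mathbf{R}$ and the Hamiltonian $H$ form a set of commuting operators. It follows from a fundamental theorem of quantum mechanics5 that the eigenstates of $H$ can therefore be chosen to be simultaneous eigenstates of all the $T_{\mathbf{R}}$:

$$ \begin{aligned} H\psi &= \mathcal{E}\psi, \\ T_{\mathbf{R}}\psi &= c(\mathbf{R})\psi. \end{aligned} \tag{8.12} $$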


2 The index $n$ is known as the band index and occurs because for a given $\mathbf{k}$, as we shall see, there will be many independent eigenstates.

3 Equation (8.6) implies (8.3) and (8.4), since it requires the function $u(\mathbf{r}) = \exp(-i\mathbf{k}\cdot\mathbf{r})\,\psi(\mathbf{r})$ to have the periodicity of the Bravais lattice.

4  The  first  proof  relies  on  some  formal  results  of  quantum  mechanics.  The  second  is  more  elementary, 
but  also  notationally  more  cumbersome. 

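5 See, for example, D. Park, Introduction to the Quantum Theory, McGraw-Hill, New York, 1964, p. 123.

The eigenvalues $c(\mathbf{R})$ of the translation operators are related because of the condition (8.11), for on the one hand

$$ T_{\mathbf{R}'} T_{\mathbf{R}}\psi = c(\mathbf{R})\, T_{\mathbf{R}'}\psi = c(\mathbf{R})\, c(\mathbf{R}')\,\psi, \tag{8.13} $$

while, according to (8.11),

$$ T_{\mathbf{R}'} T_{\mathbf{R}}\psi = T_{\mathbf{R} + \mathbf{R}'}\psi = c(\mathbf{R} + \mathbf{R}')\,\psi. \tag{8.14} $$

It follows that the eigenvalues must satisfy

$$ c(\mathbf{R} + \mathbf{R}') = c(\mathbf{R})\, c(\mathbf{R}'). \tag{8.15} $$

Now let $\mathbf{a}_i$ be three primitive vectors for the Bravais lattice. We can always write the $c(\mathbf{a}_i)$ in the form

$$ c(\mathbf{a}_i) = e^{2\pi i x_i} \tag{8.16} $$

by a suitable choice6 of the $x_i$. It then follows by successive applications of (8.15) that if $\mathbf{R}$ is a general Bravais lattice vector given by

$$ \mathbf{R} = n_1\mathbf{a}_1 + n_2\mathbf{a}_2 + n_3\mathbf{a}_3, \tag{8.17} $$

then

$$ c(\mathbf{R}) = c(\mathbf{a}_1)^{n_1}\, c(\mathbf{a}_2)^{n_2}\, c(\mathbf{a}_3)^{n_3}. \tag{8.18} $$

But this is precisely equivalent to

$$ c(\mathbf{R}) = e^{i\mathbf{k}\cdot\mathbf{R}}, \tag{8.19} $$

where

$$ \mathbf{k} = x_1\mathbf{b}_1 + x_2\mathbf{b}_2 + x_3\mathbf{b}_3 \tag{8.20} $$

and the $\mathbf{b}_i$ are the reciprocal lattice vectors satisfying Eq. (5.4): $\mathbf{b}_i \cdot \mathbf{a}_j = 2\pi\delta_{ij}$.

Summarizing, we have shown that we can choose the eigenstates $\psi$ of $H$ so that for every Bravais lattice vector $\mathbf{R}$,

$$ T_{\mathbf{R}}\psi = \psi(\mathbf{r} + \mathbf{R}) = c(\mathbf{R})\,\psi = e^{i\mathbf{k}\cdot\mathbf{R}}\,\psi(\mathbf{r}). \tag{8.21} $$

This is precisely Bloch’s theorem, in the form (8.6).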
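The commutation relation (8.9) and the multiplicative property (8.15) of the eigenvalues can be checked numerically on a small model. The sketch below is only an illustration under stated assumptions: a one-dimensional ring of identical cells, a finite-difference Hamiltonian, units with $\hbar^2/2m = 1$, and an arbitrary cosine-shaped potential within each cell. The function names `ring_hamiltonian` and `translation` are hypothetical helpers, not part of the text.

```python
import numpy as np

def ring_hamiltonian(n_cells, u_cell):
    """Finite-difference H = -d^2/dx^2 + U(x) on a ring of n_cells identical cells.

    Assumed units: hbar^2/2m = 1 and grid spacing 1.
    u_cell holds the potential on the grid points of a single cell.
    """
    n = n_cells * len(u_cell)
    idx = np.arange(n)
    h = np.diag(2.0 + np.tile(u_cell, n_cells))
    h[idx, (idx + 1) % n] = -1.0   # nearest-neighbour coupling, wrapping around the ring
    h[idx, (idx - 1) % n] = -1.0
    return h

def translation(n_cells, pts_per_cell):
    """Matrix of T_R for R = one lattice constant: (T f)(x) = f(x + R), as in Eq. (8.7)."""
    n = n_cells * pts_per_cell
    idx = np.arange(n)
    t = np.zeros((n, n))
    t[idx, (idx + pts_per_cell) % n] = 1.0
    return t

n_cells = 6
u_cell = np.cos(2.0 * np.pi * np.arange(8) / 8)    # any profile within one cell would do
h = ring_hamiltonian(n_cells, u_cell)
t = translation(n_cells, len(u_cell))

commutes = np.allclose(t @ h, h @ t)               # Eq. (8.9): T_R H = H T_R
c = np.linalg.eigvals(t)                           # eigenvalues c(R) of the translation
roots_of_unity = np.allclose(c ** n_cells, 1.0, atol=1e-6)

# The lowest level of this model is nondegenerate, so it must itself be an
# eigenstate of T_R; here its eigenvalue is c(R) = 1, i.e. k = 0.
energies, states = np.linalg.eigh(h)
psi0 = states[:, 0]
ground_state_is_bloch = np.allclose(t @ psi0, psi0, atol=1e-8)
print(commutes, roots_of_unity, ground_state_is_bloch)
```

On a ring of $N$ cells, translating $N$ times returns every function to itself, which is why the eigenvalues of the one-cell translation come out as $N$th roots of unity, i.e. of the form $e^{ikR}$ with real $k$.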


THE  BORN-VON  KARMAN  BOUNDARY  CONDITION 

By  imposing  an  appropriate  boundary  condition  on  the  wave  functions  we  can 
demonstrate  that  the  wave  vector  k  must  be  real,  and  arrive  at  a  condition  restricting 
the  allowed  values  of  k.  The  condition  generally  chosen  is  the  natural  generalization 
of  the  condition  (2.5)  used  in  the  Sommerfeld  theory  of  free  electrons  in  a  cubical 
box.  As  in  that  case,  we  introduce  the  volume  containing  the  electrons  into  the  theory 
through  a  Born-von  Karman  boundary  condition  of  macroscopic  periodicity  (page 
33).  Unless,  however,  the  Bravais  lattice  is  cubic  and  L  is  an  integral  multiple  of 
the  lattice  constant  a,  it  is  not  convenient  to  continue  to  work  in  a  cubical  volume 
of side $L$. Instead, it is more convenient to work in a volume commensurate with a primitive cell of the underlying Bravais lattice. We therefore generalize the periodic boundary condition (2.5) to

$$ \psi(\mathbf{r} + N_i\mathbf{a}_i) = \psi(\mathbf{r}), \quad i = 1, 2, 3, \tag{8.22} $$

where the $\mathbf{a}_i$ are three primitive vectors and the $N_i$ are all integers of order $N^{1/3}$, where $N = N_1 N_2 N_3$ is the total number of primitive cells in the crystal.

6 We shall see that for suitable boundary conditions the $x_i$ must be real, but for now they can be regarded as general complex numbers.

As  in  Chapter  2,  we  adopt  this  boundary  condition  under  the  assumption  that 
the  bulk  properties  of  the  solid  will  not  depend  on  the  choice  of  boundary  condition, 
which  can  therefore  be  dictated  by  analytical  convenience. 

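Applying Bloch’s theorem (8.6) to the boundary condition (8.22) we find that

$$ \psi_{n\mathbf{k}}(\mathbf{r} + N_i\mathbf{a}_i) = e^{i N_i \mathbf{k}\cdot\mathbf{a}_i}\, \psi_{n\mathbf{k}}(\mathbf{r}), \quad i = 1, 2, 3, \tag{8.23} $$

which requires that

$$ e^{i N_i \mathbf{k}\cdot\mathbf{a}_i} = 1, \quad i = 1, 2, 3. \tag{8.24} $$

When $\mathbf{k}$ has the form (8.20), Eq. (8.24) requires that

$$ e^{2\pi i N_i x_i} = 1, \tag{8.25} $$

and consequently we must have

$$ x_i = \frac{m_i}{N_i}, \quad m_i \text{ integral}. \tag{8.26} $$

Therefore the general form for allowed Bloch wave vectors is7

$$ \mathbf{k} = \sum_{i=1}^{3} \frac{m_i}{N_i}\,\mathbf{b}_i, \quad m_i \text{ integral}. \tag{8.27} $$

It follows from (8.27) that the volume $\Delta\mathbf{k}$ of k-space per allowed value of $\mathbf{k}$ is just the volume of the little parallelepiped with edges $\mathbf{b}_i/N_i$:

$$ \Delta\mathbf{k} = \frac{\mathbf{b}_1}{N_1}\cdot\left(\frac{\mathbf{b}_2}{N_2}\times\frac{\mathbf{b}_3}{N_3}\right) = \frac{1}{N}\,\mathbf{b}_1\cdot(\mathbf{b}_2\times\mathbf{b}_3). \tag{8.28} $$

Since $\mathbf{b}_1\cdot(\mathbf{b}_2\times\mathbf{b}_3)$ is the volume of a reciprocal lattice primitive cell, Eq. (8.28) asserts that the number of allowed wave vectors in a primitive cell of the reciprocal lattice is equal to the number of sites in the crystal.

The volume of a reciprocal lattice primitive cell is $(2\pi)^3/v$, where $v = V/N$ is the volume of a direct lattice primitive cell, so Eq. (8.28) can be written in the alternative form:

$$ \Delta\mathbf{k} = \frac{(2\pi)^3}{V}. \tag{8.29} $$

This is precisely the result (2.18) we found in the free electron case.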
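The counting argument can be made concrete with a short numerical sketch. It assumes, purely for illustration, face-centered cubic primitive vectors with a unit cubic cell and the cell counts $N_1, N_2, N_3 = 4, 3, 5$; the helper names `reciprocal_vectors` and `allowed_wave_vectors` are hypothetical. It enumerates the wave vectors of the form (8.27) in one primitive cell and compares the k-space volume per allowed $\mathbf{k}$ with $(2\pi)^3/V$.

```python
import numpy as np

def reciprocal_vectors(a):
    """Rows of `a` are primitive vectors a_i; rows of the result satisfy b_i . a_j = 2 pi delta_ij."""
    return 2.0 * np.pi * np.linalg.inv(a).T

def allowed_wave_vectors(a, n_cells):
    """All k = sum_i (m_i / N_i) b_i with 0 <= m_i < N_i (Eq. 8.27), filling one primitive cell."""
    b = reciprocal_vectors(a)
    axes = [np.arange(n) / n for n in n_cells]          # x_i = m_i / N_i, Eq. (8.26)
    x = np.stack(np.meshgrid(*axes, indexing="ij"), axis=-1).reshape(-1, 3)
    return x @ b

# Illustrative choice: fcc primitive vectors for a cubic cell of side 1.
a_fcc = 0.5 * np.array([[0.0, 1.0, 1.0], [1.0, 0.0, 1.0], [1.0, 1.0, 0.0]])
n_cells = (4, 3, 5)
k = allowed_wave_vectors(a_fcc, n_cells)

n_total = int(np.prod(n_cells))                         # N = N1 N2 N3
crystal_volume = n_total * abs(np.linalg.det(a_fcc))    # V = N v
delta_k = abs(np.linalg.det(reciprocal_vectors(a_fcc))) / n_total   # Eq. (8.28)

# Born-von Karman condition (8.24): exp(i N_i k . a_i) = 1 for every allowed k.
phases = np.exp(1j * (k @ a_fcc.T) * np.array(n_cells))
print(len(k), n_total, np.isclose(delta_k, (2.0 * np.pi) ** 3 / crystal_volume))
```

The number of rows of `k` equals the number of primitive cells, which is the statement following Eq. (8.28).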


7 Note that (8.27) reduces to the form (2.16) used in free electron theory when the Bravais lattice is simple cubic, the $\mathbf{a}_i$ are the cubic primitive vectors, and $N_1 = N_2 = N_3 = L/a$.


SECOND PROOF OF BLOCH’S THEOREM8

This second proof of Bloch’s theorem illuminates its significance from a rather different point of view, which we shall exploit further in Chapter 9. We start with the observation that one can always expand any function obeying the Born-von Karman boundary condition (8.22) in the set of all plane waves that satisfy the boundary condition and therefore have wave vectors of the form (8.27):9

$$ \psi(\mathbf{r}) = \sum_{\mathbf{q}} c_{\mathbf{q}}\, e^{i\mathbf{q}\cdot\mathbf{r}}. \tag{8.30} $$

Because the potential $U(\mathbf{r})$ is periodic in the lattice, its plane wave expansion will only contain plane waves with the periodicity of the lattice and therefore with wave vectors that are vectors of the reciprocal lattice:10

$$ U(\mathbf{r}) = \sum_{\mathbf{K}} U_{\mathbf{K}}\, e^{i\mathbf{K}\cdot\mathbf{r}}. \tag{8.31} $$

The Fourier coefficients $U_{\mathbf{K}}$ are related to $U(\mathbf{r})$ by11

$$ U_{\mathbf{K}} = \frac{1}{v}\int_{\text{cell}} d\mathbf{r}\; e^{-i\mathbf{K}\cdot\mathbf{r}}\, U(\mathbf{r}). \tag{8.32} $$

Since we are at liberty to change the potential energy by an additive constant, we fix this constant by requiring that the spatial average $U_0$ of the potential over a primitive cell vanish:

$$ U_0 = \frac{1}{v}\int_{\text{cell}} d\mathbf{r}\; U(\mathbf{r}) = 0. \tag{8.33} $$

Note that because the potential $U(\mathbf{r})$ is real, it follows from (8.32) that the Fourier coefficients satisfy

$$ U_{-\mathbf{K}} = U_{\mathbf{K}}^{*}. \tag{8.34} $$

If we assume that the crystal has inversion symmetry12 so that, for a suitable choice of origin, $U(\mathbf{r}) = U(-\mathbf{r})$, then (8.32) implies that $U_{\mathbf{K}}$ is real, and thus

$$ U_{-\mathbf{K}} = U_{\mathbf{K}} = U_{\mathbf{K}}^{*} \quad \text{(for crystals with inversion symmetry)}. \tag{8.35} $$

We now place the expansions (8.30) and (8.31) into the Schrödinger equation (8.2). The kinetic energy term gives

$$ \frac{p^2}{2m}\psi = -\frac{\hbar^2}{2m}\nabla^2\psi = \sum_{\mathbf{q}} \frac{\hbar^2}{2m}\, q^2 c_{\mathbf{q}}\, e^{i\mathbf{q}\cdot\mathbf{r}}. \tag{8.36} $$


8 Although more elementary than the first proof, the second is also notationally more complicated, and of importance primarily as a starting point for the approximate calculations of Chapter 9. The reader may therefore wish to skip it at this point.

9 We shall subsequently understand unspecified summations over $\mathbf{k}$ to be over all wave vectors of the form (8.27) allowed by the Born-von Karman boundary condition.

10  A  sum  indexed  by  K  shall  always  be  understood  to  run  over  all  reciprocal  lattice  vectors. 

11  See  Appendix  D,  where  the  relevance  of  the  reciprocal  lattice  to  Fourier  expansions  of  periodic 
functions  is  discussed. 

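12 The reader is invited to pursue the argument of this section (and Chapter 9) without the assumption of inversion symmetry, which is made solely to avoid inessential complications in the notation.

The term in the potential energy can be written13

$$ U\psi = \left(\sum_{\mathbf{K}} U_{\mathbf{K}}\, e^{i\mathbf{K}\cdot\mathbf{r}}\right)\left(\sum_{\mathbf{q}} c_{\mathbf{q}}\, e^{i\mathbf{q}\cdot\mathbf{r}}\right) = \sum_{\mathbf{K}\mathbf{q}} U_{\mathbf{K}} c_{\mathbf{q}}\, e^{i(\mathbf{K}+\mathbf{q})\cdot\mathbf{r}} = \sum_{\mathbf{K}\mathbf{q}'} U_{\mathbf{K}} c_{\mathbf{q}'-\mathbf{K}}\, e^{i\mathbf{q}'\cdot\mathbf{r}}. \tag{8.37} $$

We change the names of the summation indices in (8.37), from $\mathbf{K}$ and $\mathbf{q}'$ to $\mathbf{K}'$ and $\mathbf{q}$, so that the Schrödinger equation becomes

$$ \sum_{\mathbf{q}} e^{i\mathbf{q}\cdot\mathbf{r}} \left\{ \left(\frac{\hbar^2}{2m}q^2 - \mathcal{E}\right) c_{\mathbf{q}} + \sum_{\mathbf{K}'} U_{\mathbf{K}'} c_{\mathbf{q}-\mathbf{K}'} \right\} = 0. \tag{8.38} $$

Since the plane waves satisfying the Born-von Karman boundary condition are an orthogonal set, the coefficient of each separate term in (8.38) must vanish,14 and therefore for all allowed wave vectors $\mathbf{q}$,

$$ \left(\frac{\hbar^2}{2m}q^2 - \mathcal{E}\right) c_{\mathbf{q}} + \sum_{\mathbf{K}'} U_{\mathbf{K}'} c_{\mathbf{q}-\mathbf{K}'} = 0. \tag{8.39} $$

It is convenient to write $\mathbf{q}$ in the form $\mathbf{q} = \mathbf{k} - \mathbf{K}$, where $\mathbf{K}$ is a reciprocal lattice vector chosen so that $\mathbf{k}$ lies in the first Brillouin zone. Equation (8.39) becomes

$$ \left(\frac{\hbar^2}{2m}(\mathbf{k} - \mathbf{K})^2 - \mathcal{E}\right) c_{\mathbf{k}-\mathbf{K}} + \sum_{\mathbf{K}'} U_{\mathbf{K}'} c_{\mathbf{k}-\mathbf{K}-\mathbf{K}'} = 0, \tag{8.40} $$

or, if we make the change of variables $\mathbf{K}' \to \mathbf{K}' - \mathbf{K}$,

$$ \left(\frac{\hbar^2}{2m}(\mathbf{k} - \mathbf{K})^2 - \mathcal{E}\right) c_{\mathbf{k}-\mathbf{K}} + \sum_{\mathbf{K}'} U_{\mathbf{K}'-\mathbf{K}}\, c_{\mathbf{k}-\mathbf{K}'} = 0. \tag{8.41} $$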

We emphasize that Eqs. (8.39) and (8.41) are nothing but restatements of the original Schrödinger equation (8.2) in momentum space, simplified by the fact that because of the periodicity of the potential, $U_{\mathbf{k}}$ is nonvanishing only when $\mathbf{k}$ is a vector of the reciprocal lattice.

For fixed $\mathbf{k}$ in the first Brillouin zone, the set of equations (8.41) for all reciprocal lattice vectors $\mathbf{K}$ couples only those coefficients $c_{\mathbf{k}}, c_{\mathbf{k}-\mathbf{K}}, c_{\mathbf{k}-\mathbf{K}'}, c_{\mathbf{k}-\mathbf{K}''}, \ldots$ whose wave vectors differ from $\mathbf{k}$ by a reciprocal lattice vector. Thus the original problem has separated into $N$ independent problems: one for each allowed value of $\mathbf{k}$ in the first Brillouin zone. Each such problem has solutions that are superpositions of plane waves containing only the wave vector $\mathbf{k}$ and wave vectors differing from $\mathbf{k}$ by a reciprocal lattice vector.


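13 The last step follows from making the substitution $\mathbf{K} + \mathbf{q} = \mathbf{q}'$, and noting that because $\mathbf{K}$ is a reciprocal lattice vector, summing over all $\mathbf{q}$ of the form (8.27) is the same as summing over all $\mathbf{q}'$ of that form.

14 This can also be deduced from Eq. (D.12), Appendix D, by multiplying (8.38) by the appropriate plane wave and integrating over the volume of the crystal.

Putting this information back into the expansion (8.30) of the wave function $\psi$, we see that if the wave vector $\mathbf{q}$ only assumes the values $\mathbf{k}, \mathbf{k} - \mathbf{K}', \mathbf{k} - \mathbf{K}'', \ldots$, then the wave function will be of the form

$$ \psi_{\mathbf{k}} = \sum_{\mathbf{K}} c_{\mathbf{k}-\mathbf{K}}\, e^{i(\mathbf{k}-\mathbf{K})\cdot\mathbf{r}}. \tag{8.42} $$

If we write this as

$$ \psi_{\mathbf{k}}(\mathbf{r}) = e^{i\mathbf{k}\cdot\mathbf{r}} \left( \sum_{\mathbf{K}} c_{\mathbf{k}-\mathbf{K}}\, e^{-i\mathbf{K}\cdot\mathbf{r}} \right), \tag{8.43} $$

then this is of the Bloch form (8.3) with the periodic function $u(\mathbf{r})$ given by15

$$ u(\mathbf{r}) = \sum_{\mathbf{K}} c_{\mathbf{k}-\mathbf{K}}\, e^{-i\mathbf{K}\cdot\mathbf{r}}. \tag{8.44} $$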
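The set of equations (8.41) for fixed $\mathbf{k}$ is a matrix eigenvalue problem, and a truncated version of it can be diagonalized directly. The following is a minimal sketch under assumptions that are not in the text: one dimension, lattice constant $a = 1$, units with $\hbar^2/2m = 1$, and the potential $U(x) = 2U_1\cos(2\pi x/a)$, whose only nonzero Fourier coefficients are $U_{\pm 2\pi/a} = U_1$ (so that $U_0 = 0$, as in (8.33)). The function names and the cutoff `n_max` are hypothetical.

```python
import numpy as np

def solve_central_equation(k, u1, n_max=5, a=1.0):
    """Diagonalize Eq. (8.41) in one dimension for U(x) = 2*u1*cos(2*pi*x/a).

    Assumed units: hbar^2/2m = 1. The basis is truncated to |K| <= 2*pi*n_max/a.
    Returns the energies (ascending), the coefficients c_{k-K} (one column per
    level), and the reciprocal lattice vectors K kept in the basis.
    """
    big_k = 2.0 * np.pi * np.arange(-n_max, n_max + 1) / a
    size = len(big_k)
    h = np.diag((k - big_k) ** 2)                              # kinetic term (k - K)^2
    h = h + u1 * (np.eye(size, k=1) + np.eye(size, k=-1))      # U_{K'-K} for K'-K = +-2*pi/a
    energies, coeffs = np.linalg.eigh(h)
    return energies, coeffs, big_k

def bloch_wave(x, k, coeffs_n, big_k):
    """psi(x) = sum_K c_{k-K} exp(i (k-K) x), as in Eq. (8.42)."""
    return np.exp(1j * np.outer(x, k - big_k)) @ coeffs_n

k = 0.3
energies, coeffs, big_k = solve_central_equation(k, u1=0.5)
x = np.linspace(0.0, 1.0, 7)
psi = bloch_wave(x, k, coeffs[:, 0], big_k)
psi_shifted = bloch_wave(x + 1.0, k, coeffs[:, 0], big_k)      # translate by R = a = 1
bloch_condition_holds = np.allclose(psi_shifted, np.exp(1j * k) * psi)   # Eq. (8.6)
print(energies[:3], bloch_condition_holds)
```

Each value of `k` yields a whole ladder of energies, one per band index, which is the content of footnote 15. With `u1 = 0` the energies reduce to the free electron values $(k - K)^2$ in these units.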

GENERAL  REMARKS  ABOUT  BLOCH’S  THEOREM 

1. Bloch’s theorem introduces a wave vector $\mathbf{k}$, which turns out to play the same fundamental role in the general problem of motion in a periodic potential that the free electron wave vector $\mathbf{k}$ plays in the Sommerfeld theory. Note, however, that although the free electron wave vector is simply $\mathbf{p}/\hbar$, where $\mathbf{p}$ is the momentum of the electron, in the Bloch case $\mathbf{k}$ is not proportional to the electronic momentum. This is clear on general grounds, since the Hamiltonian does not have complete translational invariance in the presence of a nonconstant potential, and therefore its eigenstates will not be simultaneous eigenstates of the momentum operator. This conclusion is confirmed by the fact that the momentum operator, $\mathbf{p} = (\hbar/i)\nabla$, when acting on $\psi_{n\mathbf{k}}$ gives

$$ \frac{\hbar}{i}\nabla\psi_{n\mathbf{k}} = \frac{\hbar}{i}\nabla\left(e^{i\mathbf{k}\cdot\mathbf{r}}\, u_{n\mathbf{k}}(\mathbf{r})\right) = \hbar\mathbf{k}\,\psi_{n\mathbf{k}} + e^{i\mathbf{k}\cdot\mathbf{r}}\,\frac{\hbar}{i}\nabla u_{n\mathbf{k}}(\mathbf{r}), \tag{8.45} $$

which is not, in general, just a constant times $\psi_{n\mathbf{k}}$; i.e., $\psi_{n\mathbf{k}}$ is not a momentum eigenstate.

Nevertheless, in many ways $\hbar\mathbf{k}$ is a natural extension of $\mathbf{p}$ to the case of a periodic potential. It is known as the crystal momentum of the electron, to emphasize this similarity, but one should not be misled by the name into thinking that $\hbar\mathbf{k}$ is a momentum, for it is not. An intuitive understanding of the dynamical significance of the wave vector $\mathbf{k}$ can only be acquired when one considers the response of Bloch electrons to externally applied electromagnetic fields (Chapter 12). Only then does its full resemblance to $\mathbf{p}/\hbar$ emerge. For the present, the reader should view $\mathbf{k}$ as a quantum number characteristic of the translational symmetry of a periodic potential, just as the momentum $\mathbf{p}$ is a quantum number characteristic of the fuller translational symmetry of free space.

2. The wave vector $\mathbf{k}$ appearing in Bloch’s theorem can always be confined to the first Brillouin zone (or to any other convenient primitive cell of the reciprocal lattice). This is because any $\mathbf{k}'$ not in the first Brillouin zone can be written as

$$ \mathbf{k}' = \mathbf{k} + \mathbf{K} \tag{8.46} $$

where $\mathbf{K}$ is a reciprocal lattice vector and $\mathbf{k}$ does lie in the first zone. Since $e^{i\mathbf{K}\cdot\mathbf{R}} = 1$ for any reciprocal lattice vector, if the Bloch form (8.6) holds for $\mathbf{k}'$, it will also hold for $\mathbf{k}$.

15 Note that there will be (infinitely) many solutions to the (infinite) set of equations (8.41) for a given $\mathbf{k}$. These are classified by the band index $n$ (see footnote 2).
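In one dimension the decomposition (8.46) amounts to subtracting the nearest multiple of $2\pi/a$. The sketch below illustrates this under the assumption of a one-dimensional lattice with lattice constant `a`; the function name `fold_to_first_zone` is hypothetical, and the three-dimensional case (which requires the geometry of the first Brillouin zone) is not attempted.

```python
import numpy as np

def fold_to_first_zone(k_prime, a=1.0):
    """Split k' into (k, K) with k' = k + K (Eq. 8.46): K a 1D reciprocal lattice
    vector and k in the first zone, |k| <= pi/a."""
    g = 2.0 * np.pi / a                                # shortest nonzero reciprocal lattice vector
    big_k = g * np.round(np.asarray(k_prime) / g)      # nearest reciprocal lattice vector
    return k_prime - big_k, big_k

k_prime = 7.0
k, big_k = fold_to_first_zone(k_prime)

# For any lattice vector R = n*a the Bloch phase factors of k' and k coincide,
# because exp(i K R) = 1.
r = 3.0
same_phase = np.isclose(np.exp(1j * k_prime * r), np.exp(1j * k * r))
print(k, big_k, same_phase)
```

A wave vector lying exactly on the zone boundary is equivalent to its partner on the opposite face; which of the two the rounding returns is a matter of convention.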

3. The index $n$ appears in Bloch’s theorem because for given $\mathbf{k}$ there are many solutions to the Schrödinger equation. We noted this in the second proof of Bloch’s theorem, but it can also be seen from the following argument:

Let us look for all solutions to the Schrödinger equation (8.2) that have the Bloch form

$$ \psi(\mathbf{r}) = e^{i\mathbf{k}\cdot\mathbf{r}}\, u(\mathbf{r}), \tag{8.47} $$

where $\mathbf{k}$ is fixed and $u$ has the periodicity of the Bravais lattice. Substituting this into the Schrödinger equation, we find that $u$ is determined by the eigenvalue problem

$$ H_{\mathbf{k}} u_{\mathbf{k}}(\mathbf{r}) = \left( \frac{\hbar^2}{2m}\left(\frac{1}{i}\nabla + \mathbf{k}\right)^2 + U(\mathbf{r}) \right) u_{\mathbf{k}}(\mathbf{r}) = \mathcal{E}_{\mathbf{k}}\, u_{\mathbf{k}}(\mathbf{r}) \tag{8.48} $$

with boundary condition

$$ u_{\mathbf{k}}(\mathbf{r}) = u_{\mathbf{k}}(\mathbf{r} + \mathbf{R}). \tag{8.49} $$

Because of the periodic boundary condition we can regard (8.48) as a Hermitian eigenvalue problem restricted to a single primitive cell of the crystal. Because the eigenvalue problem is set in a fixed finite volume, we expect on general grounds to find an infinite family of solutions with discretely spaced eigenvalues,16 which we label with the band index $n$.

Note that in terms of the eigenvalue problem specified by (8.48) and (8.49), the wave vector $\mathbf{k}$ appears only as a parameter in the Hamiltonian $H_{\mathbf{k}}$. We therefore expect each of the energy levels, for given $\mathbf{k}$, to vary continuously as $\mathbf{k}$ varies.17 In this way we arrive at a description of the levels of an electron in a periodic potential in terms of a family of continuous18 functions $\mathcal{E}_n(\mathbf{k})$.
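The substitution leading from (8.47) to (8.48) can be written out in two lines. This is a sketch of the intermediate step, using only the product rule for the gradient:

```latex
\begin{aligned}
\nabla\left(e^{i\mathbf{k}\cdot\mathbf{r}}\,u\right)
  &= e^{i\mathbf{k}\cdot\mathbf{r}}\left(\nabla + i\mathbf{k}\right)u, \\
-\frac{\hbar^2}{2m}\nabla^2\left(e^{i\mathbf{k}\cdot\mathbf{r}}\,u\right)
  &= -\frac{\hbar^2}{2m}\,e^{i\mathbf{k}\cdot\mathbf{r}}\left(\nabla + i\mathbf{k}\right)^2 u
   = e^{i\mathbf{k}\cdot\mathbf{r}}\,\frac{\hbar^2}{2m}\left(\frac{1}{i}\nabla + \mathbf{k}\right)^2 u .
\end{aligned}
```

The last equality uses $\frac{1}{i}\nabla + \mathbf{k} = -i(\nabla + i\mathbf{k})$, whose square is $-(\nabla + i\mathbf{k})^2$. Inserting this into (8.2) and cancelling the common factor $e^{i\mathbf{k}\cdot\mathbf{r}}$ gives the eigenvalue problem (8.48) for $u$ alone.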

4. Although the full set of levels can be described with $\mathbf{k}$ restricted to a single primitive cell, it is often useful to allow $\mathbf{k}$ to range through all of k-space, even though this gives a highly redundant description. Because the set of all wave functions and energy levels for two values of $\mathbf{k}$ differing by a reciprocal lattice vector must be identical, we can assign the indices $n$ to the levels in such a way that for given $n$, the eigenstates and eigenvalues are periodic functions of $\mathbf{k}$ in the reciprocal lattice:

$$ \begin{aligned} \psi_{n,\mathbf{k}+\mathbf{K}}(\mathbf{r}) &= \psi_{n\mathbf{k}}(\mathbf{r}), \\ \mathcal{E}_{n,\mathbf{k}+\mathbf{K}} &= \mathcal{E}_{n\mathbf{k}}. \end{aligned} \tag{8.50} $$

This leads to a description of the energy levels of an electron in a periodic potential in terms of a family of continuous functions $\mathcal{E}_{n\mathbf{k}}$ (or $\mathcal{E}_n(\mathbf{k})$), each with the periodicity of the reciprocal lattice. The information contained in these functions is referred to as the band structure of the solid.

For each $n$, the set of electronic levels specified by $\mathcal{E}_n(\mathbf{k})$ is called an energy band. The origin of the term “band” will emerge in Chapter 10. Here we only note that because each $\mathcal{E}_n(\mathbf{k})$ is periodic in $\mathbf{k}$ and continuous, it has an upper and lower bound, so that all the levels $\mathcal{E}_n(\mathbf{k})$ lie in the band of energies lying between these limits.

16 Just as the problem of a free electron in a box of fixed finite dimensions has a set of discrete energy levels, the vibrational normal modes of a finite drumhead have a set of discrete frequencies, etc.

17 This expectation is implicit, for example, in ordinary perturbation theory, which is possible only because small changes in parameters in the Hamiltonian lead to small changes in the energy levels. In Appendix E the changes in the energy levels for small changes in $\mathbf{k}$ are calculated explicitly.

18 The fact that the Born-von Karman boundary condition restricts $\mathbf{k}$ to discrete values of the form (8.27) has no bearing on the continuity of $\mathcal{E}_n(\mathbf{k})$ as a function of a continuous variable $\mathbf{k}$, for the eigenvalue problem given by (8.48) and (8.49) makes no reference to the size of the whole crystal and is well defined for any $\mathbf{k}$. One should also note that the set of $\mathbf{k}$ of the form (8.27) becomes dense in k-space in the limit of an infinite crystal.

5. It can be shown quite generally (Appendix E) that an electron in a level specified by band index $n$ and wave vector $\mathbf{k}$ has a nonvanishing mean velocity, given by

$$ \mathbf{v}_n(\mathbf{k}) = \frac{1}{\hbar}\nabla_{\mathbf{k}}\,\mathcal{E}_n(\mathbf{k}). \tag{8.51} $$


This  is  a  most  remarkable  fact.  It  asserts  that  there  are  stationary  (i.e.,  time- 
independent)  levels  for  an  electron  in  a  periodic  potential  in  which,  in  spite  of  the 
interaction  of  the  electron  with  the  fixed  lattice  of  ions,  it  moves  forever  without 
any  degradation  of  its  mean  velocity.  This  is  in  striking  contrast  to  the  idea  of  Drude 
that  collisions  were  simply  encounters  between  the  electron  and  a  static  ion.  Its 
implications  are  of  fundamental  importance,  and  will  be  explored  in  Chapters  12 
and  13. 
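To make (8.50) and (8.51) concrete, the sketch below evaluates the mean velocity for a model band. The band $\mathcal{E}(k) = -2t\cos(ka)$ is a hypothetical one-dimensional example chosen only because it is periodic in the reciprocal lattice; it is not derived in this chapter. Units with $\hbar = 1$, and the parameters `t` and `a`, are likewise assumptions.

```python
import numpy as np

HBAR = 1.0   # assumed units with hbar = 1

def model_band(k, t=1.0, a=1.0):
    """Hypothetical one-dimensional band E(k) = -2 t cos(k a), used only for illustration."""
    return -2.0 * t * np.cos(k * a)

def mean_velocity(band, k, dk=1e-5):
    """v(k) = (1/hbar) dE/dk, the 1D form of Eq. (8.51), by central difference."""
    return (band(k + dk) - band(k - dk)) / (2.0 * dk * HBAR)

k = np.linspace(-np.pi, np.pi, 9)
v = mean_velocity(model_band, k)

# Periodicity in the reciprocal lattice, Eq. (8.50), with K = 2*pi/a:
is_periodic = np.allclose(model_band(k + 2.0 * np.pi), model_band(k))
print(is_periodic)
print(np.round(v, 6))
```

For this model the velocity is $2ta\sin(ka)/\hbar$: it is largest in the middle of the zone half and vanishes only at the band extrema, where the gradient of the energy is zero.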


THE  FERMI  SURFACE 

The ground state of $N$ free electrons19 is constructed by occupying all one-electron levels $\mathbf{k}$ with energies $\mathcal{E}(\mathbf{k}) = \hbar^2k^2/2m$ less than $\mathcal{E}_F$, where $\mathcal{E}_F$ is determined by requiring the total number of one-electron levels with energies less than $\mathcal{E}_F$ to be equal to the total number of electrons (Chapter 2).

The ground state of $N$ Bloch electrons is similarly constructed, except that the one-electron levels are now labeled by the quantum numbers $n$ and $\mathbf{k}$, $\mathcal{E}_n(\mathbf{k})$ does not have the simple explicit free electron form, and $\mathbf{k}$ must be confined to a single primitive cell of the reciprocal lattice if each level is to be counted only once. When the lowest of these levels are filled by a specified number of electrons, two quite distinct types of configuration can result:


19 We shall not distinguish notationally between the number of conduction electrons and the number of primitive cells when it is clear from the context which is meant; they are equal, however, only in a monovalent monatomic Bravais lattice (e.g., the alkali metals).

1. A certain number of bands may be completely filled, all others remaining empty. The difference in energy between the highest occupied level and the lowest unoccupied level (i.e., between the “top” of the highest occupied band and the “bottom” of the lowest empty band) is known as the band gap. We shall find that solids with a band gap greatly in excess of $k_BT$ ($T$ near room temperature) are insulators (Chapter 12). If the band gap is comparable to $k_BT$, the solid is known as an intrinsic semiconductor (Chapter 28). Because the number of levels in a band is equal to the number of primitive cells in the crystal (page 136) and because each level can accommodate two electrons (one of each spin), a configuration with a band gap can arise (though it need not) only if the number of electrons per primitive cell is even.

2. A number of bands may be partially filled. When this occurs, the energy of the highest occupied level, the Fermi energy $\mathcal{E}_F$, lies within the energy range of one or more bands. For each partially filled band there will be a surface in k-space separating the occupied from the unoccupied levels. The set of all such surfaces is known as the Fermi surface, and is the generalization to Bloch electrons of the free electron Fermi sphere. The parts of the Fermi surface arising from individual partially filled bands are known as branches of the Fermi surface.20 We shall see (Chapter 12) that a solid has metallic properties provided that a Fermi surface exists.

Analytically, the branch of the Fermi surface in the $n$th band is that surface in k-space (if there is one) determined by21

$$ \mathcal{E}_n(\mathbf{k}) = \mathcal{E}_F. \tag{8.52} $$

Thus  the  Fermi  surface  is  a  constant  energy  surface  (or  set  of  constant  energy  surfaces) 
in  k-space,  just  as  the  more  familiar  equipotentials  of  electrostatic  theory  are  constant 
energy  surfaces  in  real  space. 

Since the $\mathcal{E}_n(\mathbf{k})$ are periodic in the reciprocal lattice, the complete solution to (8.52) for each $n$ is a k-space surface with the periodicity of the reciprocal lattice. When a branch of the Fermi surface is represented by the full periodic structure, it is said to be described in a repeated zone scheme. Often, however, it is preferable to take just enough of each branch of the Fermi surface so that every physically distinct level is represented by just one point of the surface. This is achieved by representing each branch by that portion of the full periodic surface contained within a single primitive cell of the reciprocal lattice. Such a representation is described as a reduced zone scheme. The primitive cell chosen is often, but not always, the first Brillouin zone.

20 In many important cases the Fermi surface is entirely within a single band, and generally it is found to lie within a fairly small number of bands (Chapter 15).

21 If $\mathcal{E}_F$ is generally defined as the energy separating the highest occupied from the lowest unoccupied level, then it is not uniquely specified in a solid with an energy gap, since any energy in the gap meets this test. People nevertheless speak of “the Fermi energy” of an intrinsic semiconductor. What they mean is the chemical potential, which is well defined at any nonzero temperature (Appendix B). As $T \to 0$, the chemical potential of a solid with an energy gap approaches the energy at the middle of the gap (page 575), and one sometimes finds it asserted that this is the “Fermi energy” of a solid with a gap. With either the correct (undetermined) or colloquial definition of $\mathcal{E}_F$, Eq. (8.52) asserts that solids with a gap have no Fermi surface.

Fermi  surface  geometry  and  its  physical  implications  will  be  illustrated  in  many 
of  the  following  chapters,  particularly  Chapters  9  and  15. 
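The two types of ground-state configuration can be illustrated by filling levels numerically. The sketch below rests on several assumptions that are not in the text: a one-dimensional crystal with lattice constant 1, two hypothetical cosine-shaped bands, and a helper named `gap_after_filling`. It fills the lowest levels with two electrons each and reports the energy difference between the lowest level that is not full and the highest occupied one.

```python
import numpy as np

def gap_after_filling(band_energies, electrons_per_cell):
    """Energy difference between the lowest level that is not full and the
    highest occupied level, after filling the lowest levels.

    band_energies has shape (n_bands, N): E_n(k) on the N allowed k in one
    primitive cell. Each level holds two electrons (one of each spin).
    """
    n_cells = band_energies.shape[1]
    levels = np.sort(band_energies.ravel())
    n_electrons = electrons_per_cell * n_cells
    highest_occupied = levels[(n_electrons + 1) // 2 - 1]
    lowest_not_full = levels[n_electrons // 2]
    return lowest_not_full - highest_occupied

n_cells = 100
k = 2.0 * np.pi * np.arange(n_cells) / n_cells            # allowed k in 1D for a = 1
separated = np.array([-np.cos(k), 3.0 + np.cos(k)])       # hypothetical bands on [-1, 1] and [2, 4]
overlapping = np.array([-np.cos(k), 0.5 + np.cos(k)])     # hypothetical bands on [-1, 1] and [-0.5, 1.5]

gap_two_separated = gap_after_filling(separated, 2)       # lower band full, upper band empty
gap_one_separated = gap_after_filling(separated, 1)       # lower band half filled
gap_two_overlapping = gap_after_filling(overlapping, 2)   # both bands partially filled
print(gap_two_separated, gap_one_separated, gap_two_overlapping)
```

With an odd number of electrons per cell some band is necessarily partially filled. With an even number a gap appears only if the bands do not overlap in energy, which is the sense of the qualification “though it need not” above.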


DENSITY  OF  LEVELS22 


One must often calculate quantities that are weighted sums over the electronic levels of various one-electron properties. Such quantities are of the form23

$$ Q = 2\sum_{n,\mathbf{k}} Q_n(\mathbf{k}), \tag{8.53} $$

where for each $n$ the sum is over all allowed $\mathbf{k}$ giving physically distinct levels, i.e., all $\mathbf{k}$ of the form (8.27) lying in a single primitive cell.24

In the limit of a large crystal the allowed values (8.27) of $\mathbf{k}$ get very close together, and the sum may be replaced with an integral. Since the volume of k-space per allowed $\mathbf{k}$ (Eq. (8.29)) has the same value as in the free electron case, the prescription derived in the free electron case (Eq. (2.29)) remains valid, and we find that25

$$ q = \lim_{V\to\infty}\frac{Q}{V} = 2\sum_n \int \frac{d\mathbf{k}}{(2\pi)^3}\, Q_n(\mathbf{k}), \tag{8.54} $$

where the integral is over a primitive cell.

If, as is often the case,26 $Q_n(\mathbf{k})$ depends on $n$ and $\mathbf{k}$ only through the energy $\mathcal{E}_n(\mathbf{k})$, then in further analogy to the free electron case one can define a density of levels per unit volume (or “density of levels” for short) $g(\mathcal{E})$ so that $q$ has the form (cf. (2.60)):

$$ q = \int d\mathcal{E}\; g(\mathcal{E})\, Q(\mathcal{E}). \tag{8.55} $$

Comparing (8.54) and (8.55) we find that

$$ g(\mathcal{E}) = \sum_n g_n(\mathcal{E}), \tag{8.56} $$

where $g_n(\mathcal{E})$, the density of levels in the $n$th band, is given by

$$ g_n(\mathcal{E}) = \int \frac{d\mathbf{k}}{4\pi^3}\, \delta\big(\mathcal{E} - \mathcal{E}_n(\mathbf{k})\big), \tag{8.57} $$

where the integral is over any primitive cell.


22  The  reader  can,  without  loss  of  continuity,  skip  this  section  at  a  first  reading,  referring  back  to  it 
in  subsequent  chapters  when  necessary. 

23 The factor 2 is because each level specified by $n$ and $\mathbf{k}$ can accommodate two electrons of opposite spin. We assume that $Q_n(\mathbf{k})$ does not depend on the electron spin $s$. If it does, the factor 2 must be replaced by a sum on $s$.

24 The functions $Q_n(\mathbf{k})$ usually have the periodicity of the reciprocal lattice, so the choice of primitive cell is immaterial.

25 See page 37 for the appropriate cautionary remarks.

26 For example, if $q$ is the electronic number density $n$, then $Q(\mathcal{E}) = f(\mathcal{E})$, where $f$ is the Fermi function; if $q$ is the electronic energy density $u$, then $Q(\mathcal{E}) = \mathcal{E} f(\mathcal{E})$.

An alternative representation of the density of levels can be constructed by noting that, as in the free electron case (Eq. (2.62)):

$$ g_n(\mathcal{E})\, d\mathcal{E} = (2/V) \times \left( \text{the number of allowed wave vectors in the } n\text{th band in the energy range from } \mathcal{E} \text{ to } \mathcal{E} + d\mathcal{E} \right). \tag{8.58} $$

The number of allowed wave vectors in the $n$th band in this energy range is just the volume of a k-space primitive cell, with $\mathcal{E} \le \mathcal{E}_n(\mathbf{k}) \le \mathcal{E} + d\mathcal{E}$, divided by the volume per allowed wave vector, $\Delta\mathbf{k} = (2\pi)^3/V$. Thus

$$ g_n(\mathcal{E})\, d\mathcal{E} = \int \frac{d\mathbf{k}}{4\pi^3} \times \begin{cases} 1, & \mathcal{E} \le \mathcal{E}_n(\mathbf{k}) \le \mathcal{E} + d\mathcal{E}, \\ 0, & \text{otherwise}. \end{cases} \tag{8.59} $$

Since $d\mathcal{E}$ is infinitesimal, this can also be expressed as a surface integral. Let $S_n(\mathcal{E})$ be the portion of the surface $\mathcal{E}_n(\mathbf{k}) = \mathcal{E}$ lying within the primitive cell, and let $\delta k(\mathbf{k})$ be the perpendicular distance between the surfaces $S_n(\mathcal{E})$ and $S_n(\mathcal{E} + d\mathcal{E})$ at the point $\mathbf{k}$. Then (Figure 8.2):

$$ g_n(\mathcal{E})\, d\mathcal{E} = \int_{S_n(\mathcal{E})} \frac{dS}{4\pi^3}\, \delta k(\mathbf{k}). \tag{8.60} $$

Figure 8.2

An illustration in two dimensions of the construction expressed in Eq. (8.60). The closed curves are the two constant-energy surfaces, the required area is that lying between them (shaded), and the distance $\delta k(\mathbf{k})$ is indicated for a particular $\mathbf{k}$.

To find an explicit expression for $\delta k(\mathbf{k})$ note that, since $S_n(\mathcal{E})$ is a surface of constant energy, the k-gradient of $\mathcal{E}_n(\mathbf{k})$, $\nabla\mathcal{E}_n(\mathbf{k})$, is a vector normal to that surface whose magnitude is equal to the rate of change of $\mathcal{E}_n(\mathbf{k})$ in the normal direction; i.e.,

$$ \mathcal{E} + d\mathcal{E} = \mathcal{E} + |\nabla\mathcal{E}_n(\mathbf{k})|\, \delta k(\mathbf{k}), \tag{8.61} $$

and hence

$$ \delta k(\mathbf{k}) = \frac{d\mathcal{E}}{|\nabla\mathcal{E}_n(\mathbf{k})|}. \tag{8.62} $$

Substituting (8.62) into (8.60), we arrive at the form

$$ g_n(\mathcal{E}) = \int_{S_n(\mathcal{E})} \frac{dS}{4\pi^3}\, \frac{1}{|\nabla\mathcal{E}_n(\mathbf{k})|} \tag{8.63} $$


which  gives  an  explicit  relation  between  the  density  of  levels  and  the  band  structure. 
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Equation  (8.63)  and  the  analysis  leading  to  it  will  be  applied  in  subsequent 
chapters.27  Here  we  only  note  the  following  quite  general  property  of  the  density 
of  levels : 

Because $\varepsilon_n(\mathbf{k})$ is periodic in the reciprocal lattice, bounded above and below for each $n$, and, in general, everywhere differentiable, there must be values of $\mathbf{k}$ in each primitive cell at which $|\nabla\varepsilon| = 0$. For example, the gradient of a differentiable function vanishes at local maxima and minima, but the boundedness and periodicity of each $\varepsilon_n(\mathbf{k})$ ensure that for each $n$ there will be at least one maximum and minimum in each primitive cell.28

When the gradient of $\varepsilon_n$ vanishes, the integrand in the density of levels (8.63) diverges. It can be shown that in three dimensions29 such singularities are integrable, yielding finite values for $g_n$. However, they do result in divergences of the slope, $dg_n/d\varepsilon$. These are known as van Hove singularities.30 They occur at values of $\varepsilon$ for which the constant energy surface $S_n(\varepsilon)$ contains points at which $\nabla\varepsilon_n(\mathbf{k})$ vanishes. Since derivatives of the density of levels at the Fermi energy enter into all terms but the first in the Sommerfeld expansion,31 one must be on guard for anomalies in low-temperature behavior if there are points of vanishing $\nabla\varepsilon_n(\mathbf{k})$ on the Fermi surface.
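To see concretely how the density of levels can remain finite while its slope diverges, it may help to work through the simplest case, an isotropic band minimum, using (8.63). The effective mass $m^*$ and the band-minimum energy $\varepsilon_0$ are symbols introduced here for illustration only; the anisotropic and saddle-point cases are the subject of Problem 2.

```latex
\begin{aligned}
\varepsilon_n(\mathbf{k}) &= \varepsilon_0 + \frac{\hbar^2 k^2}{2m^*},
\qquad |\nabla\varepsilon_n(\mathbf{k})| = \frac{\hbar^2 k}{m^*}, \\
g_n(\varepsilon) &= \frac{4\pi k^2}{4\pi^3}\,\frac{m^*}{\hbar^2 k}
 = \frac{m^*}{\pi^2\hbar^2}\sqrt{\frac{2m^*(\varepsilon-\varepsilon_0)}{\hbar^2}}
 \;\longrightarrow\; 0 \quad (\varepsilon \to \varepsilon_0^{+}), \\
\frac{dg_n}{d\varepsilon} &= \frac{(m^*)^{3/2}}{\sqrt{2}\,\pi^2\hbar^3}\,
 (\varepsilon-\varepsilon_0)^{-1/2}
 \;\longrightarrow\; \infty \quad (\varepsilon \to \varepsilon_0^{+}).
\end{aligned}
```

The integrand $1/|\nabla\varepsilon_n|$ diverges as $1/k$, but the area of the constant-energy sphere shrinks as $k^2$, so the integral stays finite and only the slope is singular.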

Typical  van  Hove  singularities  are  shown  in  Figure  8.3  and  are  examined  in 
Problem  2,  Chapter  9. 


Figure  8.3 

Characteristic van Hove singularities in the density of
levels, indicated by arrows at right angles to the $\varepsilon$-axis.


This  concludes  our  discussion  of  the  general  features  of  one-electron  levels  in  a 
periodic  potential.32  In  the  following  two  chapters  we  consider  two  very  important, 
but  quite  different,  limiting  cases,  which  provide  concrete  illustrations  of  the  rather 
abstract  discussions  in  this  chapter. 


27  See  also  Problem  2. 

28 A very general analysis of how many points of vanishing gradient must occur is fairly complex.
See, for example, G. Weinreich, Solids, Wiley, New York, 1965, pp. 73-79.

29 In one dimension $g_n(\varepsilon)$ itself will be infinite at a van Hove singularity.

30 Essentially the same singularities occur in the theory of lattice vibrations. See Chapter 23.

31 See, for example, Problem 2f, Chapter 2.

32  Problem  1  pursues  the  general  analysis  somewhat  further  in  the  tractable  but  somewhat  misleading 
case  of  a  one-dimensional  periodic  potential. 
PROBLEMS


1.  Periodic  Potentials  in  One  Dimension 

The general analysis of electronic levels in a periodic potential, independent of the detailed features of that potential, can be carried considerably further in one dimension. Although the one-dimensional case is in many respects atypical (there is no need for a concept of a Fermi surface) or misleading (the possibility, indeed in two and three dimensions the likelihood, of band overlap disappears), it is nevertheless reassuring to see some of the features of three-dimensional band structure we shall describe through approximate calculations, in Chapters 9, 10, and 11, emerging from an exact treatment in one dimension.

Consider, then, a one-dimensional periodic potential $U(x)$ (Figure 8.4). It is convenient to view the ions as residing at the minima of $U$, which we take to define the zero of energy. We choose to view the periodic potential as a superposition of potential barriers $v(x)$ of width $a$, centered at the points $x = \pm na$ (Figure 8.5):

$$U(x) = \sum_{n=-\infty}^{\infty} v(x - na). \tag{8.64}$$

Figure  8.4 

A one-dimensional periodic potential $U(x)$. Note that the
ions occupy the positions of a Bravais lattice of lattice
constant $a$. It is convenient to take these points as having
coordinates $(n + \tfrac{1}{2})a$, and to choose the zero of potential
to occur at the position of the ion.

Figure  8.5 

Illustrating particles incident from the left (a) and right (b)
on a single one of the barriers separating neighboring ions in
the periodic potential of Figure 8.4. The incident, transmitted,
and reflected waves are indicated by arrows along the direction
of propagation, proportional to the corresponding amplitudes.

The term $v(x - na)$ represents the potential barrier against an electron tunneling between the ions on opposite sides of the point $na$. For simplicity we assume that $v(x) = v(-x)$ (the one-dimensional analogue of the inversion symmetry we assumed above), but we make no other assumptions about $v$, so the form of the periodic potential $U$ is quite general.
The band structure of the one-dimensional solid can be expressed quite simply in terms of the properties of an electron in the presence of a single-barrier potential $v(x)$. Consider therefore an electron incident from the left on the potential barrier $v(x)$ with energy33 $\varepsilon = \hbar^2K^2/2m$. Since $v(x) = 0$ when $|x| \ge a/2$, in these regions the wave function $\psi_l(x)$ will have the form

$$\psi_l(x) = \begin{cases} e^{iKx} + r e^{-iKx}, & x \le -\dfrac{a}{2}, \\[4pt] t e^{iKx}, & x \ge \dfrac{a}{2}. \end{cases} \tag{8.65}$$

This is illustrated schematically in Figure 8.5a.

The transmission and reflection coefficients $t$ and $r$ give the probability amplitude that the electron will tunnel through or be reflected from the barrier; they depend on the incident wave vector $K$ in a manner determined by the detailed features of the barrier potential $v$. However, one can deduce many properties of the band structure of the periodic potential $U$ by appealing only to very general properties of $t$ and $r$. Because $v$ is even, $\psi_r(x) = \psi_l(-x)$ is also a solution to the Schrödinger equation with energy $\varepsilon$. From (8.65) it follows that $\psi_r(x)$ has the form

$$\psi_r(x) = \begin{cases} t e^{-iKx}, & x \le -\dfrac{a}{2}, \\[4pt] e^{-iKx} + r e^{iKx}, & x \ge \dfrac{a}{2}. \end{cases} \tag{8.66}$$

Evidently  this  describes  a  particle  incident  on  the  barrier  from  the  right,  as  depicted  in  Figure  8.5b. 

Since $\psi_l$ and $\psi_r$ are two independent solutions to the single-barrier Schrödinger equation with the same energy, any other solution with that energy will be a linear combination34 of these two: $\psi = A\psi_l + B\psi_r$. In particular, since the crystal Hamiltonian is identical to that for a single ion in the region $-a/2 \le x \le a/2$, any solution to the crystal Schrödinger equation with energy $\varepsilon$ must be a linear combination of $\psi_l$ and $\psi_r$ in that region:

$$\psi(x) = A\psi_l(x) + B\psi_r(x), \qquad -\frac{a}{2} \le x \le \frac{a}{2}. \tag{8.67}$$

Now Bloch's theorem asserts that $\psi$ can be chosen to satisfy

$$\psi(x + a) = e^{ika}\psi(x), \tag{8.68}$$

for suitable $k$. Differentiating (8.68) we also find that $\psi' = d\psi/dx$ satisfies

$$\psi'(x + a) = e^{ika}\psi'(x). \tag{8.69}$$

(a) By imposing the conditions (8.68) and (8.69) at $x = -a/2$, and using (8.65) to (8.67), show that the energy of the Bloch electron is related to its wave vector $k$ by:

$$\cos ka = \frac{t^2 - r^2}{2t}\,e^{iKa} + \frac{1}{2t}\,e^{-iKa}, \qquad \varepsilon = \frac{\hbar^2K^2}{2m}. \tag{8.70}$$

Verify that this gives the right answer in the free electron case ($v \equiv 0$).


33  Note:  in  this  problem  K  is  a  continuous  variable  and  has  nothing  to  do  with  the  reciprocal  lattice. 

34  A  special  case  of  the  general  theorem  that  there  are  n  independent  solutions  to  an  nth-order  linear 
differential  equation. 
Equation (8.70) is more informative when one supplies a little more information about the transmission and reflection coefficients. We write the complex number $t$ in terms of its magnitude and phase:

$$t = |t|\,e^{i\delta}. \tag{8.71}$$

The real number $\delta$ is known as the phase shift, since it specifies the change in phase of the transmitted wave relative to the incident one. Electron conservation requires that the probability of transmission plus the probability of reflection be unity:

$$1 = |t|^2 + |r|^2. \tag{8.72}$$


This, and some other useful information, can be proved as follows. Let $\phi_1$ and $\phi_2$ be any two solutions to the one-barrier Schrödinger equation with the same energy:

$$-\frac{\hbar^2}{2m}\phi_i''(x) + v(x)\phi_i(x) = \frac{\hbar^2K^2}{2m}\phi_i(x), \qquad i = 1, 2. \tag{8.73}$$


Define $w(\phi_1, \phi_2)$ (the "Wronskian") by

$$w(\phi_1, \phi_2) = \phi_1'(x)\phi_2(x) - \phi_1(x)\phi_2'(x). \tag{8.74}$$

(b) Prove that $w$ is independent of $x$ by deducing from (8.73) that its derivative vanishes.

(c) Prove (8.72) by evaluating $w(\psi_l, \psi_l^*)$ for $x \le -a/2$ and $x \ge a/2$, noting that because $v(x)$ is real $\psi_l^*$ will be a solution to the same Schrödinger equation as $\psi_l$.

(d) By evaluating $w(\psi_l, \psi_r^*)$ prove that $rt^*$ is pure imaginary, so $r$ must have the form

$$r = \pm i\,|r|\,e^{i\delta}, \tag{8.75}$$

where $\delta$ is the same as in (8.71).

(e) Show as a consequence of (8.70), (8.72), and (8.75) that the energy and wave vector of the Bloch electron are related by

$$\frac{\cos(Ka + \delta)}{|t|} = \cos ka, \qquad \varepsilon = \frac{\hbar^2K^2}{2m}. \tag{8.76}$$


Since $|t|$ is always less than one, but approaches unity for large $K$ (the barrier becomes increasingly less effective as the incident energy grows), the left side of (8.76) plotted against $K$ has the structure depicted in Figure 8.6. For a given $k$, the allowed values of $K$ (and hence the allowed energies $\varepsilon(k) = \hbar^2K^2/2m$) are given by the intersection of the curve in Figure 8.6 with the horizontal line of height $\cos(ka)$. Note that values of $K$ in the neighborhood of those satisfying

$$Ka + \delta = n\pi \tag{8.77}$$

give $|\cos(Ka + \delta)|/|t| > 1$, and are therefore not allowed for any $k$. The corresponding regions of energy are the energy gaps. If $\delta$ is a bounded function of $K$ (as is generally the case), then there will be infinitely many regions of forbidden energy, and also infinitely many regions of allowed energies for each value of $k$.

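(f) Suppose the barrier is very weak (so that $|t| \approx 1$, $|r| \approx 0$, $\delta \approx 0$). Show that the energy gaps are then very narrow, the width of the gap containing $K = n\pi/a$ being

$$\varepsilon_{\text{gap}} \approx 2\pi n\,\frac{\hbar^2}{ma^2}\,|r|. \tag{8.78}$$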


(g) Suppose the barrier is very strong, so that $|t| \approx 0$, $|r| \approx 1$. Show that the allowed bands of energies are then very narrow, with widths

$$\varepsilon_{\max} - \varepsilon_{\min} = O(|t|). \tag{8.79}$$


Figure 8.6

Characteristic form of the function $\cos(Ka + \delta)/|t|$. Because $|t(K)|$ is always less than unity the function will exceed unity in magnitude in the neighborhood of solutions to $Ka + \delta(K) = n\pi$. Equation (8.76) can be satisfied for real $k$ if and only if the function is less than unity in magnitude. Consequently there will be allowed (unshaded) and forbidden (shaded) regions of $K$ (and therefore of $\varepsilon = \hbar^2K^2/2m$). Note that when $|t|$ is very near unity (weak potential) the forbidden regions will be narrow, but if $|t|$ is very small (strong potential) the allowed regions will be narrow.


(h) As a concrete example, one often considers the case in which $v(x) = g\,\delta(x)$, where $\delta(x)$ is the Dirac delta function (a special case of the "Kronig-Penney model"). Show that in this case

$$\cot\delta = -\frac{\hbar^2K}{mg}, \qquad |t| = \cos\delta. \tag{8.80}$$

This model is a common textbook example of a one-dimensional periodic potential. Note, however, that most of the structure we have established is, to a considerable degree, independent of the particular functional dependence of $|t|$ and $\delta$ on $K$.
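The relations (8.72), (8.75), (8.76), and (8.80) can be spot-checked numerically for the delta-function barrier. The sketch below is illustrative only: it assumes units $\hbar = m = 1$, takes the standard single-barrier amplitudes obtained by matching the wave function and its derivative jump at $x = 0$ as given rather than deriving them, and uses function names that are not from the text.

```python
import cmath
import math

def delta_barrier_amplitudes(K, g):
    """Transmission and reflection amplitudes (t, r) for v(x) = g * delta(x).
    Assumed units: hbar = m = 1, so beta = m g / (hbar^2 K) = g / K."""
    beta = g / K
    t = 1.0 / (1.0 + 1j * beta)
    r = t - 1.0          # continuity of the wave function at x = 0
    return t, r

def band_function(K, g, a=1.0):
    """Left side of (8.76): cos(K a + delta) / |t|, with delta the phase of t."""
    t, _ = delta_barrier_amplitudes(K, g)
    delta = cmath.phase(t)
    return math.cos(K * a + delta) / abs(t)

t, r = delta_barrier_amplitudes(K=2.0, g=1.5)
print(abs(t) ** 2 + abs(r) ** 2)     # flux conservation, (8.72)
print((r * t.conjugate()).real)      # real part of r t*, cf. (8.75)

# A value of K is allowed for some real k only where |band_function| <= 1.
for K in (math.pi - 0.1, math.pi, math.pi + 0.1):
    print(K, band_function(K, g=2.0))
```

For a repulsive barrier ($g > 0$) the phase shift is negative, so in this sketch the forbidden region sits just above $K = n\pi/a$, where $Ka + \delta = n\pi$.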


2.  Density  of  Levels 

(a) In the free electron case the density of levels at the Fermi energy can be written in the form (Eq. (2.64)) $g(\varepsilon_F) = mk_F/\hbar^2\pi^2$. Show that the general form (8.63) reduces to this when $\varepsilon_n(\mathbf{k}) = \hbar^2k^2/2m$ and the (spherical) Fermi surface lies entirely within a primitive cell.

(b) Consider a band in which, for sufficiently small $k$, $\varepsilon_n(\mathbf{k}) = \varepsilon_0 + (\hbar^2/2)(k_x^2/m_x + k_y^2/m_y + k_z^2/m_z)$ (as might be the case in a crystal of orthorhombic symmetry) where $m_x$, $m_y$, and $m_z$ are positive constants. Show that if $\varepsilon$ is close enough to $\varepsilon_0$ that this form is valid, then $g_n(\varepsilon)$ is proportional to $(\varepsilon - \varepsilon_0)^{1/2}$, so its derivative becomes infinite (van Hove singularity) as $\varepsilon$ approaches the band minimum. [Hint: Use the form (8.57) for the density of levels.] Deduce from this that if the quadratic form for $\varepsilon_n(\mathbf{k})$ remains valid up to $\varepsilon_F$, then $g_n(\varepsilon_F)$ can be written in the obvious generalization of the free electron form (2.65):

$$g_n(\varepsilon_F) = \frac{3}{2}\,\frac{n}{\varepsilon_F - \varepsilon_0}, \tag{8.81}$$

where $n$ is the contribution of the electrons in the band to the total electronic density.
(c) Consider the density of levels in the neighborhood of a saddle point, where $\varepsilon_n(\mathbf{k}) = \varepsilon_0 + (\hbar^2/2)(k_x^2/m_x + k_y^2/m_y - k_z^2/m_z)$ where $m_x$, $m_y$, and $m_z$ are positive constants. Show that when $\varepsilon \approx \varepsilon_0$, the derivative of the density of levels has the form

$$g_n'(\varepsilon) \approx \begin{cases} \text{constant}, & \varepsilon > \varepsilon_0; \\ (\varepsilon_0 - \varepsilon)^{-1/2}, & \varepsilon < \varepsilon_0. \end{cases} \tag{8.82}$$
One can gain substantial insight into the structure imposed on the electronic energy levels by a periodic potential, if that potential is very weak. This approach might once have been regarded as an instructive, but academic, exercise. We now know, however, that in many cases this apparently unrealistic assumption gives results surprisingly close to the mark. Modern theoretical and experimental studies of the metals found in groups I, II, III, and IV of the periodic table (i.e., metals whose atomic structure consists of s and p electrons outside of a closed-shell noble gas configuration) indicate that the conduction electrons can be described as moving in what amounts to an almost constant potential. These elements are often referred to as "nearly free electron" metals, because the starting point for their description is the Sommerfeld free electron gas, modified by the presence of a weak periodic potential. In this chapter we shall examine some of the broad general features of band structure from the almost free electron point of view. Applications to particular metals will be examined in Chapter 15.

It is by no means obvious why the conduction bands of these metals should be so free-electron-like. There are two fundamental reasons why the strong interactions of the conduction electrons with each other and with the positive ions can have the net effect of a very weak potential.

1. The electron-ion interaction is strongest at small separations, but the conduction electrons are forbidden (by the Pauli principle) from entering the immediate neighborhood of the ions because this region is already occupied by the core electrons.

2. In the region in which the conduction electrons are allowed, their mobility further diminishes the net potential any single electron experiences, for they can screen the fields of positively charged ions, diminishing the total effective potential.

These remarks offer only the barest indication of why the following discussion has extensive practical application. We shall return later to the problem of justifying the nearly free electron approach, taking up point 1 in Chapter 11 and point 2 in Chapter 17.

GENERAL  APPROACH  TO  THE  SCHRODINGER  EQUATION 
WHEN  THE  POTENTIAL  IS  WEAK 

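When the periodic potential is zero, the solutions to Schrödinger's equation are plane waves. A reasonable starting place for the treatment of weak periodic potentials is therefore the expansion of the exact solution in plane waves described in Chapter 8. The wave function of a Bloch level with crystal momentum $\mathbf{k}$ can be written in the form given in Eq. (8.42):

$$\psi_{\mathbf{k}}(\mathbf{r}) = \sum_{\mathbf{K}} c_{\mathbf{k}-\mathbf{K}}\, e^{i(\mathbf{k}-\mathbf{K})\cdot\mathbf{r}}, \tag{9.1}$$

where the coefficients $c_{\mathbf{k}-\mathbf{K}}$ and the energy of the level $\varepsilon$ are determined by the set of Eqs. (8.41):

$$\left[\frac{\hbar^2}{2m}(\mathbf{k}-\mathbf{K})^2 - \varepsilon\right] c_{\mathbf{k}-\mathbf{K}} + \sum_{\mathbf{K}'} U_{\mathbf{K}'-\mathbf{K}}\, c_{\mathbf{k}-\mathbf{K}'} = 0. \tag{9.2}$$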




The  sum  in  (9.1)  is  over  all  reciprocal  lattice  vectors  K,  and  for  fixed  k  there  is  an 
equation  of  the  form  (9.2)  for  each  reciprocal  lattice  vector  K.  The  (infinitely  many) 
different  solutions  to  (9.2)  for  a  given  k  are  labeled  with  the  band  index  n.  The  wave 
vector  k  can  (but  need  not)  be  considered  to  lie  in  the  first  Brillouin  zone  of  k-space. 

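In the free electron case, all the Fourier components $U_{\mathbf{K}}$ are precisely zero. Equation (9.2) then becomes

$$(\varepsilon^0_{\mathbf{k}-\mathbf{K}} - \varepsilon)\, c_{\mathbf{k}-\mathbf{K}} = 0, \tag{9.3}$$

where we have introduced the notation:

$$\varepsilon^0_{\mathbf{q}} = \frac{\hbar^2}{2m}\,q^2. \tag{9.4}$$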


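Equation (9.3) requires for each $\mathbf{K}$ that either $c_{\mathbf{k}-\mathbf{K}} = 0$ or $\varepsilon = \varepsilon^0_{\mathbf{k}-\mathbf{K}}$. The latter possibility can occur for only a single $\mathbf{K}$, unless it happens that some of the $\varepsilon^0_{\mathbf{k}-\mathbf{K}}$ are equal for several different choices of $\mathbf{K}$. If such degeneracy does not occur, then we have the expected class of free electron solutions:

$$\varepsilon = \varepsilon^0_{\mathbf{k}-\mathbf{K}}, \qquad \psi_{\mathbf{k}} \propto e^{i(\mathbf{k}-\mathbf{K})\cdot\mathbf{r}}. \tag{9.5}$$

If, however, there is a group of reciprocal lattice vectors $\mathbf{K}_1, \ldots, \mathbf{K}_m$ satisfying

$$\varepsilon^0_{\mathbf{k}-\mathbf{K}_1} = \cdots = \varepsilon^0_{\mathbf{k}-\mathbf{K}_m}, \tag{9.6}$$

then when $\varepsilon$ is equal to the common value of these free electron energies there are $m$ independent degenerate plane wave solutions. Since any linear combination of degenerate solutions is also a solution, one has complete freedom in choosing the coefficients $c_{\mathbf{k}-\mathbf{K}}$ for $\mathbf{K} = \mathbf{K}_1, \ldots, \mathbf{K}_m$.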
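The degeneracy condition (9.6) is easy to explore in one dimension. The following minimal sketch lists the free electron levels $\varepsilon^0_{k-K}$ for a chain of lattice constant $a$ and groups the reciprocal lattice vectors $K = 2\pi n/a$ whose levels coincide. The units ($\hbar^2/2m = 1$), the cutoff `n_max`, the tolerance, and the function names are assumptions made for illustration.

```python
import math

def free_levels(k, a=1.0, n_max=3):
    """Pairs (n, eps0_{k-K}) with K = 2 pi n / a, sorted by energy.
    Assumed units: hbar^2 / 2m = 1, so eps0_q = q^2 as in (9.4)."""
    levels = []
    for n in range(-n_max, n_max + 1):
        K = 2.0 * math.pi * n / a
        levels.append((n, (k - K) ** 2))
    return sorted(levels, key=lambda item: item[1])

def degenerate_groups(k, tol=1e-9, a=1.0, n_max=3):
    """Groups of indices n whose free electron levels agree within tol, cf. (9.6)."""
    groups = []
    for n, energy in free_levels(k, a=a, n_max=n_max):
        if groups and abs(groups[-1][0] - energy) <= tol:
            groups[-1][1].append(n)
        else:
            groups.append((energy, [n]))
    return [(energy, ns) for energy, ns in groups if len(ns) > 1]

print(degenerate_groups(0.3))        # a generic k
print(degenerate_groups(math.pi))    # k on the zone boundary pi / a
```

At a generic $k$ no two levels coincide, while at $k = \pi/a$ the levels for $K = 0$ and $K = 2\pi/a$ become equal, which is the situation taken up in the degenerate case.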

These  simple  observations  acquire  more  substance  when  the  UK  are  not  zero,  but 
very  small.  The  analysis  still  divides  naturally  into  two  cases,  corresponding  to  the 
nondegenerate  and  degenerate  cases  for  free  electrons.  Now,  however,  the  basis  for 
the  distinction  is  not  the  exact  equality1  of  two  or  more  distinct  free  electron  levels, 
but  only  whether  they  are  equal  aside  from  terms  of  order  U. 

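Case 1 Fix $\mathbf{k}$ and consider a particular reciprocal lattice vector $\mathbf{K}_1$ such that the free electron energy $\varepsilon^0_{\mathbf{k}-\mathbf{K}_1}$ is far from the values of $\varepsilon^0_{\mathbf{k}-\mathbf{K}}$ (for all other $\mathbf{K}$) compared with $U$ (see Figure 9.1)2:

$$|\varepsilon^0_{\mathbf{k}-\mathbf{K}_1} - \varepsilon^0_{\mathbf{k}-\mathbf{K}}| \gg U, \quad \text{for fixed } \mathbf{k} \text{ and all } \mathbf{K} \ne \mathbf{K}_1. \tag{9.7}$$

We wish to investigate the effect of the potential on that free electron level given by:

$$\varepsilon = \varepsilon^0_{\mathbf{k}-\mathbf{K}_1}, \qquad c_{\mathbf{k}-\mathbf{K}} = 0, \quad \mathbf{K} \ne \mathbf{K}_1. \tag{9.8}$$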

1 The reader familiar with stationary perturbation theory may think that if there is no exact degeneracy, we can always make all level differences large compared with $U$ by considering sufficiently small $U$. That is indeed true for any given $\mathbf{k}$. However, once we are given a definite $U$, no matter how small, we want a procedure valid for all $\mathbf{k}$ in the first Brillouin zone. We shall see that no matter how small $U$ is we can always find some values of $\mathbf{k}$ for which the unperturbed levels are closer together than $U$. Therefore what we are doing is more subtle than conventional degenerate perturbation theory.

2 In inequalities of this form we shall use $U$ to refer to a typical Fourier component of the potential.
Figure 9.1

For the range of $\mathbf{k}$ within limits
indicated by the dark band the
free electron levels $\varepsilon^0_{\mathbf{k}-\mathbf{K}_1}$ and
$\varepsilon^0_{\mathbf{k}-\mathbf{K}}$ differ by an energy $O(U)$.


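Setting $\mathbf{K} = \mathbf{K}_1$ in Eq. (9.2) (and using the short notation (9.4)) we have (dropping the prime from the summation index):

$$(\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}_1})\, c_{\mathbf{k}-\mathbf{K}_1} = \sum_{\mathbf{K}} U_{\mathbf{K}-\mathbf{K}_1}\, c_{\mathbf{k}-\mathbf{K}}. \tag{9.9}$$

Because we have picked the additive constant in the potential energy so that $U_{\mathbf{K}} = 0$ when $\mathbf{K} = 0$ (see page 137), only terms with $\mathbf{K} \ne \mathbf{K}_1$ appear on the right-hand side of (9.9). Since we are examining that solution for which $c_{\mathbf{k}-\mathbf{K}}$ vanishes when $\mathbf{K} \ne \mathbf{K}_1$ in the limit of vanishing $U$, we expect the right-hand side of (9.9) to be of second order in $U$. This can be explicitly confirmed by writing Eq. (9.2) for $\mathbf{K} \ne \mathbf{K}_1$ as

$$c_{\mathbf{k}-\mathbf{K}} = \frac{U_{\mathbf{K}_1-\mathbf{K}}\, c_{\mathbf{k}-\mathbf{K}_1}}{\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}}} + \sum_{\mathbf{K}' \ne \mathbf{K}_1} \frac{U_{\mathbf{K}'-\mathbf{K}}\, c_{\mathbf{k}-\mathbf{K}'}}{\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}}}. \tag{9.10}$$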


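We have separated out of the sum in (9.10) the term containing $c_{\mathbf{k}-\mathbf{K}_1}$, since it will be an order of magnitude larger than the remaining terms, which involve $c_{\mathbf{k}-\mathbf{K}'}$ for $\mathbf{K}' \ne \mathbf{K}_1$. This conclusion depends on the assumption (9.7) that the level $\varepsilon^0_{\mathbf{k}-\mathbf{K}_1}$ is not nearly degenerate to some other $\varepsilon^0_{\mathbf{k}-\mathbf{K}}$. Such a near degeneracy could cause some of the denominators in (9.10) to be of order $U$, canceling the explicit $U$ in the numerator and resulting in additional terms in the sum in (9.10) comparable to the $\mathbf{K} = \mathbf{K}_1$ term.

Therefore, provided there is no near degeneracy,

$$c_{\mathbf{k}-\mathbf{K}} = \frac{U_{\mathbf{K}_1-\mathbf{K}}\, c_{\mathbf{k}-\mathbf{K}_1}}{\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}}} + O(U^2). \tag{9.11}$$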


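Placing this in (9.9), we find:

$$(\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}_1})\, c_{\mathbf{k}-\mathbf{K}_1} = \sum_{\mathbf{K}} \frac{U_{\mathbf{K}-\mathbf{K}_1}\, U_{\mathbf{K}_1-\mathbf{K}}}{\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}}}\, c_{\mathbf{k}-\mathbf{K}_1} + O(U^3). \tag{9.12}$$

Thus the perturbed energy level $\varepsilon$ differs from the free electron value $\varepsilon^0_{\mathbf{k}-\mathbf{K}_1}$ by terms of order $U^2$. To solve Eq. (9.12) for $\varepsilon$ to this order, it therefore suffices to replace the $\varepsilon$ appearing in the denominator on the right-hand side by $\varepsilon^0_{\mathbf{k}-\mathbf{K}_1}$, leading to the following expression3 for $\varepsilon$, correct to second order in $U$:

$$\varepsilon = \varepsilon^0_{\mathbf{k}-\mathbf{K}_1} + \sum_{\mathbf{K}} \frac{|U_{\mathbf{K}-\mathbf{K}_1}|^2}{\varepsilon^0_{\mathbf{k}-\mathbf{K}_1} - \varepsilon^0_{\mathbf{k}-\mathbf{K}}} + O(U^3). \tag{9.13}$$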


Equation (9.13) asserts that weakly perturbed nondegenerate bands repel each other, for every level $\varepsilon^0_{\mathbf{k}-\mathbf{K}}$ that lies below $\varepsilon^0_{\mathbf{k}-\mathbf{K}_1}$ contributes a term in (9.13) that raises the value of $\varepsilon$, while every level that lies above $\varepsilon^0_{\mathbf{k}-\mathbf{K}_1}$ contributes a term that lowers the energy. However, the most important feature to emerge from this analysis of the case of no near degeneracy is simply the gross observation that the shift in energy from the free electron value is second order in $U$. In the nearly degenerate case (as we shall now see) the shift in energy can be linear in $U$. Therefore, to leading order in the weak periodic potential, it is only the nearly degenerate free electron levels that are significantly shifted, and we must devote most of our attention to this important case.
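The second-order result (9.13) can be compared with a direct numerical solution of the central equation (9.2). The sketch below is a hedged illustration, not the text's method: it assumes a one-dimensional model potential $U(x) = 2U\cos(2\pi x/a)$, so that $U_K = U$ for $K = \pm 2\pi/a$ and zero otherwise, works in units $\hbar^2/2m = 1$, truncates the plane-wave basis at `n_max`, and finds the lowest eigenvalue of the resulting symmetric tridiagonal matrix by a Sturm-sequence bisection so that no linear algebra library is needed. All names are hypothetical.

```python
import math

def lowest_level(k, U, a=1.0, n_max=8):
    """Lowest eigenvalue of (9.2) for the assumed two-component potential."""
    diag = [(k - 2.0 * math.pi * n / a) ** 2 for n in range(-n_max, n_max + 1)]

    def count_below(x):
        # Number of eigenvalues below x equals the number of negative pivots
        # in the LDL^T factorization of (H - x), by Sylvester's law of inertia.
        count, pivot = 0, 1.0
        for i, d in enumerate(diag):
            pivot = d - x - (U * U / pivot if i > 0 else 0.0)
            if pivot == 0.0:
                pivot = -1e-300
            if pivot < 0.0:
                count += 1
        return count

    lo = min(diag) - 2.0 * abs(U) - 1.0   # below every eigenvalue (Gershgorin)
    hi = min(diag) + 1.0                  # above the lowest one (variational bound)
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if count_below(mid) >= 1:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

def second_order_level(k, U, a=1.0):
    """Equation (9.13) for the level with K_1 = 0 and the same potential."""
    e0 = k ** 2
    shift = 0.0
    for n in (-1, 1):
        shift += U * U / (e0 - (k - 2.0 * math.pi * n / a) ** 2)
    return e0 + shift

k, U = 0.5, 0.5
print(lowest_level(k, U), second_order_level(k, U), k ** 2)
```

With $k$ well away from the zone boundary, the two results should differ only at higher order in $U$, and both lie below the free electron value because the two nearest levels lie above it and push it down.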


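Case 2 Suppose the value of $\mathbf{k}$ is such that there are reciprocal lattice vectors $\mathbf{K}_1, \ldots, \mathbf{K}_m$ with $\varepsilon^0_{\mathbf{k}-\mathbf{K}_1}, \ldots, \varepsilon^0_{\mathbf{k}-\mathbf{K}_m}$ all within order $U$ of each other,4 but far apart from the other $\varepsilon^0_{\mathbf{k}-\mathbf{K}}$ on the scale of $U$:

$$|\varepsilon^0_{\mathbf{k}-\mathbf{K}} - \varepsilon^0_{\mathbf{k}-\mathbf{K}_i}| \gg U, \quad i = 1, \ldots, m, \quad \mathbf{K} \ne \mathbf{K}_1, \ldots, \mathbf{K}_m. \tag{9.14}$$

In this case we must treat separately those equations given by (9.2) when $\mathbf{K}$ is set equal to any of the $m$ values $\mathbf{K}_1, \ldots, \mathbf{K}_m$. This gives $m$ equations corresponding to the single equation (9.9) in the nondegenerate case. In these $m$ equations we separate from the sum those terms containing the coefficients $c_{\mathbf{k}-\mathbf{K}_j}$, $j = 1, \ldots, m$, which need not be small in the limit of vanishing interaction, from the remaining $c_{\mathbf{k}-\mathbf{K}}$, which will be at most of order $U$. Thus we have

$$(\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}_i})\, c_{\mathbf{k}-\mathbf{K}_i} = \sum_{j=1}^{m} U_{\mathbf{K}_j-\mathbf{K}_i}\, c_{\mathbf{k}-\mathbf{K}_j} + \sum_{\mathbf{K} \ne \mathbf{K}_1, \ldots, \mathbf{K}_m} U_{\mathbf{K}-\mathbf{K}_i}\, c_{\mathbf{k}-\mathbf{K}}, \quad i = 1, \ldots, m. \tag{9.15}$$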

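Making the same separation in the sum, we can write Eq. (9.2) for the remaining levels as

$$c_{\mathbf{k}-\mathbf{K}} = \frac{1}{\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}}} \left( \sum_{j=1}^{m} U_{\mathbf{K}_j-\mathbf{K}}\, c_{\mathbf{k}-\mathbf{K}_j} + \sum_{\mathbf{K}' \ne \mathbf{K}_1, \ldots, \mathbf{K}_m} U_{\mathbf{K}'-\mathbf{K}}\, c_{\mathbf{k}-\mathbf{K}'} \right), \quad \mathbf{K} \ne \mathbf{K}_1, \ldots, \mathbf{K}_m, \tag{9.16}$$

(which corresponds to equation (9.10) in the case of no near degeneracy).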

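Since $c_{\mathbf{k}-\mathbf{K}}$ will be at most of order $U$ when $\mathbf{K} \ne \mathbf{K}_1, \ldots, \mathbf{K}_m$, Eq. (9.16) gives

$$c_{\mathbf{k}-\mathbf{K}} = \frac{1}{\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}}} \sum_{j=1}^{m} U_{\mathbf{K}_j-\mathbf{K}}\, c_{\mathbf{k}-\mathbf{K}_j} + O(U^2). \tag{9.17}$$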


3 We use Eq. (8.34), $U_{-\mathbf{K}} = U_{\mathbf{K}}^*$.

4 In one dimension $m$ cannot exceed 2, but in three dimensions it can be quite large.
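Placing this in (9.15), we find that

$$(\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}_i})\, c_{\mathbf{k}-\mathbf{K}_i} = \sum_{j=1}^{m} U_{\mathbf{K}_j-\mathbf{K}_i}\, c_{\mathbf{k}-\mathbf{K}_j} + \sum_{j=1}^{m} \left( \sum_{\mathbf{K} \ne \mathbf{K}_1, \ldots, \mathbf{K}_m} \frac{U_{\mathbf{K}-\mathbf{K}_i}\, U_{\mathbf{K}_j-\mathbf{K}}}{\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}}} \right) c_{\mathbf{k}-\mathbf{K}_j} + O(U^3). \tag{9.18}$$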


Compare this with the result (9.12) in the case of no near degeneracy. There we found an explicit expression for the shift in energy to order $U^2$ (to which the set of equations (9.18) reduces when $m = 1$). Now, however, we find that to an accuracy of order $U^2$ the determination of the shifts in the $m$ nearly degenerate levels reduces to the solution of $m$ coupled equations5 for the $c_{\mathbf{k}-\mathbf{K}_i}$. Furthermore, the coefficients in the second term on the right-hand side of these equations are of higher order in $U$ than those in the first.6 Consequently, to find the leading corrections in $U$ we can replace (9.18) by the far simpler equations:7

$$(\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}_i})\, c_{\mathbf{k}-\mathbf{K}_i} = \sum_{j=1}^{m} U_{\mathbf{K}_j-\mathbf{K}_i}\, c_{\mathbf{k}-\mathbf{K}_j}, \quad i = 1, \ldots, m, \tag{9.19}$$


which  are  just  the  general  equations  for  a  system  of  m  quantum  levels. 
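Equations (9.19) are an $m \times m$ eigenvalue problem for the coefficients $c_{\mathbf{k}-\mathbf{K}_i}$. As a minimal sketch, the helper below evaluates the residual of (9.19) for a trial energy and coefficient vector, so that candidate solutions can be checked by hand. The example of three exactly degenerate levels with equal real couplings is hypothetical, as are the numerical values and the function name; in practice one would hand the same matrix to a linear algebra routine.

```python
def residuals(eps, free_energies, coupling, coeffs):
    """Left side minus right side of (9.19) for each i = 1..m.
    coupling[i][j] stands for U_{K_j - K_i}; its diagonal is U_0 = 0."""
    m = len(free_energies)
    out = []
    for i in range(m):
        rhs = sum(coupling[i][j] * coeffs[j] for j in range(m))
        out.append((eps - free_energies[i]) * coeffs[i] - rhs)
    return out

# Hypothetical case: three degenerate free electron levels e0, all coupled by U.
e0, U = 1.0, 0.2
coupling = [[0.0, U, U],
            [U, 0.0, U],
            [U, U, 0.0]]

# The symmetric combination is shifted by 2U; combinations summing to zero by -U.
print(residuals(e0 + 2 * U, [e0] * 3, coupling, [1, 1, 1]))
print(residuals(e0 - U, [e0] * 3, coupling, [1, -1, 0]))
```

Both residual vectors should vanish to rounding error, showing that the shifts here are first order in $U$, in contrast with the second-order shift of the nondegenerate case.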


ENERGY  LEVELS  NEAR  A  SINGLE  BRAGG  PLANE 

The  simplest  and  most  important  example  of  the  preceding  discussion  is  when  two 
free  electron  levels  are  within  order  U  of  each  other,  but  far  compared  with  U  from 
all  other  levels.  When  this  happens,  Eq.  (9.19)  reduces  to  the  two  equations: 


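$$\begin{aligned} (\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}_1})\, c_{\mathbf{k}-\mathbf{K}_1} &= U_{\mathbf{K}_2-\mathbf{K}_1}\, c_{\mathbf{k}-\mathbf{K}_2}, \\ (\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}_2})\, c_{\mathbf{k}-\mathbf{K}_2} &= U_{\mathbf{K}_1-\mathbf{K}_2}\, c_{\mathbf{k}-\mathbf{K}_1}. \end{aligned} \tag{9.20}$$


When only two levels are involved, there is little point in continuing with the notational convention that labels them symmetrically. We therefore introduce variables particularly convenient for the two-level problem:

$$\mathbf{q} = \mathbf{k} - \mathbf{K}_1 \quad \text{and} \quad \mathbf{K} = \mathbf{K}_2 - \mathbf{K}_1, \tag{9.21}$$

and write (9.20) as

$$\begin{aligned} (\varepsilon - \varepsilon^0_{\mathbf{q}})\, c_{\mathbf{q}} &= U_{\mathbf{K}}\, c_{\mathbf{q}-\mathbf{K}}, \\ (\varepsilon - \varepsilon^0_{\mathbf{q}-\mathbf{K}})\, c_{\mathbf{q}-\mathbf{K}} &= U_{-\mathbf{K}}\, c_{\mathbf{q}} = U_{\mathbf{K}}^*\, c_{\mathbf{q}}. \end{aligned} \tag{9.22}$$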


5 These are rather closely related to the equations of second-order degenerate perturbation theory, to which they reduce when all the $\varepsilon^0_{\mathbf{k}-\mathbf{K}_i}$ are rigorously equal, $i = 1, \ldots, m$. (See L. D. Landau and E. M. Lifshitz, Quantum Mechanics, Addison-Wesley, Reading, Mass., 1965, p. 134.)

6 The numerator is explicitly of order $U^2$, and since only $\mathbf{K}$-values different from $\mathbf{K}_1, \ldots, \mathbf{K}_m$ appear in the sum, the denominator is not of order $U$ when $\varepsilon$ is close to the $\varepsilon^0_{\mathbf{k}-\mathbf{K}_i}$, $i = 1, \ldots, m$.

7 Note that the rule of thumb for going from (9.19) back to the more accurate form in (9.18) is simply that $U$ should be replaced by $U'$, where

$$U'_{\mathbf{K}_j-\mathbf{K}_i} = U_{\mathbf{K}_j-\mathbf{K}_i} + \sum_{\mathbf{K} \ne \mathbf{K}_1, \ldots, \mathbf{K}_m} \frac{U_{\mathbf{K}_j-\mathbf{K}}\, U_{\mathbf{K}-\mathbf{K}_i}}{\varepsilon - \varepsilon^0_{\mathbf{k}-\mathbf{K}}}.$$
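We have:

$$\varepsilon^0_{\mathbf{q}} \approx \varepsilon^0_{\mathbf{q}-\mathbf{K}}, \qquad |\varepsilon^0_{\mathbf{q}} - \varepsilon^0_{\mathbf{q}-\mathbf{K}'}| \gg U, \quad \text{for } \mathbf{K}' \ne \mathbf{K}, 0. \tag{9.23}$$

Now $\varepsilon^0_{\mathbf{q}}$ is equal to $\varepsilon^0_{\mathbf{q}-\mathbf{K}}$ for some reciprocal lattice vector only when $|\mathbf{q}| = |\mathbf{q} - \mathbf{K}|$. This means (Figure 9.2a) that $\mathbf{q}$ must lie on the Bragg plane (see Chapter 6) bisecting the line joining the origin of k space to the reciprocal lattice point $\mathbf{K}$. The assertion that $\varepsilon^0_{\mathbf{q}} = \varepsilon^0_{\mathbf{q}-\mathbf{K}'}$ only for $\mathbf{K}' = \mathbf{K}$ requires that $\mathbf{q}$ lie only on this Bragg plane, and on no other.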


Figure  9.2 

(a) If $|\mathbf{q}| = |\mathbf{q} - \mathbf{K}|$, then the point $\mathbf{q}$ must
lie in the Bragg plane determined by $\mathbf{K}$.

(b) If the point $\mathbf{q}$ lies in the Bragg plane,
then the vector $\mathbf{q} - \tfrac{1}{2}\mathbf{K}$ is parallel to the
plane.


Thus  conditions  (9.23)  have  the  geometric  significance  of  requiring  q  to  be  close 
to  a  Bragg  plane  (but  not  close  to  a  place  where  two  or  more  Bragg  planes  intersect). 
Therefore  the  case  of  two  nearly  degenerate  levels  applies  to  an  electron  whose 
wave vector very nearly satisfies the condition for a single Bragg scattering.8 Correspondingly,
the general case of many nearly degenerate levels applies to the treatment
of  a  free  electron  level  whose  wave  vector  is  close  to  one  at  which  many  simultaneous 
Bragg  reflections  can  occur.  Since  the  nearly  degenerate  levels  are  the  most  strongly 
affected  by  a  weak  periodic  potential,  we  conclude  that  a  weak  periodic  potential  has 
its  major  effects  on  only  those  free  electron  levels  whose  wave  vectors  are  close  to  ones 
at  which  Bragg  reflections  can  occur. 
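The geometric statement can be made quantitative: since $\varepsilon^0_{\mathbf{q}} - \varepsilon^0_{\mathbf{q}-\mathbf{K}}$ is proportional to $2\mathbf{q}\cdot\mathbf{K} - K^2$, the splitting between the two free electron levels grows linearly with the distance of $\mathbf{q}$ from the Bragg plane. The following minimal sketch checks this; the units ($\hbar^2/2m = 1$), the sample vectors, and the function names are assumptions for illustration.

```python
import math

def bragg_plane_offset(q, K):
    """Signed distance of q from the Bragg plane of K, the plane q.K = |K|^2 / 2."""
    dot = sum(qi * Ki for qi, Ki in zip(q, K))
    norm = math.sqrt(sum(Ki * Ki for Ki in K))
    return dot / norm - 0.5 * norm

def free_level_difference(q, K):
    """eps0_q - eps0_{q-K}, in assumed units hbar^2 / 2m = 1."""
    e_q = sum(qi * qi for qi in q)
    e_qK = sum((qi - Ki) ** 2 for qi, Ki in zip(q, K))
    return e_q - e_qK

K = (2.0, 0.0, 0.0)
for q in [(1.0, 0.7, -0.3), (1.2, 0.7, -0.3)]:
    print(q, bragg_plane_offset(q, K), free_level_difference(q, K))
```

The first sample point lies on the plane and the two levels are degenerate; the second is displaced along $\mathbf{K}$ and the levels differ by $2|\mathbf{K}|$ times the offset. The condition (9.23) for near degeneracy is thus a condition that this offset be small on the scale of $U/|\mathbf{K}|$ in these units.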

We  discuss  systematically  on  pages  162  to  166  when  free  electron  wave  vectors 
do,  or  do  not,  lie  on  Bragg  planes,  as  well  as  the  general  structure  this  imposes  on 
the  energy  levels  in  a  weak  potential.  First,  however,  we  examine  the  level  structure 


8  An  incident  X-ray  beam  undergoes  Bragg  reflection  only  if  its  wave  vector  lies  on  a  Bragg  plane 
(see  Chapter  6). 
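when only a single Bragg plane is nearby, as determined by (9.22). These equations have a solution when:

$$\begin{vmatrix} \varepsilon - \varepsilon^0_{\mathbf{q}} & -U_{\mathbf{K}} \\ -U_{\mathbf{K}}^* & \varepsilon - \varepsilon^0_{\mathbf{q}-\mathbf{K}} \end{vmatrix} = 0. \tag{9.24}$$

This leads to a quadratic equation

$$(\varepsilon - \varepsilon^0_{\mathbf{q}})(\varepsilon - \varepsilon^0_{\mathbf{q}-\mathbf{K}}) = |U_{\mathbf{K}}|^2. \tag{9.25}$$

The two roots

$$\varepsilon = \tfrac{1}{2}\left(\varepsilon^0_{\mathbf{q}} + \varepsilon^0_{\mathbf{q}-\mathbf{K}}\right) \pm \left[ \left( \frac{\varepsilon^0_{\mathbf{q}} - \varepsilon^0_{\mathbf{q}-\mathbf{K}}}{2} \right)^2 + |U_{\mathbf{K}}|^2 \right]^{1/2} \tag{9.26}$$

give the dominant effect of the periodic potential on the energies of the two free electron levels $\varepsilon^0_{\mathbf{q}}$ and $\varepsilon^0_{\mathbf{q}-\mathbf{K}}$ when $\mathbf{q}$ is close to the Bragg plane determined by $\mathbf{K}$. These are plotted in Figure 9.3.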


Figure  9.3 

Plot of the energy bands given by Eq. (9.26) for
$\mathbf{q}$ parallel to $\mathbf{K}$. The lower band corresponds to
the choice of a minus sign in (9.26) and the
upper band to a plus sign. When $\mathbf{q} = \tfrac{1}{2}\mathbf{K}$, the
two bands are separated by a band gap of magnitude
$2|U_{\mathbf{K}}|$. When $\mathbf{q}$ is far removed from the
Bragg plane, the levels (to leading order) are
indistinguishable from their free electron values
(denoted by dotted lines).


The result (9.26) is particularly simple for points lying on the Bragg plane since, when $\mathbf{q}$ is on the Bragg plane, $\varepsilon^0_{\mathbf{q}} = \varepsilon^0_{\mathbf{q}-\mathbf{K}}$. Hence

$$\varepsilon = \varepsilon^0_{\mathbf{q}} \pm |U_{\mathbf{K}}|, \quad \mathbf{q} \text{ on a single Bragg plane}. \tag{9.27}$$

Thus, at all points on the Bragg plane, one level is uniformly raised by $|U_{\mathbf{K}}|$ and the other is uniformly lowered by the same amount.

It is also easily verified from (9.26) that when $\varepsilon^0_{\mathbf{q}} = \varepsilon^0_{\mathbf{q}-\mathbf{K}}$,

$$\frac{\partial\varepsilon}{\partial\mathbf{q}} = \frac{\hbar^2}{m}\left(\mathbf{q} - \tfrac{1}{2}\mathbf{K}\right); \tag{9.28}$$

i.e., when the point $\mathbf{q}$ is on the Bragg plane the gradient of $\varepsilon$ is parallel to the plane (see Figure 9.2b). Since the gradient is perpendicular to the surfaces on which a function is constant, the constant-energy surfaces at the Bragg plane are perpendicular to the plane.9
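The statements (9.27) and (9.28) can be checked numerically from the two roots (9.26). The sketch below is illustrative only: it assumes units $\hbar^2/2m = 1$ (so that $\hbar^2/m = 2$), a two-dimensional example with $\mathbf{K}$ along the x-axis, a central-difference gradient with step `h`, and hypothetical function names.

```python
import math

def two_level_energies(q, K, U):
    """Lower and upper roots of (9.26), in assumed units hbar^2 / 2m = 1."""
    e_q = sum(x * x for x in q)
    e_qK = sum((x - y) ** 2 for x, y in zip(q, K))
    mean = 0.5 * (e_q + e_qK)
    root = math.sqrt((0.5 * (e_q - e_qK)) ** 2 + abs(U) ** 2)
    return mean - root, mean + root

def gradient(q, K, U, branch, h=1e-5):
    """Central-difference gradient of one branch (0 = lower, 1 = upper)."""
    grad = []
    for i in range(len(q)):
        plus = list(q)
        minus = list(q)
        plus[i] += h
        minus[i] -= h
        e_plus = two_level_energies(plus, K, U)[branch]
        e_minus = two_level_energies(minus, K, U)[branch]
        grad.append((e_plus - e_minus) / (2.0 * h))
    return grad

K = (2.0 * math.pi, 0.0)
q = (math.pi, 0.4)            # a point on the Bragg plane of K
lower, upper = two_level_energies(q, K, U=0.5)
print(upper - lower)           # compare with 2 |U_K|
print(gradient(q, K, 0.5, branch=1))
```

On the Bragg plane the gap should equal $2|U_{\mathbf{K}}|$, the component of the gradient along $\mathbf{K}$ should vanish, and the remaining component should equal $2(\mathbf{q} - \tfrac{1}{2}\mathbf{K})$ in these units, in agreement with (9.28).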

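When q lies on a single Bragg plane we may also easily determine the form of the wave functions corresponding to the two solutions $\mathcal{E} = \mathcal{E}^0_{\mathbf q} \pm |U_{\mathbf K}|$. From (9.22), when $\mathcal{E}$ is given by (9.27), the two coefficients $c_{\mathbf q}$ and $c_{\mathbf q-\mathbf K}$ satisfy10

$$c_{\mathbf q} = \pm\,\mathrm{sgn}(U_{\mathbf K})\, c_{\mathbf q-\mathbf K}. \tag{9.29}$$

Since these two coefficients are the dominant ones in the plane-wave expansion (9.1), it follows that if $U_{\mathbf K} > 0$, then

$$|\psi(\mathbf r)|^2 \propto (\cos \tfrac12\mathbf K\cdot\mathbf r)^2, \qquad \mathcal{E} = \mathcal{E}^0_{\mathbf q} + |U_{\mathbf K}|,$$

$$|\psi(\mathbf r)|^2 \propto (\sin \tfrac12\mathbf K\cdot\mathbf r)^2, \qquad \mathcal{E} = \mathcal{E}^0_{\mathbf q} - |U_{\mathbf K}|,$$

while if $U_{\mathbf K} < 0$, then

$$|\psi(\mathbf r)|^2 \propto (\sin \tfrac12\mathbf K\cdot\mathbf r)^2, \qquad \mathcal{E} = \mathcal{E}^0_{\mathbf q} + |U_{\mathbf K}|,$$

$$|\psi(\mathbf r)|^2 \propto (\cos \tfrac12\mathbf K\cdot\mathbf r)^2, \qquad \mathcal{E} = \mathcal{E}^0_{\mathbf q} - |U_{\mathbf K}|. \tag{9.30}$$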

Sometimes the two types of linear combination are called “p-like” ($|\psi|^2 \sim \sin^2 \tfrac12\mathbf K\cdot\mathbf r$) and “s-like” ($|\psi|^2 \sim \cos^2 \tfrac12\mathbf K\cdot\mathbf r$) because of their position dependence near lattice points. The s-like combination, like an atomic s-level, does not vanish at the ion; in the p-like combination the charge density vanishes as the square of the distance from the ion for small distances, which is also a characteristic of atomic p-levels.
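A short numerical check of the s-like and p-like densities, assuming a one-dimensional lattice with lattice constant a = 1 (so that K = 2π and lattice points sit at integer x) and keeping only the two dominant plane waves. The function name and the sample values are hypothetical, and the wave function is left unnormalized.

```python
import cmath
import math

def charge_density(x, q, K, U_K, upper):
    """|psi(x)|^2 for the two-plane-wave level on the Bragg plane (1D, unnormalized)."""
    sgn = 1.0 if U_K > 0 else -1.0
    ratio = sgn if upper else -sgn          # c_q / c_{q-K} from Eq. (9.29)
    psi = ratio * cmath.exp(1j * q * x) + cmath.exp(1j * (q - K) * x)
    return abs(psi) ** 2

K = 2.0 * math.pi      # assumed lattice constant a = 1
q = K / 2              # on the Bragg "plane"
x = 0.3

upper_density = charge_density(x, q, K, U_K=0.1, upper=True)    # s-like for U_K > 0
lower_density = charge_density(x, q, K, U_K=0.1, upper=False)   # p-like for U_K > 0
density_at_ion = charge_density(0.0, q, K, U_K=0.1, upper=False)
```

With U_K > 0 the upper level follows 4cos²(½Kx) and the lower level 4sin²(½Kx), which vanishes at the lattice point x = 0; reversing the sign of U_K interchanges the two, as in (9.30).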


ENERGY  BANDS  IN  ONE  DIMENSION 

We can illustrate these general conclusions in one dimension, where twofold degeneracy is the most that can ever occur. In the absence of any interaction the electronic energy levels are just a parabola in k (Figure 9.4a). To leading order in the weak one-dimensional periodic potential this curve remains correct except near Bragg “planes” (which are points in one dimension). When q is near a Bragg “plane” corresponding to the reciprocal lattice vector K (i.e., the point $\tfrac12 K$) the corrected energy levels are determined by drawing another free electron parabola centered around K (Figure 9.4b), noting that the degeneracy at the point of intersection is split by $2|U_K|$ in such a way that both curves have zero slope at that point, and redrawing Figure 9.4b to get Figure 9.4c. The original free electron curve is therefore modified as in Figure 9.4d. When all Bragg planes and their associated Fourier components are included, we end up with a set of curves such as those shown in Figure 9.4e. This particular way of depicting the energy levels is known as the extended-zone scheme.

If we insist on specifying all the levels by a wave vector k in the first Brillouin zone,
then  we  must  translate  the  pieces  of  Figure  9.4e,  through  reciprocal  lattice  vectors, 
into  the  first  Brillouin  zone.  The  result  is  shown  in  Figure  9.4f.  The  representation 
is  that  of  the  reduced-zone  scheme  (see  page  142). 
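The translation into the first zone can be sketched for free electrons in one dimension as follows. The lattice constant a = 1, the units ħ²/2m = 1, and the function names are assumptions made for illustration only.

```python
import math

def reduce_to_first_zone(k, a=1.0):
    """Return (k_reduced, n) with k = k_reduced + n * (2*pi/a).

    k_reduced lies in the interval [-pi/a, pi/a).
    """
    b = 2.0 * math.pi / a           # shortest reciprocal lattice vector
    n = math.floor(k / b + 0.5)     # number of reciprocal lattice vectors removed
    return k - n * b, n

def free_levels_reduced(k_reduced, a=1.0, n_bands=3):
    """Lowest free electron energies at k_reduced, in assumed units hbar^2/2m = 1."""
    b = 2.0 * math.pi / a
    energies = [(k_reduced + m * b) ** 2 for m in range(-n_bands, n_bands + 1)]
    return sorted(energies)[:n_bands]

b = 2.0 * math.pi
k_ext = 0.75 * b                    # a point in the second zone (extended-zone scheme)
k_red, n = reduce_to_first_zone(k_ext)
levels = free_levels_reduced(k_red)
```

Here the extended-zone point 0.75b is carried to -0.25b, and its energy reappears as the second-lowest level at that reduced wave vector, i.e. in the second band.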


9  This  result  is  often,  but  not  always,  true  even  when  the  periodic  potential  is  not  weak,  because 
the  Bragg  planes  occupy  positions  of  fairly  high  symmetry. 

10  For  simplicity  we  assume  here  that  UK  is  real  (the  crystal  has  inversion  symmetry). 
Figure 9.4

(a) The free electron $\mathcal{E}$ vs. k parabola in one dimension. (b) Step 1 in the construction to determine the distortion in the free electron parabola in the neighborhood of a Bragg “plane,” due to a weak periodic potential. If the Bragg “plane” is that determined by K, a second free electron parabola is drawn, centered on K. (c) Step 2 in the construction to determine the distortion in the free electron parabola in the neighborhood of a Bragg “plane.” The degeneracy of the two parabolas at K/2 is split. (d) Those portions of part (c) corresponding to the original free electron parabola given in (a). (e) Effect of all additional Bragg “planes” on the free electron parabola. This particular way of displaying the electronic levels in a periodic potential is known as the extended-zone scheme. (f) The levels of (e), displayed in a reduced-zone scheme. (g) Free electron levels of (e) or (f) in a repeated-zone scheme.


One can also emphasize the periodicity of the labeling in k-space by periodically extending Figure 9.4f throughout all of k-space to arrive at Figure 9.4g, which emphasizes that a particular level at k can be described by any wave vector differing from k by a reciprocal lattice vector. This representation is the repeated-zone scheme (see page 142). The reduced-zone scheme indexes each level with a k lying in the first zone, while the extended-zone scheme uses a labeling emphasizing continuity with the free electron levels. The repeated-zone scheme is the most general representation, but is highly redundant, since the same level is shown many times, for all equivalent wave vectors $k, k \pm K, k \pm 2K, \ldots$.


ENERGY-WAVE-VECTOR CURVES IN THREE DIMENSIONS

In three dimensions the structure of the energy bands is sometimes displayed by plotting $\mathcal{E}$ vs. k along particular straight lines in k-space. Such curves are generally shown in a reduced-zone scheme, since for general directions in k-space they are not periodic. Even in the completely free electron approximation these curves are surprisingly complex. An example is shown in Figure 9.5, which was constructed by plotting, as k varied along the particular lines shown, the values of $\mathcal{E}^0_{\mathbf k-\mathbf K} = \hbar^2(\mathbf k-\mathbf K)^2/2m$ for all reciprocal lattice vectors K close enough to the origin to lead to energies lower than the top of the vertical scale.


Figure  9.5 

Free electron energy levels for an fcc Bravais lattice. The energies are plotted along lines in the first Brillouin zone joining the points Γ (k = 0), K, L, W, and X. $\mathcal{E}_X$ is the energy at point X ($[\hbar^2/2m][2\pi/a]^2$). The horizontal lines give Fermi energies for the indicated numbers of electrons per primitive cell. The number of dots on a curve specifies the number of degenerate free electron levels represented by the curve. (From F. Herman, in An Atomistic Approach to the Nature and Properties of Materials, J. A. Pask, ed., Wiley, New York, 1967.)


Note  that  most  of  the  curves  are  highly  degenerate.  This  is  because  the  directions 
along  which  the  energy  has  been  plotted  are  all  lines  of  fairly  high  symmetry,  so 
points  along  them  are  likely  to  be  as  far  from  several  other  reciprocal  lattice  vectors 
as  they  are  from  any  given  one.  The  addition  of  a  weak  periodic  potential  will  in 
general  remove  some,  but  not  necessarily  all,  of  this  degeneracy.  The  mathematical 
theory  of  groups  is  often  used  to  determine  how  such  degeneracies  will  be  split. 
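The degeneracies of the free electron levels at symmetry points can be counted directly. The sketch below measures wave vectors in units of 2π/a and energies in units of $\mathcal{E}_X$, and uses the fact that the reciprocal lattice of fcc is bcc, so that in these units the vectors K are integer triples that are all even or all odd. The function names, the cutoff `max_index`, and the energy window are illustrative assumptions.

```python
from itertools import product

def fcc_reciprocal_vectors(max_index=3):
    """Reciprocal lattice vectors of fcc in units of 2*pi/a (all even or all odd)."""
    return [v for v in product(range(-max_index, max_index + 1), repeat=3)
            if len({c % 2 for c in v}) == 1]

def free_levels(k, vectors, e_max=3.0):
    """Map energy (units of E_X) -> degeneracy for levels |k - K|^2 <= e_max."""
    levels = {}
    for K in vectors:
        e = sum((kc - Kc) ** 2 for kc, Kc in zip(k, K))
        if e <= e_max:
            key = round(e, 9)
            levels[key] = levels.get(key, 0) + 1
    return dict(sorted(levels.items()))

vectors = fcc_reciprocal_vectors()
at_gamma = free_levels((0.0, 0.0, 0.0), vectors)
at_X = free_levels((1.0, 0.0, 0.0), vectors)
at_W = free_levels((1.0, 0.5, 0.0), vectors)
```

Within this window the lowest level is single at Γ, twofold at X, and fourfold at W; the fourfold level at W is the one examined in Problem 3.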


THE  ENERGY  GAP 

Quite generally, a weak periodic potential introduces an “energy gap” at Bragg planes. By this we mean the following:

When $U_{\mathbf K} = 0$, as k crosses a Bragg plane the energy changes continuously from the lower root of (9.26) to the upper, as illustrated in Figure 9.4b. When $U_{\mathbf K} \neq 0$, this is no longer so. The energy only changes continuously with k, as the Bragg plane is crossed, if one stays with the lower (or upper) root, as illustrated in Figure 9.4c. To change branches as k varies continuously it is now necessary for the energy to change discontinuously by at least $2|U_{\mathbf K}|$.

We  shall  see  in  Chapter  12  that  this  mathematical  separation  of  the  two  bands 
is  reflected  in  a  physical  separation:  When  the  action  of  an  external  field  changes 
an  electron’s  wave  vector,  the  presence  of  the  energy  gap  requires  that  upon  crossing 
the  Bragg  plane,  the  electron  must  emerge  in  a  level  whose  energy  remains  in  the 
original branch of $\mathcal{E}(\mathbf k)$. It is this property that makes the energy gap of fundamental
importance  in  electronic  transport  properties. 


BRILLOUIN  ZONES 

Using  the  theory  of  electrons  in  a  weak  periodic  potential  to  determine  the  complete 
band  structure  of  a  three-dimensional  crystal  leads  to  geometrical  constructions  of 
great  complexity.  It  is  often  most  important  to  determine  the  Fermi  surface  (page  141) 
and the behavior of the $\mathcal{E}_n(\mathbf k)$ in its immediate vicinity.

In  doing  this  for  weak  potentials,  the  procedure  is  first  to  draw  the  free  electron 
Fermi  sphere  centered  at  k  =  0.  Next,  one  notes  that  the  sphere  will  be  deformed 
in a manner of which Figure 9.6 is characteristic11 when it crosses a Bragg plane and
in  a  correspondingly  more  complex  way  when  it  passes  near  several  Bragg  planes. 
When  the  effects  of  all  Bragg  planes  are  inserted,  this  leads  to  a  representation  of  the 
Fermi  surface  as  a  fractured  sphere  in  the  extended-zone  scheme.  To  construct  the 
portions  of  the  Fermi  surface  lying  in  the  various  bands  in  the  repeated-zone  scheme 
one  can  make  a  similar  construction,  starting  with  free  electron  spheres  centered 
about  all  reciprocal  lattice  points.  To  construct  the  Fermi  surface  in  the  reduced-zone 
scheme,  one  can  translate  all  the  pieces  of  the  single  fractured  sphere  back  into  the 
first  zone  through  reciprocal  lattice  vectors.  This  procedure  is  made  systematic 
through  the  geometrical  notion  of  the  higher  Brillouin  zones. 

Recall that the first Brillouin zone is the Wigner-Seitz primitive cell of the reciprocal lattice (pages 73 and 89), i.e. the set of points lying closer to K = 0 than to any other reciprocal lattice point. Since Bragg planes bisect the lines joining the origin to points of the reciprocal lattice, one can equally well define the first zone as the set of points that can be reached from the origin without crossing any Bragg planes.12

Figure 9.6

(a) Free electron sphere cutting Bragg plane located at $\tfrac12 K$ from the origin ($U_{\mathbf K} = 0$). (b) Deformation of the free electron sphere near the Bragg plane when $U_{\mathbf K} \neq 0$. The constant-energy surface intersects the plane in two circles, whose radii are calculated in Problem 1.

11 This follows from the demonstration on page 159 that a constant-energy surface is perpendicular to a Bragg plane when they intersect, in the nearly free electron approximation.

Higher  Brillouin  zones  are  simply  other  regions  bounded  by  the  Bragg  planes, 
defined  as  follows: 


The first Brillouin zone is the set of points in k-space that can be reached from the origin without crossing any Bragg plane. The second Brillouin zone is the set of points that can be reached from the first zone by crossing only one Bragg plane. The (n + 1)th Brillouin zone is the set of points not in the (n − 1)th zone that can be reached from the nth zone by crossing only one Bragg plane.

Alternatively,  the  nth  Brillouin  zone  can  be  defined  as  the  set  of  points  that  can  be 
reached  from  the  origin  by  crossing  n  —  1  Bragg  planes,  but  no  fewer. 

These definitions are illustrated in two dimensions in Figure 9.7. The surfaces of
the first three zones for the fcc and bcc lattices are shown in Figure 9.8. Both definitions
emphasize  the  physically  important  fact  that  the  zones  are  bounded  by  Bragg  planes. 
Thus  they  are  regions  at  whose  surfaces  the  effects  of  a  weak  periodic  potential  are 
important  (i.e.,  first  order),  but  in  whose  interior  the  free  electron  energy  levels  are 
only  perturbed  in  second  order. 


Figure  9.7 

Illustration  of  the  definition  of  the  Brillouin 
zones  for  a  two-dimensional  square  Bravais 
lattice.  The  reciprocal  lattice  is  also  a  square 
lattice  of  side  b.  The  figure  shows  all  Bragg 
planes  (lines,  in  two  dimensions)  that  lie  within 
the  square  of  side  2b  centered  on  the  origin. 
These  Bragg  planes  divide  that  square  into 
regions  belonging  to  zones  1  to  6.  (Only  zones 
1,  2,  and  3  are  entirely  contained  within  the 
square,  however.) 


It  is  very  important  to  note  that  each  Brillouin  zone  is  a  primitive  cell  of  the 
reciprocal  lattice.  This  is  because  the  nth  Brillouin  zone  is  simply  the  set  of  points 
that  have  the  origin  as  the  nth  nearest  reciprocal  lattice  point  (a  reciprocal  lattice 
point  K  is  nearer  to  a  point  k  than  k  is  to  the  origin  if  and  only  if  k  is  separated  from 
the  origin  by  the  Bragg  plane  determined  by  K).  Given  this,  the  proof  that  the  nth 
Brillouin  zone  is  a  primitive  cell  is  identical  to  the  proof  on  page  73  that  the  Wigner- 
Seitz  cell  (i.e.,  the  first  Brillouin  zone)  is  primitive,  provided  that  the  phrase  “nth 
nearest  neighbor”  is  substituted  for  “nearest  neighbor”  throughout  the  argument. 
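The characterization of the nth zone as the set of points having the origin as their nth nearest reciprocal lattice point translates directly into a counting rule. The following sketch applies it to the two-dimensional square reciprocal lattice of Figure 9.7; the function name, the tolerance, and the search cutoff `max_index` are assumptions (the cutoff must be large enough to include every K with |K| < 2|k|).

```python
from itertools import product

def brillouin_zone_index(k, b=1.0, max_index=4, tol=1e-12):
    """Zone index of the point k for a square reciprocal lattice of side b.

    Counts the lattice points K != 0 that are closer to k than the origin is,
    i.e. the Bragg planes separating k from the origin, and adds one.
    """
    kx, ky = k
    k2 = kx * kx + ky * ky
    closer = 0
    for m, n in product(range(-max_index, max_index + 1), repeat=2):
        if (m, n) == (0, 0):
            continue
        d2 = (kx - m * b) ** 2 + (ky - n * b) ** 2
        if abs(d2 - k2) <= tol:
            raise ValueError("k lies on a Bragg plane (zone boundary)")
        if d2 < k2:
            closer += 1
    return closer + 1

zones = [brillouin_zone_index(p) for p in [(0.1, 0.2), (0.6, 0.1), (0.7, 0.4), (0.6, 0.6)]]
```

Points on a Bragg plane are rejected, in keeping with the convention of footnote 12 that zones are defined by their interior points.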


12  We  exclude  from  consideration  points  lying  on  Bragg  planes,  which  turn  out  to  be  points  common 
to  the  surface  of  two  or  more  zones.  We  define  the  zones  in  terms  of  their  interior  points. 
Figure 9.8

Surfaces of the first, second, and third Brillouin zones for (a) body-centered cubic and (b) face-centered cubic crystals. (Only the exterior surfaces are shown. It follows from the definition on page 163 that the interior surface of the nth zone is identical to the exterior surface of the (n − 1)th zone.) Evidently the surfaces bounding the zones become increasingly complex as the zone number increases. In practice it is often simplest to construct free electron Fermi surfaces by procedures (such as those described in Problem 4) that avoid making use of the explicit form of the Brillouin zones. (After R. Luck, doctoral dissertation, Technische Hochschule, Stuttgart, 1965.)

Because each zone is a primitive cell, there is a simple algorithm for constructing the branches of the Fermi surface in the repeated-zone scheme13:

1.  Draw  the  free  electron  Fermi  sphere. 

2.  Deform  it  slightly  (as  illustrated  in  Figure  9.6)  in  the  immediate  vicinity  of  every 
Bragg  plane.  (In  the  limit  of  exceedingly  weak  potentials  this  step  is  sometimes 
ignored  to  a  first  approximation.) 

3.  Take  that  portion  of  the  surface  of  the  free  electron  sphere  lying  within  the  nth 
Brillouin  zone,  and  translate  it  through  all  reciprocal  lattice  vectors.  The  resulting 
surface  is  the  branch  of  the  Fermi  surface  (conventionally  assigned  to  the  nth 
band)  in  the  repeated-zone  scheme.14 
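Step 1 requires the radius of the free electron sphere. As a rough sketch for a monatomic fcc metal, one can combine the standard free electron relation $k_F^3 = 3\pi^2 n$ with the density n = 4Z/a³ (four atoms per conventional cube, Z electrons per atom) and compare the result with the distances from Γ to the nearest first-zone face (center L) and to the farthest first-zone corner (W). The function and constant names are hypothetical; all lengths are in units of 2π/a.

```python
import math

def fermi_radius_fcc(valence):
    """Free electron k_F in units of 2*pi/a for a monatomic fcc metal.

    From k_F^3 = 3 pi^2 n with n = 4 Z / a^3, so k_F a / (2 pi) = (3 Z / (2 pi))^(1/3).
    """
    return (3.0 * valence / (2.0 * math.pi)) ** (1.0 / 3.0)

GAMMA_L = math.sqrt(3.0) / 2.0   # distance from Gamma to the nearest first-zone face
GAMMA_W = math.sqrt(5.0) / 2.0   # distance from Gamma to the farthest first-zone corner

radii = {Z: fermi_radius_fcc(Z) for Z in (1, 2, 3, 4)}
inside_first_zone = {Z: r < GAMMA_L for Z, r in radii.items()}
first_zone_full = {Z: r > GAMMA_W for Z, r in radii.items()}
```

For valence 1 the sphere lies inside the first zone, and for valence 4 the first zone lies entirely inside the sphere, so the sphere surface must be sorted among the higher zones in step 3.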


13  The  representation  of  the  Fermi  surface  in  the  repeated-zone  scheme  is  the  most  general.  After 
surveying  each  branch  in  its  full  periodic  splendor,  one  can  pick  that  primitive  cell  which  most  lucidly 
represents  the  topological  structure  of  the  whole  (which  is  often,  but  by  no  means  always,  the  first  Brillouin 
zone). 

14  An  alternative  procedure  is  to  translate  the  pieces  of  the  Fermi  surface  in  the  nth  zone  through 
those  reciprocal  lattice  vectors  that  take  the  pieces  of  the  nth  zone  in  which  they  are  contained,  into  the 
first  zone.  (Such  translations  exist  because  the  nth  zone  is  a  primitive  cell.)  This  is  illustrated  in  Figure  9.9. 
The Fermi surface in the repeated-zone scheme is then constructed by translating the resulting first zone
structures  through  all  reciprocal  lattice  vectors. 
Figure 9.9

The free electron Fermi sphere for a face-centered cubic metal of valence 4. The first zone
lies  entirely  within  the  interior  of  the  sphere,  and  the  sphere  does  not  extend  beyond  the 
fourth  zone.  Thus  the  only  zone  surfaces  intersected  by  the  surface  of  the  sphere  are  the 
(exterior)  surfaces  of  the  second  and  third  zones  (cf.  Figure  9.8b).  The  second-zone  Fermi 
surface consists of those parts of the surface of the sphere lying entirely within the polyhedron
bounding the second zone (i.e., all of the sphere except the parts extending beyond
the  polyhedron  in  (a)).  When  translated  through  reciprocal  lattice  vectors  into  the  first 
zone,  the  pieces  of  the  second-zone  surface  give  the  simply  connected  figure  shown  in  (c). 
(It is known as a “hole surface”; the levels it encloses have higher energies than those
outside).  The  third-zone  Fermi  surface  consists  of  those  parts  of  the  surface  of  the  sphere 
lying  outside  of  the  second  zone  (i.e.,  the  parts  extending  beyond  the  polyhedron  in  (a)) 
that  do  not  lie  outside  the  third  zone  (i.e.,  that  are  contained  within  the  polyhedron  shown 
in  (b)).  When  translated  through  reciprocal  lattice  vectors  into  the  first  zone,  these  pieces 
of sphere give the multiply connected structure shown in (d). The fourth-zone Fermi surface
consists of the remaining parts of the surface of the sphere that lie outside the third
zone  (as  shown  in  (b) ).  When  translated  through  reciprocal  lattice  vectors  into  the  first 
zone  they  form  the  “pockets  of  electrons”  shown  in  (e).  For  clarity  (d)  and  (e)  show  only 
the  intersection  of  the  third  and  fourth  zone  Fermi  surfaces  with  the  surface  of  the  first 
zone.  (From  R.  Luck,  op.  cit.) 
Generally speaking, the effect of the weak periodic potential on the surfaces
constructed  from  the  free  electron  Fermi  sphere  without  step  2,  is  simply  to  round 
off  the  sharp  edges  and  corners.  If,  however,  a  branch  of  the  Fermi  surface  consists 
of  very  small  pieces  of  surface  (surrounding  either  occupied  or  unoccupied  levels, 
known  as  “pockets  of  electrons”  or  “pockets  of  holes”),  then  a  weak  periodic  potential 
may  cause  these  to  disappear.  In  addition,  if  the  free  electron  Fermi  surface  has 
parts  with  a  very  narrow  cross  section,  a  weak  periodic  potential  may  cause  it  to 
become  disconnected  at  such  points. 

Some  further  constructions  appropriate  to  the  discussion  of  almost  free  electrons 
in fcc crystals are illustrated in Figure 9.10. These free-electron-like Fermi surfaces
are  of  great  importance  in  understanding  the  real  Fermi  surfaces  of  many  metals. 
This  will  be  illustrated  in  Chapter  15. 


Figure 9.10

The free electron Fermi surfaces for face-centered cubic metals of valence 2 and 3. (For
valence  1  the  surface  lies  entirely  within  the  interior  of  the  first  zone  and  therefore  remains 
a  sphere  to  lowest  order;  the  surface  for  valence  4  is  shown  in  Figure  9.9.)  All  branches 
of  the  Fermi  surface  are  shown.  The  primitive  cells  in  which  they  are  displayed  have  the 
shape  and  orientation  of  the  first  Brillouin  zone.  However,  the  cell  is  actually  the  first  zone 
(i.e.,  is  centered  on  K  =  0)  only  in  the  figures  illustrating  the  second  zone  surfaces.  In  the 
first  and  third  zone  figures  K  =  0  lies  at  the  center  of  one  of  the  horizontal  faces,  while 
for  the  fourth  zone  figure  it  lies  at  the  center  of  the  hexagonal  face  on  the  upper  right 
(or  the  parallel  face  opposite  it  (hidden)).  The  six  tiny  pockets  of  electrons  constituting 
the fourth zone surface for valence 3 lie at the corners of the regular hexagon given by
displacing that hexagonal face in the [111] direction by half the distance to the face opposite
it. (After W. Harrison, Phys. Rev. 118, 1190 (1960).) Corresponding constructions
for  body-centered  cubic  metals  can  be  found  in  the  Harrison  paper. 


THE  GEOMETRICAL  STRUCTURE  FACTOR  IN  MONATOMIC 
LATTICES  WITH  BASES 

Nothing said up to now has exploited any properties of the potential U(r) other than its periodicity, and, for convenience, inversion symmetry. If we pay somewhat closer attention to the form of U, recognizing that it will be made up of a sum of atomic potentials centered at the positions of the ions, then we can draw some further conclusions that are important in studying the electronic structure of monatomic lattices with a basis, such as the diamond and hexagonal close-packed (hcp) structures.

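Suppose that the basis consists of identical ions at positions $\mathbf d_j$. Then the periodic potential U(r) will have the form

$$U(\mathbf r) = \sum_{\mathbf R}\sum_j \phi(\mathbf r - \mathbf R - \mathbf d_j). \tag{9.31}$$

If we place this into Eq. (8.32) for $U_{\mathbf K}$, we find that

$$U_{\mathbf K} = \frac{1}{v}\int_{\text{cell}} d\mathbf r\, e^{-i\mathbf K\cdot\mathbf r} \sum_{\mathbf R, j}\phi(\mathbf r - \mathbf R - \mathbf d_j) = \frac{1}{v}\int_{\text{all space}} d\mathbf r\, e^{-i\mathbf K\cdot\mathbf r} \sum_j \phi(\mathbf r - \mathbf d_j), \tag{9.32}$$

or

$$U_{\mathbf K} = \frac{1}{v}\,\phi(\mathbf K)\, S^*_{\mathbf K}, \tag{9.33}$$

where $\phi(\mathbf K)$ is the Fourier transform of the atomic potential,

$$\phi(\mathbf K) = \int_{\text{all space}} d\mathbf r\, e^{-i\mathbf K\cdot\mathbf r}\,\phi(\mathbf r), \tag{9.34}$$

and $S_{\mathbf K}$ is the geometrical structure factor introduced in our discussion of X-ray diffraction (Chapter 6):

$$S_{\mathbf K} = \sum_j e^{i\mathbf K\cdot\mathbf d_j}. \tag{9.35}$$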

Thus  when  the  basis  leads  to  a  vanishing  structure  factor  for  some  Bragg  planes, 
i.e.,  when  the  X-ray  diffraction  peaks  from  these  planes  are  missing,  then  the  Fourier 
component  of  the  periodic  potential  associated  with  such  planes  vanishes;  i.e.,  the 
lowest-order  splitting  in  the  free  electron  levels  disappears. 

This  result  is  of  particular  importance  in  the  theory  of  metals  with  the  hexagonal 
close-packed  structure,  of  which  there  are  over  25  (Table  4.4).  The  first  Brillouin 
zone  for  the  simple  hexagonal  lattice  is  a  prism  on  a  regular  hexagon  base.  However, 
the  structure  factor  associated  with  the  hexagonal  top  and  bottom  of  the  prism 
vanishes  (Problem  3,  Chapter  6). 
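The vanishing of the structure factor on the hexagonal faces can be verified numerically from (9.35). The sketch below assumes the usual description of the hcp structure as a simple hexagonal lattice with a two-point basis at 0 and at a1/3 + a2/3 + a3/2, with a = 1 and the ideal c/a; the variable names are hypothetical.

```python
import cmath
import math

def structure_factor(K, basis):
    """S_K = sum_j exp(i K . d_j), Eq. (9.35)."""
    return sum(cmath.exp(1j * sum(Kc * dc for Kc, dc in zip(K, d))) for d in basis)

a, c = 1.0, 1.633                          # assumed lattice constants (ideal c/a)
a1 = (a, 0.0, 0.0)
a2 = (a / 2.0, math.sqrt(3.0) * a / 2.0, 0.0)
a3 = (0.0, 0.0, c)

# Assumed two-point hcp basis: the origin and a1/3 + a2/3 + a3/2.
d2 = tuple(a1[i] / 3.0 + a2[i] / 3.0 + a3[i] / 2.0 for i in range(3))
hcp_basis = [(0.0, 0.0, 0.0), d2]

K_face = (0.0, 0.0, 2.0 * math.pi / c)     # Bragg plane of the hexagonal zone face
K_next = (0.0, 0.0, 4.0 * math.pi / c)     # next reciprocal lattice vector along c

S_face = structure_factor(K_face, hcp_basis)
S_next = structure_factor(K_next, hcp_basis)
```

By (9.33), the vanishing of S at `K_face` means that the corresponding Fourier component of the potential vanishes, whereas the next vector along the c-axis has |S| = 2.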

Therefore,  according  to  nearly  free  electron  theory,  there  is  no  first-order  splitting 
of  the  free  electron  levels  at  these  faces.  It  might  appear  that  small  splittings  would 
still  occur  as  a  result  of  second-order  (and  higher-order)  effects.  However,  if  the 
one-electron Hamiltonian is independent of the spin, then in the hcp structure any
Bloch  level  with  wave  vector  k  on  the  hexagonal  face  of  the  first  Brillouin  zone 
can  be  proved  to  be  at  least  twofold  degenerate.  Accordingly,  the  splitting  is  rigorously 
zero.  In  a  situation  like  this  it  is  often  more  convenient  to  consider  a  representation 
of  the  zone  structure  in  which  those  planes  with  zero  gap  are  actually  ignored.  The 
regions  that  one  is  then  led  to  consider  are  known  as  Jones  zones  or  large  zones. 
Figure 9.11

Free electron Fermi surface for a divalent hcp metal with ideal c/a = 1.633. Since the hcp structure is simple hexagonal with two atoms per primitive cell, there are four electrons per primitive cell to be accommodated. The resulting Fermi surface comes in many pieces, whose names reveal an interesting level of imagination and taste. (a) The cap. The first zone is almost entirely filled by the free electron sphere, but there are small unoccupied regions in the six upper and six lower corners. These can be assembled, by translations through reciprocal lattice vectors, into two of the objects shown. (b) The monster. Portions of the free electron sphere in the second zone can be translated back into the first zone to form one of the large structures shown in the second-zone picture. The monster encloses unoccupied levels. (c) Portions of the free electron sphere in the third zone can be reassembled into several electron-enclosing surfaces. There is one lens, two cigars, and three butterflies. (d) Those few occupied free electron levels in the fourth zone can be reassembled into three pockets of the type pictured.

These structures arise when there is significant splitting of the free electron levels on the hexagonal faces of the first zone as a result of spin-orbit coupling. When spin-orbit coupling is weak (as it is in the lighter elements), there is negligible splitting on these faces, and the appropriate structures are those shown in Figure 9.12. (From J. B. Ketterson and R. W. Stark, Phys. Rev. 156, 751 (1967).)
IMPORTANCE OF SPIN-ORBIT COUPLING AT POINTS OF HIGH
SYMMETRY 

Until  now  we  have  regarded  the  electron  spin  as  being  completely  inert  dynamically. 
In  fact,  however,  an  electron  moving  through  an  electric  field,  such  as  that  of  the 
periodic potential U(r), experiences a potential proportional to the scalar product of
its  spin  magnetic  moment  with  the  vector  product  of  its  velocity  and  the  electric  field. 
This  additional  interaction  is  referred  to  as  the  spin-orbit  coupling,  and  is  of  great 
importance  in  atomic  physics  (see  Chapter  31).  Spin-orbit  coupling  is  important  in 
calculating the almost free electron levels at points in k-space of high symmetry,
since  it  often  happens  that  levels  that  are  rigorously  degenerate  when  it  is  ignored 
are  split  by  the  spin-orbit  coupling. 

For  example,  the  splitting  of  the  electronic  levels  on  the  hexagonal  faces  of  the 
first zone in hcp metals is entirely due to spin-orbit coupling. Since the strength of
spin-orbit  coupling  increases  with  atomic  number,  this  splitting  is  appreciable  in  the 
heavy  hexagonal  metals,  but  can  be  small  enough  to  be  ignored  in  the  light  ones. 
Correspondingly, there are two different schemes for constructing free-electron-like
Fermi  surfaces  in  hexagonal  metals.  These  are  illustrated  in  Figures  9.11  and  9.12. 


Figure 9.12

A representation of the Fermi surface of a divalent hcp metal obtained by reassembling those pieces in Figure 9.11 that were severed from each other by the horizontal hexagonal faces of the first Brillouin zone. The first and second zones together make up the structure on the left, and the many pieces in the third and fourth zones lead to the structure on the right. This representation ignores the spin-orbit splitting across the hexagonal face. (After W. Harrison, Phys. Rev. 118, 1190 (1960).)


PROBLEMS 


1. Nearly Free Electron Fermi Surface Near a Single Bragg Plane
To investigate the nearly free electron band structure given by (9.26) near a Bragg plane, it is convenient to measure the wave vector q with respect to the point $\tfrac12\mathbf K$ on the Bragg plane. If we write $\mathbf q = \tfrac12\mathbf K + \mathbf k$, and resolve k into its components parallel ($k_\parallel$) and perpendicular ($k_\perp$) to K, then (9.26) becomes

$$\mathcal{E} = \mathcal{E}^0_{K/2} + \frac{\hbar^2}{2m}k^2 \pm \left(4\,\mathcal{E}^0_{K/2}\,\frac{\hbar^2}{2m}k_\parallel^2 + |U_{\mathbf K}|^2\right)^{1/2}. \tag{9.36}$$

It is also convenient to measure the Fermi energy $\mathcal{E}_F$ with respect to the lowest value assumed by either of the bands given by (9.36) in the Bragg plane, writing:

$$\mathcal{E}_F = \mathcal{E}^0_{K/2} - |U_{\mathbf K}| + \Delta, \tag{9.37}$$

so that when $\Delta < 0$, no Fermi surface intersects the Bragg plane.

(a) Show that when $0 < \Delta < 2|U_{\mathbf K}|$, the Fermi surface lies entirely in the lower band and intersects the Bragg plane in a circle of radius

$$\rho = \sqrt{\frac{2m\Delta}{\hbar^2}}. \tag{9.38}$$

(b) Show that if $\Delta > |2U_{\mathbf K}|$, the Fermi surface lies in both bands, cutting the Bragg plane in two circles of radii $\rho_1$ and $\rho_2$ (Figure 9.6), and that the difference in the areas of the two circles is

$$\pi(\rho_2^2 - \rho_1^2) = \frac{4\pi m}{\hbar^2}|U_{\mathbf K}|. \tag{9.39}$$

(The area of these circles can be measured directly in some metals through the de Haas-van Alphen effect (Chapter 14), and therefore $|U_{\mathbf K}|$ can be determined directly from experiment for such nearly free electron metals.)


2.  Density  of  Levels  for  a  Two-Band  Model 

To  some  extent  this  problem  is  artificial  in  that  the  effects  of  neglected  Bragg  planes  can  lead 
to  corrections  comparable  to  the  deviations  we  shall  find  here  from  the  free  electron  result.  On 
the  other  hand,  the  problem  is  instructive  in  that  the  qualitative  features  are  general. 

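If we resolve q into its components parallel ($q_\parallel$) and perpendicular ($q_\perp$) to K, then (9.26) becomes

$$\mathcal{E} = \frac{\hbar^2}{2m}q_\perp^2 + h_\pm(q_\parallel), \tag{9.40}$$

where

$$h_\pm(q_\parallel) = \frac{\hbar^2}{2m}\left[q_\parallel^2 + \tfrac12(K^2 - 2q_\parallel K)\right] \pm \left\{\left[\frac{\hbar^2}{2m}\,\tfrac12(K^2 - 2q_\parallel K)\right]^2 + |U_{\mathbf K}|^2\right\}^{1/2} \tag{9.41}$$

is only a function of $q_\parallel$. The density of levels can be evaluated from (8.57) by performing the integral in an appropriate primitive cell over wave vectors q in cylindrical coordinates with the z-axis along K.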

(a) Show that when the integral over q is performed, the result for each band is

$$g(\mathcal{E}) = \frac{1}{4\pi^2}\left(\frac{2m}{\hbar^2}\right)\left(q_\parallel^{\max} - q_\parallel^{\min}\right), \tag{9.42}$$

where, for each band, $q_\parallel^{\max}$ and $q_\parallel^{\min}$ are the solutions to $\mathcal{E} = h_\pm(q_\parallel)$. Verify that the familiar free electron result is obtained in the limit $|U_{\mathbf K}| \to 0$.

(b) Show that

$$q_\parallel^{\min} = -\sqrt{\frac{2m\mathcal{E}}{\hbar^2}} + O(U_{\mathbf K}^2), \quad (\mathcal{E} > 0), \qquad q_\parallel^{\max} = \tfrac12 K \tag{9.43}$$

for the lower band, if the constant energy surface (at energy $\mathcal{E}$) cuts the zone plane (that is, $\mathcal{E}^0_{K/2} - |U_{\mathbf K}| \le \mathcal{E} \le \mathcal{E}^0_{K/2} + |U_{\mathbf K}|$).

(c) Show that for the upper band (9.42) should be interpreted as giving a density of levels

$$g_+(\mathcal{E}) = \frac{1}{4\pi^2}\left(\frac{2m}{\hbar^2}\right)\left(q_\parallel^{\max} - \tfrac12 K\right), \quad \text{for } \mathcal{E} > \mathcal{E}_{K/2} + |U_{\mathbf K}|. \tag{9.44}$$

(d) Show that $dg/d\mathcal{E}$ is singular at $\mathcal{E} = \mathcal{E}_{K/2} \pm |U_{\mathbf K}|$, so that the density of levels has the form shown in Figure 9.13. (These singularities are not peculiar either to the weak potential or two-band approximations. See page 145.)

Figure 9.13

Density of levels in the two-band approximation. The dashed line is the free electron result Eq. (2.63). Note that in contrast to earlier figures in this chapter, this one explicitly shows second-order corrections to the free electron result far from the Bragg plane.


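3. Effect of Weak Periodic Potential at Places in k-Space Where Bragg Planes Meet
Consider the point W ($\mathbf{k}_W = (2\pi/a)(1, \tfrac{1}{2}, 0)$) in the Brillouin zone of the fcc structure shown (see Figure 9.14). Here three Bragg planes ((200), (111), (11$\bar{1}$)) meet, and accordingly the free electron energies

$$\begin{aligned}
\varepsilon_1^0 &= \frac{\hbar^2}{2m}\,k^2, \\
\varepsilon_2^0 &= \frac{\hbar^2}{2m}\left(\mathbf{k} - \frac{2\pi}{a}(1, 1, 1)\right)^2, \\
\varepsilon_3^0 &= \frac{\hbar^2}{2m}\left(\mathbf{k} - \frac{2\pi}{a}(1, 1, \bar{1})\right)^2, \\
\varepsilon_4^0 &= \frac{\hbar^2}{2m}\left(\mathbf{k} - \frac{2\pi}{a}(2, 0, 0)\right)^2
\end{aligned} \tag{9.45}$$

are degenerate when $\mathbf{k} = \mathbf{k}_W$, and equal to $\varepsilon_W = \hbar^2 k_W^2/2m$.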


Figure  9.14 

First  Brillouin  zone  for  a  face-centered  cubic  crystal 
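(a) Show that in a region of k-space near W, the first-order energies are given by solutions to15

$$\begin{vmatrix}
\varepsilon_1^0 - \varepsilon & U_1 & U_1 & U_2 \\
U_1 & \varepsilon_2^0 - \varepsilon & U_2 & U_1 \\
U_1 & U_2 & \varepsilon_3^0 - \varepsilon & U_1 \\
U_2 & U_1 & U_1 & \varepsilon_4^0 - \varepsilon
\end{vmatrix} = 0,$$

where $U_2 = U_{200}$, $U_1 = U_{111} = U_{11\bar{1}}$, and that at W the roots are

$$\varepsilon = \varepsilon_W - U_2 \ \text{(twice)}, \qquad \varepsilon = \varepsilon_W + U_2 \pm 2U_1. \tag{9.46}$$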

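(b) Using a similar method, show that the energies at the point U ($\mathbf{k}_U = (2\pi/a)(1, \tfrac{1}{4}, \tfrac{1}{4})$) are

$$\varepsilon = \varepsilon_U - U_2, \qquad \varepsilon = \varepsilon_U + \tfrac{1}{2}U_2 \pm \tfrac{1}{2}\left(U_2^2 + 8U_1^2\right)^{1/2}, \tag{9.47}$$

where $\varepsilon_U = \hbar^2 k_U^2/2m$.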
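
The roots quoted in (9.46) and (9.47) can be checked numerically before attempting the algebra. The sketch below is only an illustration: the values of U1 and U2 are arbitrary assumptions, energies are measured from the degenerate free electron value (so the diagonal entries vanish exactly at W and U), and the trial eigenvectors are guesses suggested by the symmetry of the matrices rather than anything given in the problem.

```python
import math

def matvec(matrix, vector):
    """Multiply a square matrix (list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]

def is_eigenpair(matrix, eigenvalue, vector, tol=1e-12):
    """True if matrix @ vector equals eigenvalue * vector within tol."""
    return all(abs(mv - eigenvalue * v) <= tol
               for mv, v in zip(matvec(matrix, vector), vector))

# Hypothetical Fourier components (arbitrary energy units); energies are
# measured from the degenerate free electron value, so the diagonal is zero.
U1, U2 = 0.3, 0.5

# Point W: plane waves k, k-(111), k-(11-1), k-(200), in units of 2*pi/a.
M_W = [[0.0, U1, U1, U2],
       [U1, 0.0, U2, U1],
       [U1, U2, 0.0, U1],
       [U2, U1, U1, 0.0]]
pairs_W = [(-U2, [1, 0, 0, -1]),
           (-U2, [0, 1, -1, 0]),
           (U2 + 2 * U1, [1, 1, 1, 1]),
           (U2 - 2 * U1, [1, -1, -1, 1])]

# Point U: plane waves k, k-(111), k-(200).
M_U = [[0.0, U1, U2],
       [U1, 0.0, U1],
       [U2, U1, 0.0]]
root = math.sqrt(U2 ** 2 + 8 * U1 ** 2)
lam_plus, lam_minus = 0.5 * (U2 + root), 0.5 * (U2 - root)
pairs_U = [(-U2, [1, 0, -1]),
           (lam_plus, [1, (lam_plus - U2) / U1, 1]),
           (lam_minus, [1, (lam_minus - U2) / U1, 1])]

w_ok = all(is_eigenpair(M_W, lam, vec) for lam, vec in pairs_W)
u_ok = all(is_eigenpair(M_U, lam, vec) for lam, vec in pairs_U)
print(w_ok, u_ok)
```

A check of this kind confirms the roots for one choice of U1 and U2 only; the problem still asks for the general derivation.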

4.  Alternative  Definition  of  Brillouin  Zones 

Let k be a point in reciprocal space. Suppose spheres of radius k are drawn about every reciprocal lattice point K except the origin. Show that if k is in the interior of n − 1 spheres, and on the surface of none, then it lies in the interior of the nth Brillouin zone. Show that if k is in the interior of n − 1 spheres and on the surface of m additional spheres, then it is a point common to the boundaries of the nth, (n + 1)th, ..., (n + m)th Brillouin zones.

5. Brillouin Zones in a Two-Dimensional Square Lattice
Consider a two-dimensional square lattice with lattice constant a.

(a) Write down, in units of 2π/a, the radius of a circle that can accommodate m free electrons per primitive cell. Construct a table listing which of the first seven zones of the square lattice (Figure 9.15a) are completely full, which are partially empty, and which are completely empty for m = 1, 2, ..., 12. Verify that if m ≤ 12, the occupied levels lie entirely within the first seven zones, and that when m ≥ 13, levels in the eighth and higher zones become occupied.

(b) Draw pictures in suitable primitive cells of all branches of the Fermi surface for the cases m = 1, 2, ..., 7. The third zone surface for m = 4, for example, can be displayed as in Figure 9.15b.


15 Assume that the periodic potential U has inversion symmetry so that the $U_{\mathbf{K}}$ are real.




Figure  9.15 

(a)  The  first  seven  Brillouin 
zones  for  the  two-dimensional 
square  lattice.  Because  of  the 
lattice  symmetry  it  is  necessary 
to  display  only  one  eighth  of 
the  figure.  The  rest  can  be 
reconstructed  by  reflection  in 
the  dashed  lines  (which  are  not 
zone  boundaries),  (b)  Fermi 
surface  in  the  third  zone  for  a 
square  lattice  with  four  elec¬ 
trons  per  unit  cell.  (The  scale 
in  (b)  has  been  expanded  con¬ 
siderably.) 
In Chapter 9 we calculated electronic levels in a metal by viewing it as a gas of nearly free conduction electrons, only weakly perturbed by the periodic potential of the ions. We can also take a very different point of view, regarding a solid (metal or insulator) as a collection of weakly interacting neutral atoms. As an extreme example of this, imagine assembling a group of sodium atoms into a body-centered cubic array with a lattice constant of the order of centimeters rather than angstroms. All electrons would then be in atomic levels localized at lattice sites, bearing no resemblance to the linear combinations of a few plane waves described in Chapter 9.

If  we  were  to  shrink  the  artificially  large  lattice  constant  of  our  array  of  sodium 
atoms,  at  some  point  before  the  actual  lattice  constant  of  metallic  sodium  was  reached 
we  would  have  to  modify  our  identification  of  the  electronic  levels  of  the  array  with 
the  atomic  levels  of  isolated  sodium  atoms.  This  would  become  necessary  for  a 
particular  atomic  level,  when  the  interatomic  spacing  became  comparable  to  the 
spatial  extent  of  its  wave  function,  for  an  electron  in  that  level  would  then  feel  the 
presence  of  the  neighboring  atoms. 

The actual state of affairs for the 1s, 2s, 2p, and 3s levels of atomic sodium is shown in Figure 10.1. The atomic wave functions for these levels are drawn about two nuclei separated by 3.7 Å, the nearest-neighbor distance in metallic sodium. The overlap of the 1s wave functions centered on the two sites is utterly negligible, indicating that these atomic levels are essentially unaltered in metallic sodium. The overlap of the 2s- and 2p-levels is exceedingly small, and one might hope to find levels in the metal very closely related to these. However, the overlap of the 3s-levels (which hold the atomic valence electrons) is substantial, and there is no reason to expect the actual electronic levels of the metal to resemble these atomic levels.

The tight-binding approximation deals with the case in which the overlap of atomic wave functions is enough to require corrections to the picture of isolated atoms, but not so much as to render the atomic description completely irrelevant. The approximation is most useful for describing the energy bands that arise from the partially filled d-shells of transition metal atoms and for describing the electronic structure of insulators.

Quite  apart  from  its  practical  utility,  the  tight-binding  approximation  provides  an 
instructive  way  of  viewing  Bloch  levels  complementary  to  that  of  the  nearly  free 
electron  picture,  permitting  a  reconciliation  between  the  apparently  contradictory 
features  of  localized  atomic  levels  on  the  one  hand,  and  free  electron-like  plane-wave 
levels  on  the  other. 


GENERAL  FORMULATION 

In developing the tight-binding approximation, we assume that in the vicinity of each lattice point the full periodic crystal Hamiltonian, $H$, can be approximated by the Hamiltonian, $H_{\text{at}}$, of a single atom located at the lattice point. We also assume that the bound levels of the atomic Hamiltonian are well localized; i.e., if $\psi_n$ is a bound level of $H_{\text{at}}$ for an atom at the origin,

$$H_{\text{at}}\,\psi_n = E_n\,\psi_n, \tag{10.1}$$
Figure 10.1

Calculated electron wave functions for the levels of atomic sodium, plotted about two nuclei separated by the nearest-neighbor distance in metallic sodium, 3.7 Å. The solid curves are $r\psi(r)$ for the 1s, 2s, and 3s levels. The dashed curve is r times the radial wave function for the 2p levels. Note how the 3s curves overlap extensively, the 2s and 2p curves overlap only a little, and the 1s curves have essentially no overlap. The curves are taken from calculations by D. R. Hartree and W. Hartree, Proc. Roy. Soc. A193, 299 (1948). The scale on the r-axis is in angstroms.

then we require that $\psi_n(\mathbf{r})$ be very small when r exceeds a distance of the order of the lattice constant, which we shall refer to as the "range" of $\psi_n$.

In the extreme case in which the crystal Hamiltonian begins to differ from $H_{\text{at}}$ (for an atom whose lattice point we take as the origin) only at distances from r = 0 that exceed the range of $\psi_n(\mathbf{r})$, the wave function $\psi_n(\mathbf{r})$ will be an excellent approximation to a stationary-state wave function for the full Hamiltonian, with eigenvalue $E_n$. So also will the wave functions $\psi_n(\mathbf{r} - \mathbf{R})$ for all $\mathbf{R}$ in the Bravais lattice, since $H$ has the periodicity of the lattice.

To calculate corrections to this extreme case, we write the crystal Hamiltonian $H$ as

$$H = H_{\text{at}} + \Delta U(\mathbf{r}), \tag{10.2}$$
where $\Delta U(\mathbf{r})$ contains all corrections to the atomic potential required to produce the full periodic potential of the crystal (see Figure 10.2). If $\psi_n(\mathbf{r})$ satisfies the atomic Schrödinger equation (10.1), then it will also satisfy the crystal Schrödinger equation (10.2), provided that $\Delta U(\mathbf{r})$ vanishes wherever $\psi_n(\mathbf{r})$ does not. If this were indeed the case, then each atomic level $\psi_n(\mathbf{r})$ would yield N levels in the periodic potential, with wave functions $\psi_n(\mathbf{r} - \mathbf{R})$, for each of the N sites $\mathbf{R}$ in the lattice. To preserve the Bloch description we must find the N linear combinations of these degenerate wave functions that satisfy the Bloch condition (see Eq. (8.6)):

$$\psi(\mathbf{r} + \mathbf{R}) = e^{i\mathbf{k}\cdot\mathbf{R}}\,\psi(\mathbf{r}). \tag{10.3}$$


Figure  10.2 

The lower curve depicts the function $\Delta U(\mathbf{r})$ drawn along a line of atomic sites. When $\Delta U(\mathbf{r})$ is added to a single atomic potential localized at the origin, the full periodic potential $U(\mathbf{r})$ is recovered. The upper curve represents r times an atomic wave function localized at the origin. When $r\phi(\mathbf{r})$ is large, $\Delta U(\mathbf{r})$ is small, and vice versa.

The N linear combinations we require are

$$\psi_{n\mathbf{k}}(\mathbf{r}) = \sum_{\mathbf{R}} e^{i\mathbf{k}\cdot\mathbf{R}}\,\psi_n(\mathbf{r} - \mathbf{R}), \tag{10.4}$$

where $\mathbf{k}$ ranges through the N values in the first Brillouin zone consistent with the Born-von Karman periodic boundary condition.1 The Bloch condition (10.3) is verified for the wave functions (10.4) by noting that


1 Except when explicitly studying surface effects, one should avoid the temptation to treat a finite crystal by restricting the summation on $\mathbf{R}$ in (10.4) to the sites of a finite portion of the Bravais lattice. It is far more convenient to sum over an infinite Bravais lattice (the sum converging rapidly because of the short range of the atomic wave function $\psi_n$) and to represent the finite crystal with the usual Born-von Karman boundary condition, which places the standard restriction (8.27) on $\mathbf{k}$, when the Bloch condition holds. With the sum taken over all sites, for example, it is permissible to make the crucial replacement of the summation variable $\mathbf{R}'$ by $\bar{\mathbf{R}} = \mathbf{R}' - \mathbf{R}$, in the second to last line of Eq. (10.5).
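$$\begin{aligned}
\psi(\mathbf{r} + \mathbf{R}) &= \sum_{\mathbf{R}'} e^{i\mathbf{k}\cdot\mathbf{R}'}\,\psi_n(\mathbf{r} + \mathbf{R} - \mathbf{R}') \\
&= e^{i\mathbf{k}\cdot\mathbf{R}}\left[\sum_{\mathbf{R}'} e^{i\mathbf{k}\cdot(\mathbf{R}' - \mathbf{R})}\,\psi_n\big(\mathbf{r} - (\mathbf{R}' - \mathbf{R})\big)\right] \\
&= e^{i\mathbf{k}\cdot\mathbf{R}}\left[\sum_{\bar{\mathbf{R}}} e^{i\mathbf{k}\cdot\bar{\mathbf{R}}}\,\psi_n(\mathbf{r} - \bar{\mathbf{R}})\right] \\
&= e^{i\mathbf{k}\cdot\mathbf{R}}\,\psi(\mathbf{r}).
\end{aligned} \tag{10.5}$$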
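
The chain of equalities in (10.5) can also be checked numerically on a toy model. The sketch below assumes a one-dimensional lattice and a Gaussian standing in for the atomic level; both choices, and every name and parameter value in the code, are illustrative assumptions rather than part of the text. The lattice sum is truncated far beyond the range of the Gaussian, which mimics the rapidly converging infinite sum.

```python
import cmath
import math

def atomic_orbital(x, sigma=0.3):
    """Hypothetical localized 'atomic' level: a Gaussian of width sigma."""
    return math.exp(-x * x / (2.0 * sigma * sigma))

def bloch_sum(x, k, a=1.0, n_sites=40):
    """Sum over R = j*a of exp(i k R) * orbital(x - R), truncated to |j| <= n_sites."""
    return sum(cmath.exp(1j * k * j * a) * atomic_orbital(x - j * a)
               for j in range(-n_sites, n_sites + 1))

def bloch_condition_error(x, k, a=1.0, shift=3):
    """|psi(x + R) - exp(i k R) psi(x)| for the lattice vector R = shift * a."""
    R = shift * a
    lhs = bloch_sum(x + R, k, a)
    rhs = cmath.exp(1j * k * R) * bloch_sum(x, k, a)
    return abs(lhs - rhs)

error = bloch_condition_error(x=0.2, k=1.7)
print(error)  # expected to be at the level of floating-point rounding
```

The residual stays at rounding level only while the shifted sites remain deep inside the truncated sum; restricting the sum to a few sites would spoil the relabeling of the summation variable.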


Thus the wave functions (10.4) satisfy the Bloch condition with wave vector $\mathbf{k}$, while continuing to display the atomic character of the levels. The energy bands arrived at in this way, however, have little structure, $\varepsilon_n(\mathbf{k})$ being simply the energy of the atomic level, $E_n$, regardless of the value of $\mathbf{k}$. To remedy this deficiency we must recognize that a more realistic assumption is that $\psi_n(\mathbf{r})$ becomes small, but not precisely zero, before $\Delta U(\mathbf{r})$ becomes appreciable (see Figure 10.2). This suggests that we seek a solution to the full crystal Schrödinger equation that retains the general form of (10.4):2

$$\psi(\mathbf{r}) = \sum_{\mathbf{R}} e^{i\mathbf{k}\cdot\mathbf{R}}\,\phi(\mathbf{r} - \mathbf{R}), \tag{10.6}$$

but with the function $\phi(\mathbf{r})$ not necessarily an exact atomic stationary-state wave function, but one to be determined by further calculation. If the product $\Delta U(\mathbf{r})\psi_n(\mathbf{r})$, though nonzero, is exceedingly small, we might expect the function $\phi(\mathbf{r})$ to be quite close to the atomic wave function $\psi_n(\mathbf{r})$ or to wave functions with which $\psi_n(\mathbf{r})$ is degenerate. Based on this expectation, one seeks a $\phi(\mathbf{r})$ that can be expanded in a relatively small number of localized atomic wave functions:3,4

$$\phi(\mathbf{r}) = \sum_n b_n\,\psi_n(\mathbf{r}). \tag{10.7}$$

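If we multiply the crystal Schrödinger equation

$$H\psi(\mathbf{r}) = \big(H_{\text{at}} + \Delta U(\mathbf{r})\big)\psi(\mathbf{r}) = \varepsilon(\mathbf{k})\,\psi(\mathbf{r}) \tag{10.8}$$

by the atomic wave function $\psi_m^*(\mathbf{r})$, integrate over all $\mathbf{r}$, and use the fact that

$$\int \psi_m^*(\mathbf{r})\,H_{\text{at}}\,\psi(\mathbf{r})\,d\mathbf{r} = \int \big(H_{\text{at}}\psi_m(\mathbf{r})\big)^*\psi(\mathbf{r})\,d\mathbf{r} = E_m \int \psi_m^*(\mathbf{r})\,\psi(\mathbf{r})\,d\mathbf{r}, \tag{10.9}$$

we find that

$$\big(\varepsilon(\mathbf{k}) - E_m\big)\int \psi_m^*(\mathbf{r})\,\psi(\mathbf{r})\,d\mathbf{r} = \int \psi_m^*(\mathbf{r})\,\Delta U(\mathbf{r})\,\psi(\mathbf{r})\,d\mathbf{r}. \tag{10.10}$$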


2 It turns out (see p. 187) that any Bloch function can be written in the form (10.6), the function $\phi$ being known as a Wannier function, so no generality is lost in this assumption.

3 By including only localized (i.e., bound) atomic wave functions in (10.7) we make our first serious approximation. A complete set of atomic levels includes the ionized ones as well. This is the point at which the method ceases to be applicable to levels well described by the almost free electron approximation.

4 Because of this method of approximating $\phi$, the tight-binding method is sometimes known as the method of the linear combination of atomic orbitals (or LCAO).
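Placing (10.6) and (10.7) into (10.10) and using the orthonormality of the atomic wave functions,

$$\int \psi_m^*(\mathbf{r})\,\psi_n(\mathbf{r})\,d\mathbf{r} = \delta_{nm}, \tag{10.11}$$

we arrive at an eigenvalue equation that determines the coefficients $b_n(\mathbf{k})$ and the Bloch energies $\varepsilon(\mathbf{k})$:

$$\begin{aligned}
\big(\varepsilon(\mathbf{k}) - E_m\big)b_m = {} & -\big(\varepsilon(\mathbf{k}) - E_m\big)\sum_n \left(\sum_{\mathbf{R}\neq 0} \int \psi_m^*(\mathbf{r})\,\psi_n(\mathbf{r} - \mathbf{R})\,e^{i\mathbf{k}\cdot\mathbf{R}}\,d\mathbf{r}\right) b_n \\
& + \sum_n \left(\int \psi_m^*(\mathbf{r})\,\Delta U(\mathbf{r})\,\psi_n(\mathbf{r})\,d\mathbf{r}\right) b_n \\
& + \sum_n \left(\sum_{\mathbf{R}\neq 0} \int \psi_m^*(\mathbf{r})\,\Delta U(\mathbf{r})\,\psi_n(\mathbf{r} - \mathbf{R})\,e^{i\mathbf{k}\cdot\mathbf{R}}\,d\mathbf{r}\right) b_n.
\end{aligned} \tag{10.12}$$

The first term on the right of Eq. (10.12) contains integrals of the form5

$$\int d\mathbf{r}\,\psi_m^*(\mathbf{r})\,\psi_n(\mathbf{r} - \mathbf{R}). \tag{10.13}$$

We interpret our assumption of well-localized atomic levels to mean that (10.13) is small compared to unity. We assume that the integrals in the third term on the right of Eq. (10.12) are small, since they also contain the product of two atomic wave functions centered at different sites. Finally, we assume that the second term on the right of (10.12) is small because we expect the atomic wave functions to become small at distances large enough for the periodic potential to deviate appreciably from the atomic one.6, 7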
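
To get a feeling for how quickly an integral of the form (10.13) falls off with separation, one can evaluate it for a model wave function. The sketch below assumes a normalized one-dimensional Gaussian in place of a real atomic level; this is purely an illustrative choice (for which the overlap is known in closed form, exp(-R²/4σ²)), and the function names and grid parameters are hypothetical.

```python
import math

def gaussian_level(x, sigma):
    """Normalized 1D Gaussian, a stand-in for a localized atomic level."""
    return (math.pi * sigma ** 2) ** -0.25 * math.exp(-x * x / (2.0 * sigma ** 2))

def overlap_integral(separation, sigma, half_width=20.0, n_points=4001):
    """Trapezoidal estimate of the overlap of two levels a distance `separation` apart."""
    step = 2.0 * half_width / (n_points - 1)
    total = 0.0
    for i in range(n_points):
        x = -half_width + i * step
        weight = 0.5 if i in (0, n_points - 1) else 1.0
        total += weight * gaussian_level(x, sigma) * gaussian_level(x - separation, sigma)
    return total * step

for separation in (1.0, 3.0, 6.0):
    numeric = overlap_integral(separation, sigma=1.0)
    exact = math.exp(-separation ** 2 / 4.0)  # closed form for this Gaussian model
    print(separation, numeric, exact)
```

In this model the overlap is already far below unity once the separation is a few times the width of the level, which is the regime the tight-binding assumptions describe.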

Consequently, the right-hand side of (10.12) (and therefore $(\varepsilon(\mathbf{k}) - E_m)b_m$) is always small. This is possible if $\varepsilon(\mathbf{k}) - E_m$ is small whenever $b_m$ is not (and vice versa). Thus $\varepsilon(\mathbf{k})$ must be close to an atomic level, say $E_0$, and all the $b_m$ except those going with that level and levels degenerate with (or close to) it in energy must be small:

$$\varepsilon(\mathbf{k}) \approx E_0, \qquad b_m \approx 0 \ \text{ unless } \ E_m \approx E_0. \tag{10.14}$$

If  the  estimates  in  (10.14)  were  strict  equalities,  we  would  be  back  to  the  extreme 
case  in  which  the  crystal  levels  were  identical  to  the  atomic  ones.  Now,  however,  we 


5 Integrals whose integrands contain a product of wave functions centered on different lattice sites are known as overlap integrals. The tight-binding approximation exploits the smallness of such overlap integrals. They also play an important role in the theory of magnetism (Chapter 32).

6 This last assumption is on somewhat shakier ground than the others, since the ionic potentials need not fall off as rapidly as the atomic wave functions. However, it is also less critical in determining the conclusions we shall reach, since the term in question does not depend on $\mathbf{k}$. In a sense this term simply plays the role of correcting the atomic potentials within each cell to include the fields of the ions outside the cell; it could be made as small as the other two terms by a judicious redefinition of the "atomic" Hamiltonian and levels.

7 Note the similarity of this reasoning to that employed on pages 152 to 156. There, however, we concluded that the wave function was a linear combination of only a small number of plane waves, whose free electron energies were very close together. Here, we conclude that the wave function can be represented, through (10.7) and (10.6), by only a small number of atomic wave functions, whose atomic energies are very close together.
can determine the levels in the crystal more accurately, exploiting (10.14) to estimate the right-hand side of (10.12) by letting the sum over n run only through those levels with energies either degenerate with or very close to $E_0$. If the atomic level 0 is nondegenerate,8 i.e., an s-level, then in this approximation (10.12) reduces to a single equation giving an explicit expression for the energy of the band arising from this s-level (generally referred to as an "s-band"). If we are interested in bands arising from an atomic p-level, which is triply degenerate, then (10.12) would give a set of three homogeneous equations, whose eigenvalues would give the $\varepsilon(\mathbf{k})$ for the three p-bands, and whose solutions $b(\mathbf{k})$ would give the appropriate linear combinations of atomic p-levels making up $\phi$ at the various $\mathbf{k}$'s in the Brillouin zone. To get a d-band from atomic d-levels, we should have to solve a 5 × 5 secular problem, etc.

Should the resulting $\varepsilon(\mathbf{k})$ stray sufficiently far from the atomic values at certain $\mathbf{k}$, it would be necessary to repeat the procedure, adding to the expansion (10.7) of $\phi$ those additional atomic levels whose energies the $\varepsilon(\mathbf{k})$ are approaching. In practice, for example, one generally solves a 6 × 6 secular problem that includes both d- and s-levels in computing the band structure of the transition metals, which have in the atomic state an outer s-shell and a partially filled d-shell. This procedure goes under the name of "s-d mixing" or "hybridization."

Often  the  atomic  wave  functions  have  so  short  a  range  that  only  nearest-neighbor 
terms  in  the  sums  over  R  in  (10.12)  need  be  retained,  which  very  much  simplifies 
subsequent  analysis.  We  briefly  illustrate  the  band  structure  that  emerges  in  the 
simplest  case.9 


APPLICATION  TO  AN  s-BAND  ARISING  FROM  A  SINGLE  ATOMIC 
s-LEVEL 


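If all the coefficients b in (10.12) are zero except that for a single atomic s-level, then (10.12) gives directly the band structure of the corresponding s-band:

$$\varepsilon(\mathbf{k}) = E_s - \frac{\beta + \sum_{\mathbf{R}\neq 0} \gamma(\mathbf{R})\,e^{i\mathbf{k}\cdot\mathbf{R}}}{1 + \sum_{\mathbf{R}\neq 0} \alpha(\mathbf{R})\,e^{i\mathbf{k}\cdot\mathbf{R}}}, \tag{10.15}$$

where $E_s$ is the energy of the atomic s-level, and

$$\beta = -\int d\mathbf{r}\,\Delta U(\mathbf{r})\,|\phi(\mathbf{r})|^2, \tag{10.16}$$

$$\alpha(\mathbf{R}) = \int d\mathbf{r}\,\phi^*(\mathbf{r})\,\phi(\mathbf{r} - \mathbf{R}), \tag{10.17}$$

$$\gamma(\mathbf{R}) = -\int d\mathbf{r}\,\phi^*(\mathbf{r})\,\Delta U(\mathbf{r})\,\phi(\mathbf{r} - \mathbf{R}). \tag{10.18}$$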

8  For  the  moment  we  ignore  spin-orbit  coupling.  We  can  therefore  concentrate  entirely  on  the  orbital 
parts  of  the  levels.  Spin  can  then  be  included  by  simply  multiplying  the  orbital  wave  functions  by  the 
appropriate  spinors,  and  doubling  the  degeneracy  of  each  of  the  orbital  levels. 

9 The simplest case is that of an s-band. The next most complicated case, a p-band, is discussed in Problem 2.
The coefficients (10.16) to (10.18) may be simplified by appealing to certain symmetries. Since $\phi$ is an s-level, $\phi(\mathbf{r})$ is real and depends only on the magnitude r. From this it follows that $\alpha(-\mathbf{R}) = \alpha(\mathbf{R})$. This and the inversion symmetry of the Bravais lattice, which requires that $\Delta U(-\mathbf{r}) = \Delta U(\mathbf{r})$, also imply that $\gamma(-\mathbf{R}) = \gamma(\mathbf{R})$. We ignore the terms in $\alpha$ in the denominator of (10.15), since they give small corrections to the numerator. A final simplification comes from assuming that only nearest-neighbor separations give appreciable overlap integrals.

Putting these observations together, we may simplify (10.15) to

$$\varepsilon(\mathbf{k}) = E_s - \beta - \sum_{\text{n.n.}} \gamma(\mathbf{R}) \cos \mathbf{k}\cdot\mathbf{R}, \tag{10.19}$$

where the sum runs only over those $\mathbf{R}$ in the Bravais lattice that connect the origin to its nearest neighbors.
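
Equation (10.19) translates directly into a short function once a list of nearest-neighbor vectors is supplied. The sketch below is a minimal illustration: it assumes the same γ for every neighbor, uses a simple cubic lattice as the example (a choice made here for brevity, not taken from the text), and picks arbitrary values for E_s, β and γ.

```python
import math

def s_band_energy(k, neighbors, E_s, beta, gamma):
    """Evaluate E_s - beta - sum over neighbors of gamma * cos(k . R)."""
    total = 0.0
    for R in neighbors:
        k_dot_R = sum(k_i * R_i for k_i, R_i in zip(k, R))
        total += gamma * math.cos(k_dot_R)
    return E_s - beta - total

# Assumed example: simple cubic lattice of side a, six nearest neighbors.
a = 1.0
simple_cubic = [(a, 0, 0), (-a, 0, 0), (0, a, 0), (0, -a, 0), (0, 0, a), (0, 0, -a)]
E_s, beta, gamma = 0.0, 0.1, 0.05  # arbitrary energy units

e_center = s_band_energy((0.0, 0.0, 0.0), simple_cubic, E_s, beta, gamma)
e_edge = s_band_energy((math.pi / a, 0.0, 0.0), simple_cubic, E_s, beta, gamma)
print(e_center, e_edge)
```

Because γ(−R) = γ(R), the neighbors enter in ± pairs and the result is real, which is why only cosines appear.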

To be explicit, let us apply (10.19) to a face-centered cubic crystal. The 12 nearest neighbors of the origin (see Figure 10.3) are at

$$\mathbf{R} = \frac{a}{2}(\pm 1, \pm 1, 0), \quad \frac{a}{2}(\pm 1, 0, \pm 1), \quad \frac{a}{2}(0, \pm 1, \pm 1). \tag{10.20}$$

Figure  10.3 

The  12  nearest  neighbors  of  the  origin  in  a  face-centered  cubic 
lattice  with  conventional  cubic  cell  of  side  a 


If $\mathbf{k} = (k_x, k_y, k_z)$, then the corresponding 12 values of $\mathbf{k}\cdot\mathbf{R}$ are

$$\mathbf{k}\cdot\mathbf{R} = \frac{a}{2}(\pm k_i \pm k_j), \qquad i, j = x, y;\ y, z;\ z, x. \tag{10.21}$$

Now $\Delta U(\mathbf{r}) = \Delta U(x, y, z)$ has the full cubic symmetry of the lattice, and is therefore unchanged by permutations of its arguments or changes in their signs. This, together with the fact that the s-level wave function $\phi(\mathbf{r})$ depends only on the magnitude of $\mathbf{r}$, implies that $\gamma(\mathbf{R})$ is the same constant $\gamma$ for all 12 of the vectors (10.20). Consequently, the sum in (10.19) gives, with the aid of (10.21),

$$\varepsilon(\mathbf{k}) = E_s - \beta - 4\gamma\left(\cos\tfrac{1}{2}k_x a\,\cos\tfrac{1}{2}k_y a + \cos\tfrac{1}{2}k_y a\,\cos\tfrac{1}{2}k_z a + \cos\tfrac{1}{2}k_z a\,\cos\tfrac{1}{2}k_x a\right), \tag{10.22}$$

where

$$\gamma = -\int d\mathbf{r}\,\phi^*(x, y, z)\,\Delta U(x, y, z)\,\phi\!\left(x - \tfrac{1}{2}a,\ y - \tfrac{1}{2}a,\ z\right). \tag{10.23}$$

Equation (10.22) reveals the characteristic feature of tight-binding energy bands: The bandwidth (i.e., the spread between the minimum and maximum energies in the band) is proportional to the small overlap integral $\gamma$. Thus the tight-binding bands are narrow bands, and the smaller the overlap, the narrower the band. In the limit of vanishing overlap the bandwidth also vanishes, and the band becomes N-fold degenerate, corresponding to the extreme case in which the electron simply resides on any one of the N isolated atoms. The dependence of bandwidth on overlap integral is illustrated in Figure 10.4.


[Figure 10.4 labels: panels (a) and (b); "N-fold degenerate levels"; "Bands, each with N values of k"]


Figure  10.4 

(a)  Schematic  representation  of  nondegenerate  electronic  levels  in 
an  atomic  potential,  (b)  The  energy  levels  for  N  such  atoms  in  a 
periodic  array,  plotted  as  a  function  of  mean  inverse  interatomic 
spacing.  When  the  atoms  are  far  apart  (small  overlap  integrals) 
the  levels  are  nearly  degenerate,  but  when  the  atoms  are  closer 
together  (larger  overlap  integrals),  the  levels  broaden  into  bands. 


In  addition  to  displaying  the  effect  of  overlap  on  bandwidth,  Eq.  (10.22)  illustrates 
several  general  features  of  the  band  structure  of  a  face-centered  cubic  crystal  that  are 
not  peculiar  to  the  tight-binding  case.  Typical  of  these  are  the  following: 

1. In the limit of small ka, (10.22) reduces to:

$$\varepsilon(\mathbf{k}) = E_s - \beta - 12\gamma + \gamma k^2 a^2. \tag{10.24}$$

This is independent of the direction of $\mathbf{k}$; i.e., the constant-energy surfaces in the neighborhood of $\mathbf{k} = 0$ are spherical.10

2. If $\varepsilon$ is plotted along any line perpendicular to one of the square faces of the first Brillouin zone (Figure 10.5), it will cross the square face with vanishing slope (Problem 1).


Figure  10.5 

The first Brillouin zone for face-centered cubic crystals. The point Γ is at the center of the zone. The names K, L, W, and X are widely used for the points of high symmetry on the zone boundary.


10  This  can  be  deduced  quite  generally  for  any  nondegenerate  band  in  a  crystal  with  cubic  symmetry. 
3. If $\varepsilon$ is plotted along any line perpendicular to one of the hexagonal faces of the first Brillouin zone (Figure 10.5), it need not, in general, cross the face with vanishing slope (Problem 1).11
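
The small-ka limit quoted in item 1 follows from (10.22) by expanding each cosine to second order. A short worked version, keeping terms through order $k^2a^2$ and writing $k^2 = k_x^2 + k_y^2 + k_z^2$, is:

```latex
\begin{align*}
\cos\tfrac12 k_i a \,\cos\tfrac12 k_j a
  &\approx \left(1 - \tfrac18 k_i^2 a^2\right)\left(1 - \tfrac18 k_j^2 a^2\right)
   \approx 1 - \tfrac18\left(k_i^2 + k_j^2\right)a^2, \\
\sum_{(i,j)=(x,y),(y,z),(z,x)} \cos\tfrac12 k_i a \,\cos\tfrac12 k_j a
  &\approx 3 - \tfrac14 k^2 a^2, \\
\varepsilon(\mathbf{k}) &\approx E_s - \beta - 4\gamma\left(3 - \tfrac14 k^2 a^2\right)
  = E_s - \beta - 12\gamma + \gamma k^2 a^2 .
\end{align*}
```

Each component $k_i^2$ appears in two of the three pairs, which is why the sum depends only on $k^2$ and the constant-energy surfaces near $\mathbf{k} = 0$ come out spherical.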

GENERAL  REMARKS  ON  THE  TIGHT-BINDING  METHOD 

1. In cases of practical interest more than one atomic level appears in the expansion (10.7), leading to a 3 × 3 secular problem in the case of three p-levels, a 5 × 5 secular problem for five d-levels, etc. Figure 10.6, for example, shows the band structure that emerges from a tight-binding calculation based on the 5-fold degenerate atomic 3d levels in nickel. The bands are plotted for three directions of symmetry in the zone, each of which has its characteristic set of degeneracies.


Figure  10.6 

A tight-binding calculation of the 3d bands of nickel. (G. C. Fletcher, Proc. Phys. Soc. A65, 192 (1952).) Energies are given in units of $\varepsilon_0$ = 1.349 eV, so the bands are about 2.7 volts wide. The lines along which $\varepsilon$ is plotted are shown in Figure 10.5. Note the characteristic degeneracies along ΓX and ΓL, and the absence of degeneracy along ΓK. The great width of the bands indicates the inadequacy of so elementary a treatment.

2. A quite general feature of the tight-binding method is the relation between bandwidth and the overlap integrals

$$\gamma_{ij}(\mathbf{R}) = -\int d\mathbf{r}\,\phi_i^*(\mathbf{r})\,\Delta U(\mathbf{r})\,\phi_j(\mathbf{r} - \mathbf{R}). \tag{10.25}$$

11 Compare the nearly free electron case (page 158), where the rate of change of $\varepsilon$ along a line normal to a Bragg plane was always found to vanish as the plane was crossed at points far from any other Bragg planes. The tight-binding result illustrates the more general possibility that arises because there is no plane of mirror symmetry parallel to the hexagonal face.

12 The calculated bands are so wide as to cast doubt on the validity of the entire expansion. A more realistic calculation would have to include, at the very least, the effects of the 4s-level.
If the $\gamma_{ij}$ are small, then the bandwidth is correspondingly small. As a rule of
thumb,  when  the  energy  of  a  given  atomic  level  increases  (i.e.,  the  binding  energy 
decreases)  so  does  the  spatial  extent  of  its  wave  function.  Correspondingly,  the  low- 
lying  bands  in  a  solid  are  very  narrow,  but  bandwidths  increase  with  mean  band 
energy.  In  metals  the  highest  band  (or  bands)  are  very  broad,  since  the  spatial  ranges 
of  the  highest  atomic  levels  are  comparable  to  a  lattice  constant,  and  the  tight-binding 
approximation  is  then  of  doubtful  validity. 

3. Although the tight-binding wave function (10.6) is constructed out of localized atomic levels $\phi$, an electron in a tight-binding level will be found, with equal probability, in any cell of the crystal, since its wave function (like any Bloch wave function) changes only by the phase factor $e^{i\mathbf{k}\cdot\mathbf{R}}$ as one moves from one cell to another a distance $\mathbf{R}$ away. Thus as $\mathbf{r}$ varies from cell to cell, there is superimposed on the atomic structure within each cell a sinusoidal variation in the amplitudes of Re $\psi$ and Im $\psi$, as illustrated in Figure 10.7.
Figure 10.7

Characteristic spatial variation of the real (or imaginary) part of the tight-binding wave function (10.6).

A further indication that the tight-binding levels have a running wave or itinerant character comes from the theorem that the mean velocity of an electron in a Bloch level with wave vector $\mathbf{k}$ and energy $\varepsilon(\mathbf{k})$ is given by $\mathbf{v}(\mathbf{k}) = (1/\hbar)\,\partial\varepsilon/\partial\mathbf{k}$. (See Appendix E.) If $\varepsilon$ is independent of $\mathbf{k}$, $\partial\varepsilon/\partial\mathbf{k}$ is zero, which is consistent with the fact that in genuinely isolated atomic levels (which lead to zero bandwidth) the electrons are indeed tied to individual atoms. If, however, there is any nonzero overlap in the atomic wave functions, then $\varepsilon(\mathbf{k})$ will not be constant throughout the zone. Since a small variation in $\varepsilon$ implies a small nonzero value of $\partial\varepsilon/\partial\mathbf{k}$, and hence a small but nonzero mean velocity, as long as there is any overlap electrons will be able to move freely through the crystal! Decreasing the overlap only reduces the velocity; it does not eliminate the motion. One can view this motion as a quantum-mechanical tunneling from lattice site to lattice site. The less the overlap, the lower the tunneling probability, and hence the longer it takes to go a given distance.
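
The dependence of the mean velocity on overlap can be made concrete with the fcc s-band (10.22). The sketch below differentiates that band analytically, checks the result against a finite difference, and shows that the velocity scales linearly with γ. It works in assumed units with ħ = 1, and the parameter values and function names are illustrative only.

```python
import math

def fcc_band(k, a, gamma, E_s=0.0, beta=0.0):
    """Eq. (10.22) for the fcc s-band."""
    cx, cy, cz = (math.cos(0.5 * k_i * a) for k_i in k)
    return E_s - beta - 4.0 * gamma * (cx * cy + cy * cz + cz * cx)

def mean_velocity(k, a, gamma, hbar=1.0):
    """v = (1/hbar) * gradient of Eq. (10.22) with respect to k (analytic form)."""
    s = [math.sin(0.5 * k_i * a) for k_i in k]
    c = [math.cos(0.5 * k_i * a) for k_i in k]
    return [2.0 * gamma * a * s[i] * (c[(i + 1) % 3] + c[(i + 2) % 3]) / hbar
            for i in range(3)]

def finite_difference_velocity(k, a, gamma, hbar=1.0, dk=1e-5):
    """Central-difference estimate of the same gradient, used as a cross-check."""
    velocity = []
    for i in range(3):
        k_plus, k_minus = list(k), list(k)
        k_plus[i] += dk
        k_minus[i] -= dk
        velocity.append((fcc_band(k_plus, a, gamma) - fcc_band(k_minus, a, gamma))
                        / (2.0 * dk * hbar))
    return velocity

k_point = (0.4, 0.9, -0.3)
v_small = mean_velocity(k_point, a=1.0, gamma=0.01)
v_large = mean_velocity(k_point, a=1.0, gamma=0.02)
v_check = finite_difference_velocity(k_point, a=1.0, gamma=0.01)
print(v_small, v_large, v_check)
```

Halving γ halves every component of the velocity, and setting γ = 0 gives zero velocity, in line with the statement that decreasing the overlap slows the motion without eliminating it.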

4. In solids that are not monatomic Bravais lattices, the tight-binding approximation is more complicated. This problem arises in the hexagonal close-packed
metals,  which  are  simple  hexagonal  with  a  two-point  basis.  Formally,  one  can  treat 
the  two-point  basis  as  a  molecule,  whose  wave  functions  are  assumed  to  be  known, 
and  proceed  as  above,  using  molecular  instead  of  atomic  wave  functions.  If  the 
nearest-neighbor  overlap  remains  small,  then,  in  particular,  it  will  be  small  in  each 
“molecule,”  and  an  atomic  s-level  will  give  rise  to  two  nearly  degenerate  molecular 
levels.  Thus  a  single  atomic  s-level  yields  two  tight-binding  bands  in  the  hexagonal 
close-packed  structure. 
Alternatively, one can proceed by continuing to construct linear combinations of atomic levels centered at the Bravais lattice points and at the basis points, generalizing (10.6) to

$$\psi(\mathbf{r}) = \sum_{\mathbf{R}} e^{i\mathbf{k}\cdot\mathbf{R}}\big(a\,\phi(\mathbf{r} - \mathbf{R}) + b\,\phi(\mathbf{r} - \mathbf{d} - \mathbf{R})\big), \tag{10.26}$$

(where $\mathbf{d}$ is the separation of the two basis atoms). This can be viewed as essentially the first approach, in which, however, approximate molecular wave functions are used, the approximation to the molecular levels being combined with the tight-binding approximation to the levels of the entire crystal.13
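
How a trial function of the form (10.26) produces two bands from one atomic s-level can be seen in a toy model. The sketch below is not the hexagonal close-packed case: it assumes a one-dimensional chain with two identical atoms per cell, nearest-neighbor matrix elements −γ1 (within a cell) and −γ2 (between cells), and neglected overlap integrals α. Under those assumptions the coefficients a and b obey a 2 × 2 secular problem whose roots are E0 ± |γ1 + γ2 e^{−ika}|, with E0 playing the role of E_s − β. All names and values are hypothetical.

```python
import cmath
import math

def two_band_energies(k, a, E0, gamma1, gamma2):
    """Roots of the 2x2 secular problem [[E0, -c], [-conj(c), E0]] for the chain."""
    coupling = gamma1 + gamma2 * cmath.exp(-1j * k * a)
    return E0 - abs(coupling), E0 + abs(coupling)

a, E0, gamma1, gamma2 = 1.0, 0.0, 0.05, 0.03  # arbitrary illustrative values
lower_center, upper_center = two_band_energies(0.0, a, E0, gamma1, gamma2)
lower_edge, upper_edge = two_band_energies(math.pi / a, a, E0, gamma1, gamma2)
print(lower_center, upper_center, lower_edge, upper_edge)
```

In this model the two bands are separated at the zone edge by 2|γ1 − γ2| and merge there if the two matrix elements are equal; as both γ's shrink, the bands collapse onto the single level E0.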

5. In the heavier elements spin-orbit coupling (see page 169) is of great importance in determining the atomic levels, and should therefore be included in a tight-binding treatment of the broadening of these levels into bands in the solid. In principle the extension is straightforward. We simply include in $\Delta U(\mathbf{r})$ the interaction between the electron's spin and the electric field of all ions except the one at the origin, incorporating that interaction into the atomic Hamiltonian. Once this is done we can no longer use spin-independent linear combinations of atomic orbital wave functions, but must work with linear combinations of both orbital and spin levels. Thus the tight-binding theory of an s-level, when spin-orbit coupling is appreciable, would approximate $\phi$ not by a single atomic s-level but by a linear combination (with $\mathbf{k}$ dependent coefficients) of two levels with the same orbital wave functions and two opposite spins. The tight-binding theory of a d-band would go from a 5 × 5 determinantal problem to a 10 × 10 one, etc. As mentioned in Chapter 9, effects of spin-orbit coupling, though often small, can frequently be quite crucial, as when they eliminate degeneracies that would rigorously be present if such coupling were ignored.14

6.  All  the  analysis  of  electronic  levels  in  a  periodic  potential  in  this  chapter  (and 
the  preceding  two)  has  been  done  within  the  independent  electron  approximation, 
which  either  neglects  the  interaction  between  electrons,  or,  at  best,  includes  it  in  some 
average  way  through  the  effective  periodic  potential  experienced  by  each  single 
electron.  We  shall  see  in  Chapter  32  that  the  independent  electron  approximation 
can  fail  when  it  gives  at  least  one  partially  filled  band  that  derives  from  well-localized 
atomic  levels  with  small  overlap  integrals.  In  many  cases  of  interest  (notably  in 
insulators  and  for  the  very  low-lying  bands  in  metals)  this  problem  does  not  arise, 
since  the  tight-binding  bands  are  so  low  in  energy  as  to  be  completely  filled.  However, 
the  possibility  of  such  a  failure  of  the  independent  electron  approximation  must  be 
kept in mind when narrow tight-binding bands are derived from partially filled
atomic shells (in metals, generally the d- and f-shells). One should be particularly
aware of this possibility in solids with a magnetic structure.

This  failure  of  the  independent  electron  approximation  obscures  the  simple  picture 
the  tight-binding  approximation  suggests:  that  of  a  continuous  transition  from  the 


13 The “approximate molecular wave functions” will thus be k-dependent.

14 The inclusion of spin-orbit coupling in the tight-binding method is discussed by J. Friedel, P.
Lenglart, and G. Leman, J. Phys. Chem. Solids 25, 781 (1964).
metallic to the atomic state as the interatomic distance is continuously increased.15
If  we  took  the  tight-binding  approximation  at  face  value,  then  as  the  lattice  constant 
in  a  metal  increased,  the  overlap  between  all  atomic  levels  would  eventually  become 
small,  and  all  bands — even  the  partially  filled  conduction  band  (or  bands) — would 
eventually  become  narrow  tight-binding  bands.  As  the  conduction  band  narrowed, 
the  velocity  of  the  electrons  in  it  would  diminish  and  the  conductivity  of  the  metal 
would  drop.  Thus,  we  would  expect  a  conductivity  that  dropped  continuously  to 
zero  with  the  overlap  integrals  as  the  metal  was  expanded. 

In fact, however, it is likely that a full calculation going beyond the independent
electron approximation would predict that beyond a certain nearest-neighbor
separation the conductivity should drop abruptly to zero, the material becoming an
insulator (the so-called Mott transition).

The reason for this departure from the tight-binding prediction lies in the inability
of the independent electron approximation to treat the very strong additional
repulsion a second electron feels at a given atomic site when another electron is already
there. We shall comment further on this in Chapter 32, but we mention the problem
here because it is sometimes described as a failure of the tight-binding method.16
This is somewhat misleading in that the failure occurs when the tight-binding
approximation to the independent electron model is at its best; it is the independent electron
approximation itself that fails.


WANNIER  FUNCTIONS 

We conclude this chapter with a demonstration that the Bloch functions for any
band can always be written in the form (10.4) on which the tight-binding
approximation is based. The functions $\phi$ that play the role of the atomic wave functions are
known as Wannier functions. Such Wannier functions can be defined for any band,
whether or not it is well described by the tight-binding approximation; but if the band
is not a narrow tight-binding band, the Wannier functions will bear little resemblance
to any of the electronic wave functions for the isolated atom.

To establish that any Bloch function can be written in the form (10.4), we
first note that considered as a function of k for fixed r, $\psi_{n\mathbf{k}}(\mathbf{r})$ is periodic in the reciprocal
lattice. It therefore has a Fourier series expansion in plane waves with wave vectors
in the reciprocal of the reciprocal lattice, i.e., in the direct lattice. Thus for any fixed
r we can write

$$\psi_{n\mathbf{k}}(\mathbf{r}) = \sum_{\mathbf{R}} f_n(\mathbf{R}, \mathbf{r})\, e^{i\mathbf{R}\cdot\mathbf{k}}, \tag{10.27}$$

where  the  coefficients  in  the  sum  depend  on  r  as  well  as  on  the  “wave  vectors”  R, 
since  for  each  r  it  is  a  different  function  of  k  that  is  being  expanded. 


15 A difficult procedure to realize in the laboratory, but a very tempting one to visualize theoretically,
as  an  aid  in  understanding  the  nature  of  energy  bands. 

16 See, for example, H. Jones, The Theory of Brillouin Zones and Electron States in Crystals,
North-Holland, Amsterdam, 1960, p. 229.
The Fourier coefficients in (10.27) are given by the inversion formula17

$$f_n(\mathbf{R}, \mathbf{r}) = \frac{1}{v_0} \int d\mathbf{k}\; e^{-i\mathbf{R}\cdot\mathbf{k}}\, \psi_{n\mathbf{k}}(\mathbf{r}). \tag{10.28}$$

Equation (10.27) is of the form (10.4), provided that the function $f_n(\mathbf{R}, \mathbf{r})$ depends
on r and R only through their difference, r − R. But if r and R are both shifted by
the Bravais lattice vector $\mathbf{R}_0$, then $f_n$ is indeed unchanged as a direct consequence of
(10.28) and Bloch’s theorem, in the form (8.5). Thus $f_n(\mathbf{R}, \mathbf{r})$ has the form:

$$f_n(\mathbf{R}, \mathbf{r}) = \phi_n(\mathbf{r} - \mathbf{R}). \tag{10.29}$$
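
The inversion formula (10.28) and the conclusion (10.29) can be checked numerically on a small model. The sketch below is only a sanity check under stated assumptions: it works in one dimension, takes a hypothetical Gaussian for the localized function, builds Bloch sums of the form (10.4) by construction (so they are not normalized eigenfunctions of any particular potential), and replaces the zone integral by a uniform average over `N_K` points. The names `phi`, `bloch`, and `fourier_coefficient` are illustrative.

```python
import cmath
import math

A = 1.0        # lattice constant (arbitrary units, assumed for this toy model)
N_SITES = 8    # the Bloch sum runs over R = -N_SITES*A, ..., +N_SITES*A
N_K = 64       # number of k-points sampling the first Brillouin zone

def phi(x):
    """Hypothetical localized function playing the role of phi in (10.4)."""
    return math.exp(-4.0 * x * x)

def bloch(k, x):
    """Truncated Bloch sum psi_k(x) = sum_R exp(i k R) phi(x - R)."""
    return sum(cmath.exp(1j * k * n * A) * phi(x - n * A)
               for n in range(-N_SITES, N_SITES + 1))

def fourier_coefficient(n_site, x):
    """f(R, x) of (10.28) for R = n_site * A.

    The integral (1/v0) * int dk over the zone is replaced by a uniform
    average over N_K points in -pi/A <= k < pi/A.
    """
    big_r = n_site * A
    total = 0.0
    for j in range(N_K):
        k = -math.pi / A + 2.0 * math.pi * j / (N_K * A)
        total += cmath.exp(-1j * k * big_r) * bloch(k, x)
    return total / N_K

# f depends on x and R only through x - R: these two values agree.
f_shifted = fourier_coefficient(1, 1.3)
f_origin = fourier_coefficient(0, 0.3)
```

In this model the recovered coefficient equals `phi(x - R)` up to rounding, which is the content of (10.29). For Bloch functions obtained from a real band calculation the same average would yield the Wannier function, which in general is not an atomic function.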

Unlike tight-binding atomic functions $\phi(\mathbf{r})$, the Wannier functions $\phi_n(\mathbf{r} - \mathbf{R})$ at
different sites (or with different band indices) are orthogonal (see Problem 3, Eq. (10.35)).
Since the complete set of Bloch functions can be written as linear combinations of
the Wannier functions, the Wannier functions $\phi_n(\mathbf{r} - \mathbf{R})$ for all n and R form a
complete orthogonal set. They therefore offer an alternative basis for an exact
description of the independent electron levels in a crystal potential.

The similarity in form of the Wannier functions to the tight-binding functions
leads one to hope that the Wannier functions will also be localized, i.e., that when
r is very much larger than some length on the atomic scale, $\phi_n(\mathbf{r})$ will be negligibly
small. To the extent that this can be established, the Wannier functions offer an
ideal tool for discussing phenomena in which the spatial localization of electrons
plays an important role. Perhaps the most important areas of application are these:

1.  Attempts  to  derive  a  transport  theory  for  Bloch  electrons.  The  analog  of  free 
electron  wave  packets,  electronic  levels  in  a  crystal  that  are  localized  in  both 
r  and  k,  are  conveniently  constructed  with  the  use  of  Wannier  functions.  The 
theory  of  Wannier  functions  is  closely  related  to  the  theory  of  when  and  how 
the  semiclassical  theory  of  transport  by  Bloch  electrons  (Chapters  12  and  13) 
breaks  down. 

2.  Phenomena  involving  localized  electronic  levels,  due,  for  example,  to  attractive 
impurities  that  bind  an  electron.  A  very  important  example  is  the  theory  of  donor 
and  acceptor  levels  in  semiconductors  (Chapter  28). 

3.  Magnetic  phenomena,  in  which  localized  magnetic  moments  are  found  to  exist 
at  suitable  impurity  sites. 

Theoretical  discussions  of  the  range  of  Wannier  functions  are  in  general  quite 
subtle.18  Roughly  speaking,  the  range  of  the  Wannier  function  decreases  as  the  band 
gap  increases  (as  one  might  expect  from  the  tight-binding  approximation,  in  which 
the  bands  become  narrower  as  the  range  of  the  atomic  wave  functions  decreases). 
The  various  “breakdown”  and  “breakthrough”  phenomena  we  shall  mention  in 


17 Here $v_0$ is the volume in k-space of the first Brillouin zone, and the integral is over the zone.
Equations (10.27) and (10.28) (with r regarded as a fixed parameter) are just Eqs. (D.1) and (D.2) of
Appendix D, with direct and reciprocal space interchanged.

18 A relatively simple argument, but only in one dimension, is given by W. Kohn, Phys. Rev. 115,
809 (1959). A more general discussion can be found in E. I. Blount, Solid State Physics, Vol. 13, Academic
Press, New York, 1962, p. 305.
Chapter 12 that occur when the band gap is small find their reflection in the fact
that theories based on the localization of the Wannier functions become less reliable
in this limit.


PROBLEMS 

1. (a) Show that along the principal symmetry directions shown in Figure 10.5 the
tight-binding expression (10.22) for the energies of an s-band in a face-centered cubic crystal reduces
to the following:

(i) Along ΓX ($k_y = k_z = 0$, $k_x = \mu\, 2\pi/a$, $0 \le \mu \le 1$):

$$\mathcal{E} = E_s - \beta - 4\gamma\,(1 + 2\cos\mu\pi).$$

(ii) Along ΓL ($k_x = k_y = k_z = \mu\, 2\pi/a$, $0 \le \mu \le \tfrac{1}{2}$):

$$\mathcal{E} = E_s - \beta - 12\gamma\cos^2\mu\pi.$$

(iii) Along ΓK ($k_z = 0$, $k_x = k_y = \mu\, 2\pi/a$, $0 \le \mu \le \tfrac{3}{4}$):

$$\mathcal{E} = E_s - \beta - 4\gamma\,(\cos^2\mu\pi + 2\cos\mu\pi).$$

(iv) Along ΓW ($k_z = 0$, $k_x = \mu\, 2\pi/a$, $k_y = \tfrac{1}{2}\mu\, 2\pi/a$, $0 \le \mu \le 1$):

$$\mathcal{E} = E_s - \beta - 4\gamma\,(\cos\mu\pi + \cos\tfrac{1}{2}\mu\pi + \cos\mu\pi\cos\tfrac{1}{2}\mu\pi).$$

(b) Show that on the square faces of the zone the normal derivative of $\mathcal{E}$ vanishes.

(c) Show that on the hexagonal faces of the zone, the normal derivative of $\mathcal{E}$ vanishes only
along lines joining the center of the hexagon to its vertices.
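
Before attempting part (a) analytically, the four reduced forms can be checked numerically. The sketch below assumes that (10.22) has the standard nearest-neighbor form for an fcc s-band, $\mathcal{E} = E_s - \beta - 4\gamma(\cos\tfrac12 k_x a\cos\tfrac12 k_y a + \cos\tfrac12 k_y a\cos\tfrac12 k_z a + \cos\tfrac12 k_z a\cos\tfrac12 k_x a)$; the function names and the default parameter values are illustrative only.

```python
import math

def fcc_s_band(kx, ky, kz, a=1.0, e_s=0.0, beta=0.0, gamma=1.0):
    """Nearest-neighbor fcc s-band energy, in the form assumed for (10.22)."""
    cx, cy, cz = (math.cos(0.5 * k * a) for k in (kx, ky, kz))
    return e_s - beta - 4.0 * gamma * (cx * cy + cy * cz + cz * cx)

def along_gamma_x(mu, gamma=1.0):
    return -4.0 * gamma * (1.0 + 2.0 * math.cos(mu * math.pi))

def along_gamma_l(mu, gamma=1.0):
    return -12.0 * gamma * math.cos(mu * math.pi) ** 2

def along_gamma_k(mu, gamma=1.0):
    c = math.cos(mu * math.pi)
    return -4.0 * gamma * (c * c + 2.0 * c)

def along_gamma_w(mu, gamma=1.0):
    c, h = math.cos(mu * math.pi), math.cos(0.5 * mu * math.pi)
    return -4.0 * gamma * (c + h + c * h)

def max_deviation(samples=50):
    """Largest difference between the general form and the reduced forms (a = 1)."""
    q = 2.0 * math.pi  # 2*pi/a with a = 1
    worst = 0.0
    for i in range(samples + 1):
        mu = i / samples
        worst = max(
            worst,
            abs(fcc_s_band(mu * q, 0.0, 0.0) - along_gamma_x(mu)),
            abs(fcc_s_band(0.5 * mu * q, 0.5 * mu * q, 0.5 * mu * q) - along_gamma_l(0.5 * mu)),
            abs(fcc_s_band(0.75 * mu * q, 0.75 * mu * q, 0.0) - along_gamma_k(0.75 * mu)),
            abs(fcc_s_band(mu * q, 0.5 * mu * q, 0.0) - along_gamma_w(mu)),
        )
    return worst
```

A numerical agreement of this kind is not a proof, but it is a quick way to catch a sign or factor-of-two slip in the algebra.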

2. Tight-Binding p-Bands in Cubic Crystals

In dealing with cubic crystals, the most convenient linear combinations of three degenerate
atomic p-levels have the form $x\phi(r)$, $y\phi(r)$, and $z\phi(r)$, where the function $\phi$ depends only on
the magnitude of the vector r. The energies of the three corresponding p-bands are found from
(10.12) by setting to zero the determinant

$$\bigl| (\mathcal{E}(\mathbf{k}) - E_p)\,\delta_{ij} + \beta_{ij} + \tilde{\gamma}_{ij}(\mathbf{k}) \bigr| = 0, \tag{10.30}$$

where

$$\tilde{\gamma}_{ij}(\mathbf{k}) = \sum_{\mathbf{R}} e^{i\mathbf{k}\cdot\mathbf{R}}\, \gamma_{ij}(\mathbf{R}),$$

$$\gamma_{ij}(\mathbf{R}) = -\int d\mathbf{r}\; \psi_i^*(\mathbf{r})\, \psi_j(\mathbf{r} - \mathbf{R})\, \Delta U(\mathbf{r}),$$

$$\beta_{ij} = \gamma_{ij}(\mathbf{R} = 0). \tag{10.31}$$

(A term multiplying $\mathcal{E}(\mathbf{k}) - E_p$, which gives rise to very small corrections analogous to those
given by the denominator of (10.15) in the s-band case, has been omitted from (10.30).)

(a) As a consequence of cubic symmetry, show that

$$\beta_{xx} = \beta_{yy} = \beta_{zz} = \beta, \qquad \beta_{xy} = 0. \tag{10.32}$$

(b) Assuming that the $\gamma_{ij}(\mathbf{R})$ are negligible except for nearest-neighbor R, show that $\tilde{\gamma}_{ij}(\mathbf{k})$
is diagonal for a simple cubic Bravais lattice, so that $x\phi(r)$, $y\phi(r)$, and $z\phi(r)$ each generate
independent bands. (Note that this ceases to be the case if the $\gamma_{ij}(\mathbf{R})$ for next-nearest-neighbor R are
also retained.)

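(c) For a face-centered cubic Bravais lattice with only nearest-neighbor $\gamma_{ij}$ appreciable,
show that the energy bands are given by the roots of

$$
0 = \begin{vmatrix}
\mathcal{E}(\mathbf{k}) - \mathcal{E}^0(\mathbf{k}) + 4\gamma_0 \cos\tfrac12 k_y a \cos\tfrac12 k_z a &
-4\gamma_1 \sin\tfrac12 k_x a \sin\tfrac12 k_y a &
-4\gamma_1 \sin\tfrac12 k_x a \sin\tfrac12 k_z a \\
-4\gamma_1 \sin\tfrac12 k_y a \sin\tfrac12 k_x a &
\mathcal{E}(\mathbf{k}) - \mathcal{E}^0(\mathbf{k}) + 4\gamma_0 \cos\tfrac12 k_z a \cos\tfrac12 k_x a &
-4\gamma_1 \sin\tfrac12 k_y a \sin\tfrac12 k_z a \\
-4\gamma_1 \sin\tfrac12 k_z a \sin\tfrac12 k_x a &
-4\gamma_1 \sin\tfrac12 k_z a \sin\tfrac12 k_y a &
\mathcal{E}(\mathbf{k}) - \mathcal{E}^0(\mathbf{k}) + 4\gamma_0 \cos\tfrac12 k_x a \cos\tfrac12 k_y a
\end{vmatrix}
\tag{10.33}
$$

where

$$\mathcal{E}^0(\mathbf{k}) = E_p - \beta - 4\gamma_2 \bigl( \cos\tfrac12 k_y a \cos\tfrac12 k_z a + \cos\tfrac12 k_z a \cos\tfrac12 k_x a + \cos\tfrac12 k_x a \cos\tfrac12 k_y a \bigr),$$

$$\gamma_0 = -\int d\mathbf{r}\; \bigl[ x^2 - y(y - \tfrac12 a) \bigr]\, \phi(r)\, \phi\bigl( [x^2 + (y - \tfrac12 a)^2 + (z - \tfrac12 a)^2]^{1/2} \bigr)\, \Delta U(\mathbf{r}),$$

$$\gamma_1 = -\int d\mathbf{r}\; x(y - \tfrac12 a)\, \phi(r)\, \phi\bigl( [(x - \tfrac12 a)^2 + (y - \tfrac12 a)^2 + z^2]^{1/2} \bigr)\, \Delta U(\mathbf{r}),$$

$$\gamma_2 = -\int d\mathbf{r}\; x(x - \tfrac12 a)\, \phi(r)\, \phi\bigl( [(x - \tfrac12 a)^2 + (y - \tfrac12 a)^2 + z^2]^{1/2} \bigr)\, \Delta U(\mathbf{r}). \tag{10.34}$$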

(d) Show that all three bands are degenerate at k = 0, and that when k is directed along
either a cube axis (ΓX) or a cube diagonal (ΓL) there is a double degeneracy. Sketch the energy
bands (in analogy to Figure 10.6) along these directions.


3. Prove that Wannier functions centered on different lattice sites are orthogonal,

$$\int \phi_n^*(\mathbf{r} - \mathbf{R})\, \phi_{n'}(\mathbf{r} - \mathbf{R}')\, d\mathbf{r} \propto \delta_{n,n'}\, \delta_{\mathbf{R},\mathbf{R}'}, \tag{10.35}$$

by appealing to the orthonormality of the Bloch functions and the identity (F.4) of Appendix F.
Show also that

$$\int d\mathbf{r}\; |\phi_n(\mathbf{r})|^2 = 1 \tag{10.36}$$

if the integral of the $|\psi_{n\mathbf{k}}(\mathbf{r})|^2$ over a primitive cell is normalized to unity.

In Chapters 9 and 10 we explored approximate solutions to the one-electron
Schrödinger equation in the limiting cases of nearly free electrons, and tight binding.
In most cases of interest the tight-binding approximation (at least in the simple form
outlined in Chapter 10) is suitable only for the representation of bands arising from
the ion core levels, while the nearly free electron approximation cannot be directly
applied to any real solid.1 The purpose of this chapter is therefore to describe some
of the more common methods actually used in the calculation of real band structures.

We remarked in Chapter 8 that in merely writing down a separate Schrödinger
equation2

$$\left( -\frac{\hbar^2}{2m}\nabla^2 + U(\mathbf{r}) \right) \psi_{\mathbf{k}}(\mathbf{r}) = \mathcal{E}(\mathbf{k})\, \psi_{\mathbf{k}}(\mathbf{r}) \tag{11.1}$$

for each electron we are already enormously simplifying the actual problem of many
interacting electrons in a periodic potential. In an exact treatment each electron
cannot be described by a wave function determined by a single-particle Schrödinger
equation, independent of all the others.

The independent electron approximation does not in fact entirely neglect
electron-electron interactions. Rather it assumes that most of their important effects can be
taken into account with a sufficiently clever choice for the periodic potential $U(\mathbf{r})$
appearing in the one-electron Schrödinger equation. Thus $U(\mathbf{r})$ contains not only the
periodic potential due to the ions alone, but also periodic effects due to the interaction
of the electron (whose wave function appears in (11.1)) with all the other electrons. The
latter interaction depends on the configuration of the other electrons; i.e., it depends
on their individual wave functions, which are also determined by a Schrödinger
equation of the form (11.1). Thus to know the potential appearing in (11.1), one must
first know all the solutions to (11.1). Since, however, to know the solutions one must
know the potential, one is in for some difficult mathematical efforts.

The simplest (and often the most practical) procedure is to start with a shrewd
guess, $U_0(\mathbf{r})$, for $U(\mathbf{r})$, calculate from (11.1) the wave functions for the occupied
electronic levels, and from these recompute $U(\mathbf{r})$. If the new potential, $U_1(\mathbf{r})$, is the
same as (or very close to) $U_0(\mathbf{r})$, one says that self-consistency has been achieved and
takes $U = U_1$ for the actual potential. If $U_1$ differs from $U_0$, one repeats the procedure
starting with $U_1$, taking $U_2$ as the actual potential if it is very close to $U_1$, and
otherwise continuing on to the calculation of $U_3$. The hope is that this procedure will
converge, eventually yielding a self-consistent potential that reproduces itself.3
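
The logic of this iteration can be sketched on a deliberately tiny model. Everything specific below is an assumption made for illustration, not a method from the text: the "crystal" is reduced to two sites joined by a hopping amplitude `t`, a single electron occupies the lower level, and the potential on each site is a fixed ionic part `v_ion` plus a mean-field term `g` times the electron density on that site. The linear mixing parameter `mix` is a common numerical device for stabilizing such iterations and is likewise an addition of this sketch.

```python
import math

def ground_state_density(u1, u2, t):
    """Site occupations (n1, n2) of the lower eigenstate of [[u1, -t], [-t, u2]]."""
    delta = 0.5 * (u1 - u2)
    root = math.hypot(delta, t)
    n1 = 0.5 * (1.0 - delta / root)
    return n1, 1.0 - n1

def self_consistent_potential(v_ion, g, t, u_guess, mix=0.5, tol=1e-10, max_iter=500):
    """Iterate guess -> levels -> recomputed potential until it reproduces itself."""
    u = tuple(u_guess)
    for iteration in range(1, max_iter + 1):
        # Step 1: solve the one-electron problem in the current potential.
        n = ground_state_density(u[0], u[1], t)
        # Step 2: recompute the potential from the occupied level.
        u_new = tuple(v + g * n_i for v, n_i in zip(v_ion, n))
        # Step 3: stop if the new potential is very close to the old one.
        if max(abs(a - b) for a, b in zip(u_new, u)) < tol:
            return u_new, iteration
        # Otherwise continue from a mixture of old and new potentials.
        u = tuple((1.0 - mix) * a + mix * b for a, b in zip(u, u_new))
    raise RuntimeError("self-consistency not reached")

# Start from the bare ionic potential as the initial guess U_0.
u_sc, n_iter = self_consistent_potential(v_ion=(0.0, 1.0), g=2.0, t=1.0,
                                         u_guess=(0.0, 1.0))
```

The returned potential is a fixed point of the map from potential to potential, which is the sense in which it "reproduces itself." In a real band calculation each pass through step 1 is a full solution of (11.1), which is where nearly all the effort goes.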

We shall assume in this chapter (as in Chapters 8-10) that the potential $U(\mathbf{r})$ is a
given function; i.e., that we are either engaged in the first step of this iterative procedure
or, by a fortunate guess, are able to work with a reasonably self-consistent $U(\mathbf{r})$ from
the start. The reliability of the methods we are about to describe is limited not only
by the accuracy of the computed solutions to (11.1), which can be quite high, but
also by the accuracy with which we have been able to estimate the potential $U(\mathbf{r})$.


1  However,  more  sophisticated  techniques  often  yield  an  analysis  very  much  like  the  nearly  free 
electron  approximation  in  a  suitably  modified  potential,  known  as  the  pseudopotential  (see  below). 

2  We  continue  to  drop  explicit  reference  to  the  band  index  n,  except  when  this  would  lead  to  ambiguity. 

3 One must remember, however, that even the self-consistent solution is still only an approximate
solution to the vastly more complex many-body problem.
The resulting $\mathcal{E}_n(\mathbf{k})$ display a disconcerting sensitivity to errors in the construction
of the potential, and it is often the case that the final accuracy of the computed band
structure is limited more by the problem of finding the potential than by the difficulties
in solving the Schrödinger equation (11.1) for a given U. This is strikingly illustrated
in Figure 11.1.

Figure 11.1

Energy bands for vanadium, calculated for two possible choices of crystal potential
$U(\mathbf{r})$. Vanadium is body-centered cubic and the bands are plotted along the [100]
direction from the origin to the Brillouin zone boundary. The atomic structure of
vanadium is five electrons around a closed-shell argon configuration. The bands displayed
are the 3d and 4s derived bands (and higher bands). (a) The bands are shown as
calculated in a $U(\mathbf{r})$ derived from an assumed $3d^3 4s^2$ configuration for atomic vanadium.
(b) The bands are shown based on an assumed $3d^4 4s^1$ atomic configuration. (From
L. F. Mattheiss, Phys. Rev. 134, A970 (1964).)


Another point to emphasize at the start is that none of the methods we shall describe
can be carried through analytically, except in the simplest one-dimensional examples.
All require modern, high-speed computers for their execution. Progress in the
theoretical calculation of energy bands has kept close pace with development of larger
and faster computers, and the kinds of approximations one is likely to consider are
influenced by available computational techniques.4

GENERAL  FEATURES  OF  VALENCE-BAND  WAVE  FUNCTIONS 

Since the low-lying core levels are well described by tight-binding wave functions,
calculational methods aim at the higher-lying bands (which may be either filled,
partially filled, or empty). These bands are referred to in this context, in contrast to
the tight-binding core bands, as the valence bands.5 The valence bands determine the
electronic behavior of a solid in a variety of circumstances, electrons in the core
levels being inert for many purposes.

The essential difficulty in practical calculations of the valence-band wave functions
and energies is revealed when one asks why the nearly free electron approximation
of Chapter 9 cannot be applied to the valence bands in an actual solid. A simple,


4 See, for example, Computational Methods in Band Theory, P. M. Marcus, J. F. Janak, and A. R.
Williams, eds., Plenum Press, New York, 1971; and Methods in Computational Physics: Energy Bands in
Solids, Vol. 8, B. Alder, S. Fernbach, and M. Rotenberg, eds., Academic Press, New York, 1968.

5 Unfortunately the same term, “valence band,” is used in the theory of semiconductors with a rather
more narrow meaning. See Chapter 28.
but superficial, reason is that the potential is not small. Very roughly we might estimate
that, at least well within the ion core, $U(\mathbf{r})$ has the coulombic form

$$U(\mathbf{r}) \sim -\frac{Z_a e^2}{r}, \tag{11.2}$$

where $Z_a$ is the atomic number. The contribution of (11.2) to the Fourier components
$U_{\mathbf{K}}$ in Eq. (9.2) will be (see p. 167 and Eq. (17.73)):

$$U_{\mathbf{K}} \approx -\frac{1}{v}\, \frac{4\pi Z_a e^2}{K^2}. \tag{11.3}$$

If we write this as

$$|U_{\mathbf{K}}| \approx \frac{e^2}{2a_0}\, \frac{a_0^3}{v}\, \frac{8\pi Z_a}{(a_0 K)^2}, \qquad \frac{e^2}{2a_0} = 13.6\ \text{eV}, \tag{11.4}$$

we see that $U_{\mathbf{K}}$ can be of the order of several electron volts for a very large number
of reciprocal lattice vectors K and is therefore comparable to the kinetic energies
appearing in Eq. (9.2). Thus the assumption that $U_{\mathbf{K}}$ is small compared to these
kinetic energies is not permissible.
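
To attach a number to this estimate, the sketch below evaluates the right-hand side of (11.4), read as $|U_{\mathbf{K}}| \approx 13.6\ \text{eV} \times (a_0^3/v)\, 8\pi Z_a/(a_0 K)^2$ with v the primitive cell volume. The inputs are illustrative assumptions rather than values taken from the text: a body-centered cubic metal with $Z_a = 11$ and a cubic lattice constant of 4.23 Å (roughly sodium), and K taken as the shortest reciprocal lattice vector, of length $2\pi\sqrt{2}/a$.

```python
import math

RYDBERG_EV = 13.6       # e^2 / (2 a_0) in electron volts, as in (11.4)
BOHR_ANGSTROM = 0.529   # Bohr radius a_0 in angstroms

def coulomb_fourier_component_ev(z_a, cell_volume, k_magnitude):
    """|U_K| in eV from (11.4); cell_volume in A^3, k_magnitude in 1/A."""
    return (RYDBERG_EV * 8.0 * math.pi * z_a
            * BOHR_ANGSTROM ** 3 / cell_volume
            / (BOHR_ANGSTROM * k_magnitude) ** 2)

# Illustrative input (assumed): bcc lattice, one atom per primitive cell.
lattice_constant = 4.23                                     # angstroms
cell_volume = lattice_constant ** 3 / 2.0                   # bcc primitive cell
k_shortest = 2.0 * math.pi * math.sqrt(2.0) / lattice_constant

u_k = coulomb_fourier_component_ev(11, cell_volume, k_shortest)
```

With these inputs the estimate comes out on the order of 10 eV, and because it falls off only as $1/K^2$ it remains at the electron-volt level for many further reciprocal lattice vectors. The bare Coulomb form is of course only the rough estimate made in the text.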

A  deeper  insight  into  this  failure  is  afforded  by  considering  the  nature  of  the  core 
and  valence  wave  functions.  The  core  wave  functions  are  appreciable  only  within  the 
immediate  vicinity  of  the  ion,  where  they  have  the  characteristic  oscillatory  form  of 
atomic  wave  functions  (Figure  11.2a).  These  oscillations  are  a  manifestation  of  the 


Figure 11.2

(a) Characteristic spatial dependence of a core wave function $\psi_{\mathbf{k}}^c(\mathbf{r})$. The curve shows Re $\psi$
against position along a line of ions. Note the characteristic atomic oscillations in the vicinity
of each ion. The dashed envelope of the atomic parts is sinusoidal, with wavelength $\lambda = 2\pi/k$.
Between lattice sites the wave function is negligibly small. (b) Characteristic spatial
dependence of a valence wave function $\psi_{\mathbf{k}}^v(\mathbf{r})$. The atomic oscillations are still present in the core
region. The wave function need not be at all small between lattice sites, but it is likely to be
slowly varying and plane-wavelike there.
high electronic kinetic energy within the core,6 which, in combination with the high
negative potential energy, produces the total energy of the core levels. Since valence
levels have higher total energies than core levels, within the core region, where they
experience the same large and negative potential energy as the core electrons, the
valence electrons must have even higher kinetic energies. Thus within the core
region the valence wave functions must be even more oscillatory than the core
wave functions.

This  conclusion  can  also  be  reached  by  an  apparently  different  argument: 

Eigenstates of the same Hamiltonian with different eigenvalues must be orthogonal.
In particular any valence wave function $\psi_{\mathbf{k}}^v(\mathbf{r})$ and any core wave function $\psi_{\mathbf{k}}^c(\mathbf{r})$
must satisfy:

$$0 = \int d\mathbf{r}\; \psi_{\mathbf{k}}^c(\mathbf{r})^*\, \psi_{\mathbf{k}}^v(\mathbf{r}). \tag{11.5}$$

Core wave functions are appreciable only within the immediate vicinity of the ion, so
the main contribution of this integral must come from the core region. It is enough
to consider the contribution to (11.5) from the core region of a single ion, since Bloch’s
theorem (8.3) requires the integrand to be the same from cell to cell. Within this
core region $\psi_{\mathbf{k}}^v(\mathbf{r})$ must have oscillations that carefully interlace with those of all the
$\psi_{\mathbf{k}}^c(\mathbf{r})$ in order to cause the integrals (11.5) to vanish for all core levels.

Either of these arguments leads to the conclusion that a valence wave function
should have the form pictured in Figure 11.2b. If, however, the valence wave functions
have  an  oscillatory  structure  on  the  scale  of  the  core  region,  a  Fourier  expansion  such 
as  (9.1)  must  contain  many  short  wavelength  plane  waves,  i.e.,  many  terms  with  large 
wave  vectors.  Thus  the  nearly  free  electron  method,  which  leads  to  an  approximate 
wave  function  composed  of  a  very  small  number  of  plane  waves,  must  be  untenable. 

In  one  way  or  another,  all  of  the  calculational  methods  now  in  use  are  attempts 
to  come  to  grips  with  the  necessity  for  reproducing  this  detailed,  atomic-like  structure 
of  the  valence  wave  functions  in  the  core  region,  while  facing  the  fact  that  the  valence 
levels  are  not  of  the  tight-binding  type,  and  therefore  have  appreciable  wave  functions 
in  the  interstitial  regions. 

THE  CELLULAR  METHOD 

The first serious attempt to calculate band structure (aside from Bloch’s original use
of the tight-binding method) was the cellular method of Wigner and Seitz.7 The
method begins by observing that because of the Bloch relation (8.6):

$$\psi_{\mathbf{k}}(\mathbf{r} + \mathbf{R}) = e^{i\mathbf{k}\cdot\mathbf{R}}\, \psi_{\mathbf{k}}(\mathbf{r}), \tag{11.6}$$

it is enough to solve the Schrödinger equation (11.1) within a single primitive cell
$C_0$. The wave function can then be determined via (11.6) in any other primitive cell
from its values in $C_0$.

However, not every solution to (11.1) within $C_0$ leads in this way to an acceptable
wave function for the entire crystal, since $\psi(\mathbf{r})$ and $\nabla\psi(\mathbf{r})$ must be continuous as r

6 The velocity operator is $(\hbar/mi)\nabla$, which means that the more rapidly a wave function varies in a
region, the greater the electronic velocity must be in that region.

7 E. P. Wigner and F. Seitz, Phys. Rev. 43, 804 (1933); 46, 509 (1934).
crosses the primitive cell boundary.8 Because of (11.6), this condition can be phrased
entirely in terms of the values of $\psi$ within and on the surface of $C_0$. It is this boundary
condition that introduces the wave vector k into the cellular solution, and eliminates
all solutions except those for a discrete set of energies, which are just the band energies
$\mathcal{E} = \mathcal{E}_n(\mathbf{k})$.

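Boundary conditions within $C_0$ are

$$\psi(\mathbf{r}) = e^{-i\mathbf{k}\cdot\mathbf{R}}\, \psi(\mathbf{r} + \mathbf{R}), \tag{11.7}$$

and

$$\hat{\mathbf{n}}(\mathbf{r}) \cdot \nabla\psi(\mathbf{r}) = -e^{-i\mathbf{k}\cdot\mathbf{R}}\, \hat{\mathbf{n}}(\mathbf{r} + \mathbf{R}) \cdot \nabla\psi(\mathbf{r} + \mathbf{R}), \tag{11.8}$$

where r and r + R are both points on the surface of the cell and $\hat{\mathbf{n}}$ is an outward
normal (see Problem 1).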

The  analytical  problem  is  therefore  to  solve  (11.1)  within  the  primitive  cell  C0 
subject  to  these  boundary  conditions.  To  preserve  the  symmetry  of  the  crystal,  one 
takes  the  primitive  cell  C0  to  be  the  Wigner-Seitz  primitive  cell  (Chapter  4)  centered 
on  the  lattice  point  R  =  0. 

The foregoing is an exact restatement of the problem. The first approximation of
the cellular method is the replacement of the periodic potential $U(\mathbf{r})$ within the
Wigner-Seitz primitive cell by a potential V(r) with spherical symmetry about the
origin (see Figure 11.3). One might, for example, choose V(r) to be the potential of


Figure 11.3

Equipotentials (i.e., curves of constant $U(\mathbf{r})$) within a primitive cell. For the actual crystal potential
these will have spherical symmetry near the center of the cell where the potential is dominated by
the contribution from the central ion. However, near the boundary of the cell the potential will
deviate substantially from spherical symmetry. The cellular method approximates the potential
by a spherically symmetric one everywhere within the cell, with equipotentials as shown on the
right.


8 If $\psi$ or $\nabla\psi$ were discontinuous at the cell boundary, then $\nabla^2\psi$ would have singularities (which are
either δ-functions or derivatives of δ-functions) on the boundary. Since there are no such terms occurring
in $U\psi$ on the boundary, the Schrödinger equation could not be satisfied.
a single ion at the origin, ignoring the fact that the neighbors of the origin will also
contribute to $U(\mathbf{r})$ within $C_0$, especially near its boundaries. This approximation is
made entirely for practical reasons, to render a difficult computational problem
more manageable.

Once a potential has been chosen spherically symmetric inside $C_0$, then within
the primitive cell a complete set of solutions to the Schrödinger equation (11.1) can
be found of the form9

$$\psi_{lm}(\mathbf{r}) = Y_{lm}(\theta, \phi)\, \chi_l(r), \tag{11.9}$$

where $Y_{lm}(\theta, \phi)$ are spherical harmonics and $\chi_l(r)$ satisfies the ordinary differential
equation

$$\chi_l''(r) + \frac{2}{r}\,\chi_l'(r) + \frac{2m}{\hbar^2} \left( \mathcal{E} - V(r) - \frac{\hbar^2}{2m}\, \frac{l(l+1)}{r^2} \right) \chi_l(r) = 0. \tag{11.10}$$

Given the potential V(r) and given any value of $\mathcal{E}$, there is a unique $\chi_{l,\mathcal{E}}$ that solves
(11.10) and is regular at the origin.10 These $\chi_{l,\mathcal{E}}$ can be calculated numerically, ordinary
differential equations being easy to handle on machines. Since any linear combination
of solutions to Schrödinger’s equation with the same energy is itself a solution,

$$\psi(\mathbf{r}, \mathcal{E}) = \sum_{lm} A_{lm}\, Y_{lm}(\theta, \phi)\, \chi_{l,\mathcal{E}}(r) \tag{11.11}$$

will solve (11.1) at energy $\mathcal{E}$ for arbitrary coefficients $A_{lm}$. However, (11.11) will only
yield an acceptable wave function for the crystal if it satisfies the boundary conditions
(11.7) and (11.8). It is in the imposition of these boundary conditions that the cellular
method makes its next major approximation.
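
The numerical step referred to here, finding the solution of (11.10) that is regular at the origin for a given V(r) and a given energy, can be sketched as follows. The choices below are assumptions of the sketch, not prescriptions from the text: units with $\hbar = m = 1$, the substitution $u(r) = r\chi_l(r)$ (which removes the first-derivative term and gives $u'' = [l(l+1)/r^2 + 2(V(r) - \mathcal{E})]\,u$), a fixed-step fourth-order Runge-Kutta integrator, and a start at the first grid point with the power-law behaviour $u \approx r^{l+1}$, which fixes an arbitrary overall normalization.

```python
import math

def radial_solution(potential, energy, l, r_max, h=1.0e-3):
    """Outward RK4 integration of u'' = [l(l+1)/r^2 + 2 (V(r) - E)] u.

    Returns (rs, chis), where chis holds chi_l(r) = u(r) / r on the grid rs.
    The overall normalization is arbitrary.
    """
    def accel(r, u):
        return (l * (l + 1) / (r * r) + 2.0 * (potential(r) - energy)) * u

    r = h
    u = r ** (l + 1)            # regular behaviour near the origin
    du = (l + 1) * r ** l
    rs, chis = [r], [u / r]
    n_steps = int(round((r_max - h) / h))
    for _ in range(n_steps):
        k1u, k1v = du, accel(r, u)
        k2u, k2v = du + 0.5 * h * k1v, accel(r + 0.5 * h, u + 0.5 * h * k1u)
        k3u, k3v = du + 0.5 * h * k2v, accel(r + 0.5 * h, u + 0.5 * h * k2u)
        k4u, k4v = du + h * k3v, accel(r + h, u + h * k3u)
        u += h * (k1u + 2.0 * k2u + 2.0 * k3u + k4u) / 6.0
        du += h * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        r += h
        rs.append(r)
        chis.append(u / r)
    return rs, chis

# Check against a known case: V = 0, l = 0, E = 1/2 gives chi(r) = sin(r) / r.
rs, chis = radial_solution(lambda r: 0.0, 0.5, 0, 3.0)
```

Note that the integration succeeds for any energy: nothing in (11.10) alone selects a discrete set of levels. The allowed energies appear only when the combination (11.11) is required to satisfy the boundary conditions (11.7) and (11.8).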

To begin with, one takes only as many terms in the expansion (11.11) as it is
calculationally convenient to handle.11 Since there is only a finite number of
coefficients in the expansion, we can, for a general cell, fit the boundary condition only
at a finite set of points on its surface. The imposition of this finite set of boundary
conditions (chosen to be as many as there are unknown coefficients) leads to a set
of k-dependent linear homogeneous equations for the $A_{lm}$, and the values of $\mathcal{E}$ for
which the determinant of these equations vanishes are the required energies $\mathcal{E}_n(\mathbf{k})$.


9 See, for example, D. Park, Introduction to the Quantum Theory, McGraw-Hill, New York, 1964, pp. 516-519, or any other book on quantum mechanics. There is, however, this important difference compared with the familiar atomic case: In atomic physics the boundary condition (that $\psi$ vanishes at infinity) is also spherically symmetric, and consequently a single term of the form (11.9) gives a stationary state (i.e., the angular momentum is a good quantum number). In the present case (except for the spherical cellular model described below) the boundary condition does not have spherical symmetry. Therefore the stationary wave functions will be of the form (11.11) with nonvanishing coefficients for several distinct l and m values; i.e., angular momentum will not be a good quantum number.

10 This assertion may be somewhat jarring to those who, from atomic physics, are used to the fact that only a discrete set of eigenvalues is found in any atomic problem, namely the energy levels of the atom for angular momentum l. This is because, in the atomic problem, we have the boundary condition that $\chi_l(r)$ vanishes as $r \to \infty$. Here we are only interested in $\chi_l$ within the Wigner-Seitz cell, and no such additional condition is required; ultimately the allowed values of $\mathcal{E}$ will be determined by the crystal boundary conditions (11.7) and (11.8). Imposing these does indeed lead back to a discrete set of energies: the $\mathcal{E}_n(\mathbf{k})$.

11 Comforted by the assurance that eventually the expansion must converge, since for high enough angular momenta l the wave function will be very small everywhere within the cell.
In this way one can search for the eigenvalues $\mathcal{E}_n(\mathbf{k})$ for each fixed k. Alternatively, one can fix $\mathcal{E}$, do a single numerical integration of (11.10), and then search for values of k for which the determinant vanishes. Provided that one has not been so unfortunate as to choose $\mathcal{E}$ in an energy gap, such values of k can always be found, and in this way the constant-energy surfaces can be mapped out.

Various  ingenious  techniques  have  been  used  to  minimize  the  mismatch  of  the 
wave  function  at  the  boundaries  due  to  the  fact  that  the  boundary  conditions  can 
only  be  imposed  at  a  finite  number  of  points;  such  cleverness,  and  the  ability  of 
computers  to  handle  large  determinants,  have  led  to  cellular  calculations  of  very  high 
accuracy,12  producing  band  structures  in  substantial  agreement  with  some  of  the 
other  methods  we  shall  describe. 

The most famous application of the cellular method is the original calculation by Wigner and Seitz of the lowest energy level in the valence band of sodium metal. Since the bottom of the band is at k = 0, the exponential factor disappears from the boundary conditions (11.7) and (11.8). Wigner and Seitz made the further approximation of replacing the Wigner-Seitz primitive cell by a sphere of radius $r_0$ with the same volume, thereby achieving a boundary condition with the same spherical symmetry as the potential $V(r)$. They could then consistently demand that the solution $\psi(\mathbf{r})$ itself have spherical symmetry, which requires that only the single term l = 0, m = 0 be retained in (11.11). Under these conditions the boundary conditions reduce to

$$\chi_0'(r_0) = 0. \qquad (11.12)$$

Thus the solutions to the single equation (11.10) for l = 0, subject to the boundary condition (11.12), give the spherically symmetric cellular wave functions and energies.
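The spherical cellular problem just described, equation (11.10) with l = 0 together with the condition (11.12), can be sketched numerically as follows. This is an illustration under stated assumptions, not the Wigner-Seitz calculation itself: the names `boundary_mismatch` and `wigner_seitz_energy`, the units hbar = m = 1, the Runge-Kutta integrator, and the bisection search are all choices made here, and the potential is whatever function the caller supplies.

```python
import math

def boundary_mismatch(V, energy, r0, n_steps=2000):
    """Return r0*u'(r0) - u(r0) for l = 0, where u = r*chi_0.

    Since chi_0' = (r u' - u) / r^2, this quantity vanishes exactly when
    the cellular boundary condition chi_0'(r0) = 0 of (11.12) holds.
    """
    h = r0 / n_steps
    r, u, du = h, h, 1.0                 # regular start: u ~ r near the origin

    def accel(r, u):
        return 2.0 * (V(r) - energy) * u  # u'' for l = 0, units hbar = m = 1

    for _ in range(n_steps - 1):
        k1u, k1d = du, accel(r, u)
        k2u, k2d = du + 0.5 * h * k1d, accel(r + 0.5 * h, u + 0.5 * h * k1u)
        k3u, k3d = du + 0.5 * h * k2d, accel(r + 0.5 * h, u + 0.5 * h * k2u)
        k4u, k4d = du + h * k3d, accel(r + h, u + h * k3u)
        u += h * (k1u + 2 * k2u + 2 * k3u + k4u) / 6.0
        du += h * (k1d + 2 * k2d + 2 * k3d + k4d) / 6.0
        r += h
    return r * du - u

def wigner_seitz_energy(V, r0, e_lo, e_hi, tol=1e-9):
    """Bisect on the energy until the mismatch changes sign within tol."""
    f_lo = boundary_mismatch(V, e_lo, r0)
    f_hi = boundary_mismatch(V, e_hi, r0)
    if f_lo * f_hi > 0:
        raise ValueError("bracket does not enclose a sign change")
    while e_hi - e_lo > tol:
        e_mid = 0.5 * (e_lo + e_hi)
        f_mid = boundary_mismatch(V, e_mid, r0)
        if f_lo * f_mid <= 0:
            e_hi, f_hi = e_mid, f_mid
        else:
            e_lo, f_lo = e_mid, f_mid
    return 0.5 * (e_lo + e_hi)

if __name__ == "__main__":
    # Empty-lattice check: with V = 0 the lowest root is the free electron band bottom.
    print(wigner_seitz_energy(lambda r: 0.0, 2.0, -0.3, 0.2))
```

With V = 0 the lowest root is at zero energy (a constant wave function), and the next one follows from tan(x) = x with x = K r0, which gives a simple way to validate the routine before a realistic ionic potential is supplied.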

Note that the problem has the same form as an atomic problem except that the atomic boundary condition (that the wave function vanish at infinity) is replaced by the cellular boundary condition (that the wave function have a vanishing radial derivative at $r_0$). The $3s^1$ atomic and cellular wave functions are plotted together in Figure 11.4. Note that the cellular wave function is larger than the atomic one in the interstitial region, but differs from it very little in the core region.


Figure  11.4 

Comparison of $3s^1$ cellular (solid curve) and atomic (dashed curve) wave functions for sodium.


12 Notably by S. L. Altmann and co-workers (see Proc. Roy. Soc. A244, 141, 153 (1958)).
There are perhaps two major difficulties with the cellular method:

1.  The  computational  difficulties  involved  in  numerically  satisfying  a  boundary 
condition  over  the  surface  of  the  Wigner-Seitz  primitive  cell,  a  fairly  complex 
polyhedral  structure. 

2.  The  physically  questionable  point  of  whether  a  potential  representing  an  isolated 
ion  is  the  best  approximation  to  the  correct  potential  within  the  entire  Wigner- 
Seitz  primitive  cell.  In  particular,  the  potential  used  in  the  cellular  calculations 
has  a  discontinuous  derivative  whenever  the  boundary  between  two  cells  is 
crossed  (Figure  11.5),  whereas  in  actual  fact  the  potential  is  quite  flat  in  such 
regions. 


Figure 11.5

The cellular method potential has a discontinuous derivative midway between lattice points, but the actual potential is quite flat there. (The figure's legend distinguishes the cellular method potential, the actual crystal potential, the individual atomic potentials, and the cell boundaries.)

A potential that overcomes both objections is the muffin-tin potential, which is taken to represent an isolated ion within a sphere of specified radius $r_0$ about each lattice point, and taken to be zero (i.e., constant) elsewhere (with $r_0$ chosen small enough that the spheres do not overlap). (See Figure 11.6.) The muffin-tin potential mitigates both problems, being flat in the interstitial regions, and leading to matching conditions on a spherical rather than a polyhedral surface.

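Formally, the muffin-tin potential can be defined (for all R) by:

$$U(\mathbf{r}) = \begin{cases} V(|\mathbf{r} - \mathbf{R}|), & \text{when } |\mathbf{r} - \mathbf{R}| < r_0 \text{ (the core or atomic region)}, \\ V(r_0) = 0, & \text{when } |\mathbf{r} - \mathbf{R}| > r_0 \text{ (the interstitial region)}, \end{cases} \qquad (11.13)$$

where $r_0$ is less than half the nearest-neighbor distance.13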
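The definition (11.13) translates almost directly into code. The sketch below is an illustration only: it assumes a simple cubic Bravais lattice of side `a` (the text does not fix a lattice), and the function name `muffin_tin` and its arguments are hypothetical. The supplied ionic potential is shifted by its value at r0 so that the convention V(r0) = 0 of (11.13) holds for any input.

```python
import math

def muffin_tin(point, V, r0, a=1.0):
    """Muffin-tin potential (11.13) at `point` for a simple cubic lattice of side a.

    V is the ionic potential as a function of distance; it is shifted so that
    the result is continuous and zero on the sphere of radius r0.
    """
    if r0 >= 0.5 * a:
        raise ValueError("r0 must be less than half the nearest-neighbor distance")
    # Because the spheres do not overlap, only the nearest lattice point R
    # can satisfy |r - R| < r0; find it coordinate by coordinate.
    d2 = 0.0
    for x in point:
        dx = x - a * round(x / a)
        d2 += dx * dx
    d = math.sqrt(d2)
    if d < r0:
        return V(d) - V(r0)   # core (atomic) region
    return 0.0                # interstitial region

if __name__ == "__main__":
    coulomb = lambda d: -1.0 / d          # stand-in ionic potential (assumption)
    print(muffin_tin((0.1, 0.0, 0.0), coulomb, 0.4))   # inside a sphere
    print(muffin_tin((0.5, 0.5, 0.5), coulomb, 0.4))   # interstitial, gives 0.0
```

For a lattice other than simple cubic, only the search for the nearest lattice point would change; the two-branch structure is the content of (11.13).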


13 Frequently $r_0$ is taken to be half the nearest-neighbor distance; i.e., the sphere is the inscribed sphere in the Wigner-Seitz cell. There are minor technical complications in the analysis in that case, which we avoid by requiring $r_0$ to be less than that distance.
Figure 11.6

(a) The muffin-tin potential, plotted along a line of ions. (b) The muffin-tin potential is constant (zero) in the interstitial regions and represents an isolated ion in each core region.

If we agree that the function $V(r)$ is zero when its argument exceeds $r_0$, then we can write $U(\mathbf{r})$ very simply as

$$U(\mathbf{r}) = \sum_{\mathbf{R}} V(|\mathbf{r} - \mathbf{R}|). \qquad (11.14)$$

Two methods are in wide use for computing the bands in a muffin-tin potential: the augmented plane-wave (APW) method and the method of Korringa, Kohn, and Rostoker (KKR).

THE  AUGMENTED  PLANE-WAVE  METHOD  (APW) 

This approach, due to J. C. Slater,14 represents $\psi_{\mathbf{k}}(\mathbf{r})$ as a superposition of a finite number of plane waves in the flat interstitial region, while forcing it to have a more rapid oscillatory atomic behavior in the core region. This is achieved by expanding $\psi_{\mathbf{k},\mathcal{E}}$ in a set of augmented plane waves.15 The APW $\phi_{\mathbf{k},\mathcal{E}}$ is defined as follows:

1. $\phi_{\mathbf{k},\mathcal{E}} = e^{i\mathbf{k}\cdot\mathbf{r}}$ in the interstitial region. It is important to note that there is no constraint relating $\mathcal{E}$ and k (such as, for example, $\mathcal{E} = \hbar^2k^2/2m$). One can define


14 Phys. Rev. 51, 846 (1937).

15 We append the energy of a level as an additional subscript when its explicit specification helps to avoid possible ambiguities.
an APW for any energy $\mathcal{E}$ and any wave vector k. Thus any single APW does not satisfy the crystal Schrödinger equation for energy $\mathcal{E}$ in the interstitial region.

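2. $\phi_{\mathbf{k},\mathcal{E}}$ is continuous at the boundary between atomic and interstitial regions.

3. In the atomic region about R, $\phi_{\mathbf{k},\mathcal{E}}$ does satisfy the atomic Schrödinger equation:

$$-\frac{\hbar^2}{2m}\nabla^2\phi_{\mathbf{k},\mathcal{E}}(\mathbf{r}) + V(|\mathbf{r} - \mathbf{R}|)\,\phi_{\mathbf{k},\mathcal{E}}(\mathbf{r}) = \mathcal{E}\,\phi_{\mathbf{k},\mathcal{E}}(\mathbf{r}), \qquad |\mathbf{r} - \mathbf{R}| < r_0. \qquad (11.15)$$

Since k does not appear in this equation, $\phi_{\mathbf{k},\mathcal{E}}$ gets its k dependence only via the boundary condition (2) and the k dependence determined by (1) in the interstitial region.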

It can be shown that these conditions determine a unique APW $\phi_{\mathbf{k},\mathcal{E}}$ for all k and $\mathcal{E}$. Note that in the interstitial region the APW satisfies not (11.15) but $H\phi_{\mathbf{k},\mathcal{E}} = (\hbar^2k^2/2m)\,\phi_{\mathbf{k},\mathcal{E}}$. Note also that, in general, $\phi_{\mathbf{k},\mathcal{E}}$ will have a discontinuous derivative on the boundary between interstitial and atomic regions, so that $\nabla^2\phi_{\mathbf{k},\mathcal{E}}$ will have delta-function singularities there.

The APW method tries to approximate the correct solution to the crystal Schrödinger equation (11.1) by a superposition of APW's, all with the same energy. For any reciprocal lattice vector K the APW $\phi_{\mathbf{k}+\mathbf{K},\mathcal{E}}$ satisfies the Bloch condition with wave vector k (Problem 2), and therefore the expansion of $\psi_{\mathbf{k}}(\mathbf{r})$ will be of the form

$$\psi_{\mathbf{k}}(\mathbf{r}) = \sum_{\mathbf{K}} c_{\mathbf{K}}\,\phi_{\mathbf{k}+\mathbf{K},\mathcal{E}(\mathbf{k})}(\mathbf{r}), \qquad (11.16)$$

where the sum is over reciprocal lattice vectors.

By taking the energy of the APW to be the actual energy of the Bloch level, we guarantee that $\psi_{\mathbf{k}}(\mathbf{r})$ satisfies the crystal Schrödinger equation in the atomic regions. The hope is that not too many augmented plane waves will suffice to approximate the solutions to the full Schrödinger equation in the interstitial region16 and at the boundary. In practice, as many as a hundred APW's can be used; by the time this stage is reached, $\mathcal{E}(\mathbf{k})$ does not change appreciably when more APW's are added, and one feels with some confidence that good convergence has been achieved.

Because each APW has a discontinuous derivative at the boundary of the atomic and interstitial regions, it is best to work not with the Schrödinger equation but with an equivalent variational principle:

Given any differentiable (but not necessarily twice differentiable)17 function $\psi(\mathbf{r})$, define the energy functional:

$$E[\psi] = \frac{\displaystyle\int \left( \frac{\hbar^2}{2m}|\nabla\psi(\mathbf{r})|^2 + U(\mathbf{r})\,|\psi(\mathbf{r})|^2 \right) d\mathbf{r}}{\displaystyle\int |\psi(\mathbf{r})|^2\, d\mathbf{r}}. \qquad (11.17)$$


16 The reader is warned not to fall into the trap of thinking that the exact solutions to $-(\hbar^2/2m)\nabla^2\psi = \mathcal{E}\psi$ in the region of complex shape, where the muffin-tin potential is flat, must be linear combinations of plane waves $e^{i\mathbf{k}\cdot\mathbf{r}}$ with $\mathcal{E} = \hbar^2k^2/2m$.

17 The function $\psi$ may have a kink where $\nabla\psi$ is discontinuous.
It can be shown18 that a solution to the Schrödinger equation (11.1) satisfying the Bloch condition with wave vector k and energy $\mathcal{E}(\mathbf{k})$ makes (11.17) stationary with respect to differentiable functions $\psi(\mathbf{r})$ that satisfy the Bloch condition with wave vector k. The value of $E[\psi_{\mathbf{k}}]$ is just the energy $\mathcal{E}(\mathbf{k})$ of the level $\psi_{\mathbf{k}}$.
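The advantage of the functional (11.17) for functions with kinks can be seen in a one-dimensional toy problem. The sketch below is an assumption-laden illustration, not part of the APW method: it uses a particle in a box of unit length with hard walls (rather than Bloch boundary conditions), units hbar = m = 1, real trial functions, and hypothetical names such as `energy_functional` and `tent`. The tent function has a kink at the midpoint, so its second derivative contains a delta function, yet the functional, which involves only first derivatives, is perfectly finite.

```python
import math

def energy_functional(psi, U, a, b, n=20000):
    """One-dimensional analogue of (11.17) for a real function psi, hbar = m = 1.

    The gradient is evaluated cell by cell, so a kink located on a cell
    boundary causes no difficulty.
    """
    h = (b - a) / n
    kinetic = potential = norm = 0.0
    for i in range(n):
        x_left, x_right = a + i * h, a + (i + 1) * h
        x_mid = 0.5 * (x_left + x_right)
        slope = (psi(x_right) - psi(x_left)) / h   # gradient inside this cell
        value = psi(x_mid)
        kinetic += 0.5 * slope * slope * h
        potential += U(x_mid) * value * value * h
        norm += value * value * h
    return (kinetic + potential) / norm

def tent(x):
    """Continuous trial function with a kink at x = 1/2."""
    return x if x < 0.5 else 1.0 - x

def exact_ground_state(x):
    return math.sin(math.pi * x)

def zero_potential(x):
    return 0.0

if __name__ == "__main__":
    print(energy_functional(tent, zero_potential, 0.0, 1.0))                # close to 6
    print(energy_functional(exact_ground_state, zero_potential, 0.0, 1.0))  # close to pi^2/2
```

The kinked function gives an energy of 6, above the exact value pi^2/2 of about 4.93, as expected for a trial function in a variational estimate of the lowest level.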

The variational principle is exploited by using the APW expansion (11.16) to calculate $E[\psi_{\mathbf{k}}]$. This leads to an approximation to $\mathcal{E}(\mathbf{k}) = E[\psi_{\mathbf{k}}]$ that depends on the coefficients $c_{\mathbf{K}}$. The demand that $E[\psi_{\mathbf{k}}]$ be stationary leads to the conditions $\partial E/\partial c_{\mathbf{K}} = 0$, which are a set of homogeneous equations in the $c_{\mathbf{K}}$. The coefficients in this set of equations depend on the sought-for energy $\mathcal{E}(\mathbf{k})$, both through the $\mathcal{E}(\mathbf{k})$ dependence of the APW's and because the value of $E[\psi_{\mathbf{k}}]$ at the stationary point is $\mathcal{E}(\mathbf{k})$. Setting the determinant of these coefficients equal to zero gives an equation whose roots determine the $\mathcal{E}(\mathbf{k})$.
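The step from stationarity of the functional to a vanishing determinant can be illustrated with a deliberately small example. What follows is a toy under stated assumptions, not an APW calculation: a particle in a box of unit length (hbar = m = 1, zero potential) is expanded in two fixed polynomial trial functions, and all names (`integrate`, `BASIS`, `secular_energies`) are hypothetical. Because the trial functions are not orthogonal, the stationarity conditions take the form of a determinant of H - E S, with H the matrix of the numerator of (11.17) and S the overlap matrix.

```python
import math

def integrate(f, a=0.0, b=1.0, n=4000):
    """Midpoint-rule quadrature on [a, b]."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

# Two trial functions vanishing at x = 0 and x = 1, and their derivatives.
BASIS = [lambda x: x * (1 - x), lambda x: (x * (1 - x)) ** 2]
DBASIS = [lambda x: 1 - 2 * x, lambda x: 2 * x * (1 - x) * (1 - 2 * x)]

def secular_energies():
    """Roots of det(H - E S) = 0 for the 2 x 2 problem, lowest first."""
    H = [[integrate(lambda x: 0.5 * DBASIS[i](x) * DBASIS[j](x))
          for j in range(2)] for i in range(2)]
    S = [[integrate(lambda x: BASIS[i](x) * BASIS[j](x))
          for j in range(2)] for i in range(2)]
    # The determinant is a quadratic a E^2 + b E + c in the energy.
    a = S[0][0] * S[1][1] - S[0][1] ** 2
    b = -(H[0][0] * S[1][1] + H[1][1] * S[0][0] - 2 * H[0][1] * S[0][1])
    c = H[0][0] * H[1][1] - H[0][1] ** 2
    root = math.sqrt(b * b - 4 * a * c)
    return (-b - root) / (2 * a), (-b + root) / (2 * a)

if __name__ == "__main__":
    lowest, second = secular_energies()
    print(lowest, math.pi ** 2 / 2)   # the lowest root lies just above pi^2/2
```

In this toy the matrix elements are independent of the energy, so the determinant is a polynomial in E. In the APW case the basis functions themselves depend on the energy, so the roots have to be located by a numerical search, as the text describes.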

As  in  the  cellular  case,  it  is  often  preferable  to  work  with  a  set  of  APW’s  of  definite 
energy  and  search  for  the  k  at  which  the  secular  determinant  vanishes,  thereby 
mapping  out  the  constant  energy  surfaces  in  k-space.  With  modern  computing 
techniques  it  appears  possible  to  include  enough  augmented  plane  waves  to  achieve 
excellent  convergence,19  and  the  APW  method  is  one  of  the  more  successful  schemes 
for  calculating  band  structure.20 

In  Figure  11.7  we  show  portions  of  the  energy  bands  for  a  few  metallic  elements, 
as  calculated  by  L.  F.  Mattheiss  using  the  APW  method.  One  of  the  interesting  results 
of  this  analysis  is  the  extent  to  which  the  bands  in  zinc,  which  has  a  filled  atomic 
d-shell,  resemble  the  free  electron  bands.  A  comparison  of  Mattheiss’  curves  for 
titanium  with  the  cellular  calculations  by  Altmann  (Figure  11.8)  should,  however, 
instill  a  healthy  sense  of  caution:  Although  there  are  recognizable  similarities,  there 
are  quite  noticeable  differences.  These  are  probably  due  more  to  the  differences  in 
choice  of  potential  than  to  the  validity  of  the  calculation  methods,  but  they  serve  to 
indicate  that  one  should  be  wary  in  using  the  results  of  first  principles  band-structure 
calculations. 

THE  GREEN’S  FUNCTION  METHOD  OF  KORRINGA,  KOHN,  AND 
ROSTOKER  (KKR) 

An  alternative  approach  to  the  muffin-tin  potential  is  provided  by  a  method  due  to 
Korringa  and  to  Kohn  and  Rostoker.21  This  starts  from  the  integral  form  of  the 
Schrodinger  equation22 

18  For  a  simple  proof  (and  a  more  detailed  statement  of  the  variational  principle)  see  Appendix  G. 

19 In some cases a very small number of APW's may suffice to give reasonable convergence for much the same reasons as in the case of the orthogonalized plane wave and pseudopotential methods, discussed below.

20 Complete details on the method along with sample computer programs may even be found in textbook form: T. L. Loucks, Augmented Plane Wave Method, W. A. Benjamin, Menlo Park, California, 1967.

21  J.  Korringa,  Physica  13,  392  (1947);  W.  Kohn  and  N.  Rostoker,  Phys.  Rev.  94, 1111  (1954). 

22 Equation (11.18) is the starting point for the elementary theory of scattering. That it is equivalent to the ordinary Schrödinger equation (11.1) follows from the fact (Chapter 17, Problem 3) that G satisfies $(\mathcal{E} + \hbar^2\nabla^2/2m)\,G(\mathbf{r} - \mathbf{r}') = \delta(\mathbf{r} - \mathbf{r}')$. For an elementary discussion of these facts see, for example, D. S. Saxon, Elementary Quantum Mechanics, Holden-Day, San Francisco, 1968, p. 360 et seq. In scattering theory it is customary to include an inhomogeneous term $e^{i\mathbf{k}\cdot\mathbf{r}}$ in (11.18), where $\hbar k = \sqrt{2m\mathcal{E}}$, to satisfy the boundary condition appropriate to an incoming plane wave. Here, however, the boundary condition is the Bloch relation, which is satisfied by (11.18) without an inhomogeneous term.
Figure 11.7

APW energy bands for iron, copper, and zinc, calculated by L. F. Mattheiss, Phys. Rev. 134, A970 (1964). The bands are plotted from the origin of k-space to the points indicated on the zone surfaces. Note the striking resemblance between the calculated bands of zinc and the free electron bands (pictured to the right). Zinc has two s-electrons outside of a closed-shell configuration. The horizontal dashed lines mark the Fermi energy.


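$$\psi_{\mathbf{k}}(\mathbf{r}) = \int d\mathbf{r}'\, G_{\mathcal{E}(\mathbf{k})}(\mathbf{r} - \mathbf{r}')\,U(\mathbf{r}')\,\psi_{\mathbf{k}}(\mathbf{r}'), \qquad (11.18)$$

where the integral is over all space and

$$G_{\mathcal{E}}(\mathbf{r} - \mathbf{r}') = -\frac{2m}{\hbar^2}\,\frac{e^{iK|\mathbf{r} - \mathbf{r}'|}}{4\pi|\mathbf{r} - \mathbf{r}'|}, \qquad K = \begin{cases} \sqrt{2m\mathcal{E}/\hbar^2}, & \mathcal{E} > 0, \\ i\sqrt{2m(-\mathcal{E})/\hbar^2}, & \mathcal{E} < 0. \end{cases} \qquad (11.19)$$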
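The free-space Green's function (11.19) is simple enough to evaluate and check directly. The sketch below is illustrative only; the function name `free_green`, the default units hbar = m = 1, and the finite-difference check are assumptions made here. The complex square root picks out both branches of K automatically: real for positive energy, and a positive multiple of i for negative energy.

```python
import cmath
import math

def free_green(r, energy, hbar=1.0, m=1.0):
    """G_E of (11.19) as a function of the separation r = |r - r'| > 0."""
    # cmath.sqrt returns a real K for energy > 0 and +i|K| for energy < 0.
    K = cmath.sqrt(2.0 * m * energy) / hbar
    return -(2.0 * m / hbar ** 2) * cmath.exp(1j * K * r) / (4.0 * math.pi * r)

def residual(energy, r, h=1e-3):
    """(E + (1/2) Laplacian) G away from the origin, by finite differences.

    For a function of r alone the Laplacian is (1/r) d^2(r G)/dr^2; the
    result should be close to zero for r > 0 (units hbar = m = 1).
    """
    g = lambda s: s * free_green(s, energy)
    laplacian = (g(r + h) - 2.0 * g(r) + g(r - h)) / (h * h) / r
    return energy * free_green(r, energy) + 0.5 * laplacian

if __name__ == "__main__":
    print(abs(residual(2.0, 1.3)))    # small: oscillating branch
    print(abs(residual(-1.5, 0.8)))   # small: decaying branch
```

For negative energy the function is real and decays exponentially with distance, which is why the lattice sum that appears later converges most readily in that regime.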


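Substituting the form (11.14) for the muffin-tin potential into (11.18), and making the change of variables $\mathbf{r}'' = \mathbf{r}' - \mathbf{R}$ in each term of the resulting sum, we can rewrite (11.18) as

$$\psi_{\mathbf{k}}(\mathbf{r}) = \sum_{\mathbf{R}} \int d\mathbf{r}''\, G_{\mathcal{E}(\mathbf{k})}(\mathbf{r} - \mathbf{r}'' - \mathbf{R})\,V(r'')\,\psi_{\mathbf{k}}(\mathbf{r}'' + \mathbf{R}). \qquad (11.20)$$

The Bloch condition gives $\psi_{\mathbf{k}}(\mathbf{r}'' + \mathbf{R}) = e^{i\mathbf{k}\cdot\mathbf{R}}\,\psi_{\mathbf{k}}(\mathbf{r}'')$, and we can therefore rewrite (11.20) (replacing $\mathbf{r}''$ by $\mathbf{r}'$):

$$\psi_{\mathbf{k}}(\mathbf{r}) = \int d\mathbf{r}'\, \mathcal{G}_{\mathbf{k},\mathcal{E}(\mathbf{k})}(\mathbf{r} - \mathbf{r}')\,V(r')\,\psi_{\mathbf{k}}(\mathbf{r}'), \qquad (11.21)$$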
Figure 11.8

Three calculated band structures for titanium. Curves (a) and (b) were calculated by the cellular method for two possible potentials. They are taken from S. L. Altmann, in Soft X-Ray Band Spectra, D. Fabian (ed.), Academic Press, London, 1968. Curve (c) is from the APW calculation of Mattheiss.

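where

$$\mathcal{G}_{\mathbf{k},\mathcal{E}}(\mathbf{r} - \mathbf{r}') = \sum_{\mathbf{R}} G_{\mathcal{E}}(\mathbf{r} - \mathbf{r}' - \mathbf{R})\,e^{i\mathbf{k}\cdot\mathbf{R}}. \qquad (11.22)$$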

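Equation (11.21) has the pleasing feature that all of the dependence on both wave vector k and crystal structure is contained in the function $\mathcal{G}_{\mathbf{k},\mathcal{E}}$, which can be calculated, once and for all, for a variety of crystal structures for specified values of $\mathcal{E}$ and k.23 It is shown in Problem 3 that Eq. (11.21) implies that on the sphere of radius $r_0$, the values of $\psi_{\mathbf{k}}$ are constrained to satisfy the following integral equation:

$$0 = \int d\Omega' \left[ \mathcal{G}_{\mathbf{k},\mathcal{E}(\mathbf{k})}(r_0\theta\phi,\, r_0\theta'\phi')\,\frac{\partial}{\partial r}\psi_{\mathbf{k}}(r\theta'\phi')\Big|_{r=r_0} - \psi_{\mathbf{k}}(r_0\theta'\phi')\,\frac{\partial}{\partial r}\mathcal{G}_{\mathbf{k},\mathcal{E}(\mathbf{k})}(r_0\theta\phi,\, r\theta'\phi')\Big|_{r=r_0} \right]. \qquad (11.23)$$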


23 To do the R-sum one uses the same techniques as in calculations of the lattice energies of ionic crystals (Chapter 20).
Since the function $\psi_{\mathbf{k}}$ is continuous, it retains the form determined by the atomic problem (Eqs. (11.9) to (11.11)) at $r_0$. The approximation of the KKR method (which is exact for the muffin-tin potential up to this point) is to assume that $\psi_{\mathbf{k}}$ will be given to a reasonable degree of accuracy by keeping only a finite number (say N) of spherical harmonics in the expansion (11.11). By placing this truncated expansion in (11.23), multiplying by $Y_{lm}(\theta, \phi)$, and integrating the result over the solid angle $d\theta\, d\phi$ for all l and m appearing in the truncated expansion, we obtain a set of N linear equations for the $A_{lm}$ appearing in the expansion (11.11). The coefficients in these equations depend on $\mathcal{E}(\mathbf{k})$ and k through $\mathcal{G}_{\mathbf{k},\mathcal{E}(\mathbf{k})}$ and through the radial wave function $\chi_{l,\mathcal{E}}$ and its derivative $\chi_{l,\mathcal{E}}'$. Setting the N × N determinant of the coefficients equal to zero once again gives an equation determining the relation between $\mathcal{E}$ and k. As in the methods described earlier, one can either search for values of $\mathcal{E}$ giving a solution for fixed k, or fix $\mathcal{E}$ and map out the surface in k-space at which the determinant vanishes, which will then give the constant-energy surface $\mathcal{E}(\mathbf{k}) = \mathcal{E}$.

Both the KKR and APW methods can be regarded as techniques which, if carried out exactly for the muffin-tin potential, would lead to infinite-order determinantal conditions. These are then approximated by taking only a finite subdeterminant. In the APW method the truncation is in K; the wave function is approximated in the interstitial region. In KKR, on the other hand, the sum over all K is effectively performed when $\mathcal{G}_{\mathbf{k},\mathcal{E}}$ is computed.24 Instead, the approximation is in the form of the wave function in the atomic region. In both cases the procedure converges well if sufficiently many terms are retained; in practice the KKR method appears to require fewer terms in the spherical harmonic expansion than the APW technique requires in the K expansion. When the APW and KKR methods are applied to the same muffin-tin potential, they give results in substantial agreement.

The results of a KKR calculation for the $3s^2$ and $3p^1$ derived bands of aluminum are displayed in Figure 11.9. Note the extraordinary resemblance of the calculated


Figure 11.9

Calculated valence bands for aluminum (three electrons outside of a closed-shell neon configuration) compared with free electron bands (dashed lines). The bands are computed by the KKR method. (B. Segall, Phys. Rev. 124, 1797 (1961).)


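24 It is not necessary to calculate $\mathcal{G}_{\mathbf{k},\mathcal{E}}$ for all values of r, but only the integrals

$$\int d\Omega\, d\Omega'\, Y_{lm}^*(\theta\phi)\,\mathcal{G}_{\mathbf{k},\mathcal{E}}(r_0\theta\phi,\, r_0\theta'\phi')\,Y_{l'm'}(\theta'\phi')$$

and

$$\int d\Omega\, d\Omega'\, Y_{lm}^*(\theta\phi)\,\frac{\partial}{\partial r}\mathcal{G}_{\mathbf{k},\mathcal{E}}(r_0\theta\phi,\, r\theta'\phi')\Big|_{r=r_0}\,Y_{l'm'}(\theta'\phi').$$

These have been tabulated for various crystal structures over a range of $\mathcal{E}$ and k values, with $r_0$ usually taken as the radius of a sphere inscribed in a Wigner-Seitz cell.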
bands to the free electron levels, plotted as dashed lines in the same figure. The only
discernible  effects  of  the  interaction  between  electrons  and  ions  are,  as  predicted  by 
nearly  free  electron  theory,  to  split  the  band  degeneracies.  This  is  a  striking  illustration 
of  our  observation  (see  page  152)  that  metals  whose  atomic  configuration  consists 
of  a  small  number  of  s  and  p  electrons  outside  of  a  rare-gas  configuration  have  band 
structures  that  can  be  reproduced  very  well  by  the  nearly  free  electron  bands.  The 
next  two  methods  to  be  discussed  attempt  to  shed  some  light  on  this  remarkable  fact. 


THE  ORTHOGONALIZED  PLANE-WAVE  METHOD  (OPW) 

An  alternative  method  of  combining  rapid  oscillations  in  the  ion  core  region  with 
plane-wavelike  behavior  interstitially,  is  the  method  of  orthogonalized  plane  waves, 
due  to  Herring.25  The  OPW  method  does  not  require  a  muffin-tin  potential  to  make 
calculations  feasible,  and  is  therefore  of  particular  value  if  one  insists  on  using  an 
undoctored  potential.  In  addition,  the  method  affords  some  insight  into  why  the 
nearly  free  electron  approximation  does  so  remarkably  well  in  predicting  the  band 
structures  of  a  variety  of  metals. 

We  begin  by  explicitly  distinguishing  between  the  core  electrons  and  the  valence 
electrons.  The  core  wave  functions  are  well  localized  about  the  lattice  sites.  The 
valence  electrons,  on  the  other  hand,  can  be  found  with  appreciable  probability  in 
the  interstitial  regions,  where  our  hope  is  that  their  wave  functions  will  be  well 
approximated by a very small number of plane waves. Throughout this and the next
section  we  shall  affix  superscripts  c  or  v  to  wave  functions  to  indicate  whether  they 
describe  core  or  valence  levels. 

The difficulty with approximating a valence wave function by a few plane waves everywhere in space (as in the nearly free electron method) is that this hopelessly fails to produce the rapid oscillatory behavior required in the core region. Herring noted that this could be taken care of by using not simple plane waves, but plane waves orthogonalized to the core levels right from the start. Thus we define the orthogonalized plane wave (OPW) $\phi_{\mathbf{k}}$ by:

$$\phi_{\mathbf{k}} = e^{i\mathbf{k}\cdot\mathbf{r}} + \sum_c b_c\,\psi_{\mathbf{k}}^c(\mathbf{r}), \qquad (11.24)$$

where the sum is over all core levels with Bloch wave vector k. The core wave functions are assumed to be known (generally they are taken to be tight-binding combinations of calculated atomic levels), and the constants $b_c$ are determined by requiring that $\phi_{\mathbf{k}}$ be orthogonal to every core level:26

$$\int d\mathbf{r}\, \psi_{\mathbf{k}}^{c*}(\mathbf{r})\,\phi_{\mathbf{k}}(\mathbf{r}) = 0, \qquad (11.25)$$

which implies that

$$b_c = -\int d\mathbf{r}\, \psi_{\mathbf{k}}^{c*}(\mathbf{r})\,e^{i\mathbf{k}\cdot\mathbf{r}}. \qquad (11.26)$$
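The construction (11.24) to (11.26) can be tried out in one dimension. The following sketch is an illustration under several assumptions that are not in the text: a single isolated site rather than a Bloch sum over the lattice, two real, orthonormal, Gaussian-type functions standing in for the core levels, and hypothetical names (`make_core_levels`, `make_opw`). It computes the coefficients b_c from (11.26) and returns the orthogonalized plane wave as a function.

```python
import cmath
import math

def integrate(f, a, b, n=4000):
    """Midpoint-rule quadrature on [a, b]; works for complex integrands."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

def make_core_levels(width=0.3):
    """Two orthonormal localized functions used as stand-in core levels."""
    norm = (math.pi * width ** 2) ** -0.25
    def s_like(x):
        return norm * math.exp(-x * x / (2 * width ** 2))
    def p_like(x):
        return norm * math.sqrt(2.0) * (x / width) * math.exp(-x * x / (2 * width ** 2))
    return [s_like, p_like]

def make_opw(k, cores, half_length=5.0):
    """Return (phi, coeffs): the OPW of (11.24) and the b_c of (11.26)."""
    # The core functions are real here, so complex conjugation is trivial.
    coeffs = [-integrate(lambda x, c=c: c(x) * cmath.exp(1j * k * x),
                         -half_length, half_length) for c in cores]
    def phi(x):
        return cmath.exp(1j * k * x) + sum(b * c(x) for b, c in zip(coeffs, cores))
    return phi, coeffs

if __name__ == "__main__":
    cores = make_core_levels()
    phi, coeffs = make_opw(1.7, cores)
    for c in cores:
        print(abs(integrate(lambda x: c(x) * phi(x), -5.0, 5.0)))  # overlaps near zero
    print(phi(4.0), cmath.exp(1j * 1.7 * 4.0))                     # plane wave far away
```

The two printed overlaps confirm (11.25), and the last line shows that away from the core the OPW is indistinguishable from the plane wave, while near the origin it is strongly modified.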


25 C. Herring, Phys. Rev. 57, 1169 (1940).

26 We assume the normalization condition $\int d\mathbf{r}\, |\psi_{\mathbf{k}}^c|^2 = 1$. Note that $\phi_{\mathbf{k}}$ is also orthogonal to $\psi_{\mathbf{k}'}^c$ with $\mathbf{k}' \neq \mathbf{k}$ because of the Bloch condition.
The OPW $\phi_{\mathbf{k}}$ has the following properties characteristic of valence level wave functions:

1. By explicit construction it is orthogonal to all the core levels. It therefore also has the required rapid oscillations in the core region. This is particularly evident from (11.24), since the core wave functions $\psi_{\mathbf{k}}^c(\mathbf{r})$ appearing in $\phi_{\mathbf{k}}$ themselves oscillate in the core region.

2. Because the core levels are localized about lattice points, the second term in (11.24) is small in the interstitial region, where $\phi_{\mathbf{k}}$ is very close to the single plane wave $e^{i\mathbf{k}\cdot\mathbf{r}}$.

Since the plane wave $e^{i\mathbf{k}\cdot\mathbf{r}}$ and the core wave functions $\psi_{\mathbf{k}}^c(\mathbf{r})$ satisfy the Bloch condition with wave vector k, so will the OPW $\phi_{\mathbf{k}}$. We may therefore, as in the APW method, seek an expansion of the actual electronic eigenstates of the Schrödinger equation as linear combinations of OPW's:

$$\psi_{\mathbf{k}} = \sum_{\mathbf{K}} c_{\mathbf{K}}\,\phi_{\mathbf{k}+\mathbf{K}}. \qquad (11.27)$$

As in the APW method, we can determine the coefficients $c_{\mathbf{K}}$ in (11.27) and the energies $\mathcal{E}(\mathbf{k})$ by inserting (11.27) into the variational principle (11.17), and requiring that the derivatives of the resulting expression with respect to all the $c_{\mathbf{K}}$'s vanish. The crystal potential $U(\mathbf{r})$ will enter into the resulting secular problem only through its OPW matrix elements:

$$\int \phi_{\mathbf{k}+\mathbf{K}}^*(\mathbf{r})\,U(\mathbf{r})\,\phi_{\mathbf{k}+\mathbf{K}'}(\mathbf{r})\, d\mathbf{r}. \qquad (11.28)$$

The OPW method owes its success to the fact that although the plane-wave matrix elements of U are large, its OPW matrix elements turn out to be much smaller. Therefore, although it is hopeless to try to get convergence by expanding $\psi_{\mathbf{k}}$ in plane waves, the convergence of the expansion in OPW's is very much faster.

In  practice  the  OPW  method  is  used  in  two  very  different  ways.  On  the  one  hand, 
one  may  carry  out  numerically  a  first  principles  OPW  calculation,  starting  with  an 
atomic  potential,  calculating  its  OPW  matrix  elements,  and  working  with  large 
enough  secular  problems  (which  may  sometimes  turn  out  to  be  remarkably  small, 
but  which  can  also  require  as  many  as  a  hundred  OPW’s)  to  ensure  good  convergence. 

On  the  other  hand,  one  frequently  encounters  “calculations”  of  band  structure 
that  appear  to  be  nothing  but  the  nearly  free  electron  theory  of  Chapter  9,  in  which 
the  Fourier  components  UK  of  the  potential  are  treated  as  adjustable  parameters 
rather  than  known  quantities.  The  UK  are  determined  by  fitting  the  nearly  free  electron 
bands  either  to  empirical  data  or  to  the  bands  calculated  in  detail  by  one  of  the  more 
realistic  methods.  As  an  example  of  this,  the  KKR  bands  for  aluminum,  shown  in 
Figure  11.9,  can  be  reproduced  with  remarkable  precision  throughout  the  zone  by 
a  nearly  free  electron  calculation  that  uses  only  four  plane  waves  and  requires  only 
two parameters27: $U_{111}$ and $U_{200}$.
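The idea of treating a Fourier component of the potential as an adjustable parameter can be illustrated with the smallest possible secular problem. The sketch below is not the four-plane-wave aluminum fit referred to above: it is the two-plane-wave case in one dimension, in units hbar = m = 1, with a hypothetical function name `two_band_energies` and a parameter value chosen arbitrarily. The two roots are those of the 2 × 2 determinant coupling the plane waves k and k - K through U_K.

```python
import math

def two_band_energies(k, K, U_K):
    """Roots of the two-plane-wave secular problem (1D, hbar = m = 1).

    The free electron energies of the plane waves k and k - K are coupled
    by the single Fourier component U_K, here treated as a fit parameter.
    """
    e1 = 0.5 * k * k
    e2 = 0.5 * (k - K) ** 2
    mean = 0.5 * (e1 + e2)
    split = math.sqrt(0.25 * (e1 - e2) ** 2 + abs(U_K) ** 2)
    return mean - split, mean + split

if __name__ == "__main__":
    K = 2.0 * math.pi          # reciprocal lattice vector for unit lattice constant
    U = 0.1                    # illustrative value only, not a fitted parameter
    lower, upper = two_band_energies(0.5 * K, K, U)
    print(upper - lower)       # gap at the zone boundary, equal to 2|U_K|
```

In a fit of the kind described in the text, a parameter such as U_K would be varied until the resulting bands match either measured data or bands computed by one of the more elaborate methods.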


27 B. Segall, Phys. Rev. 124, 1797 (1961). (A third parameter is used in the form for the free electron energy, which is written as $\alpha\hbar^2k^2/2m$.) As it happens, these bands do not lead to a Fermi surface with the correct detailed structure (an illustration of the difficulty of getting precise potentials).
Since nearly free electron theory surely cannot work so well, it must be that the apparently nearly free electron secular problem is actually the final stage of a much more complicated analysis, such as that of the OPW method, the Fourier components $U_{\mathbf{K}}$ being OPW rather than plane-wave matrix elements of the potential. One therefore refers to such a calculation as an OPW calculation. In this context, however, such a designation is little more than a reminder that although the analysis is formally identical to nearly free electron theory, it can be placed on a more secure theoretical footing.

It  is  not  at  all  clear,  however,  that  the  OPW  approach  is  the  best  way  to  reduce 
the  actual  problem  of  an  electron  in  a  periodic  potential  to  an  effectively  “nearly 
free”  electron  calculation.  A  more  systematic  way  of  studying  this  problem,  as  well 
as  a  variety  of  additional  calculational  approaches,  is  offered  by  the  pseudopotential 
methods. 


THE  PSEUDOPOTENTIAL 

The  theory  of  the  pseudopotential  began  as  an  extension  of  the  OPW  method.  Aside 
from  the  possibility  it  offers  of  refining  OPW  calculations,  it  also  provides  at  least  a 
partial  explanation  for  the  success  of  nearly  free  electron  calculations  in  fitting  actual 
band  structures. 

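We describe the pseudopotential method only in its earliest formulation,28 which is basically a recasting of the OPW approach. Suppose that we write the exact wave function for a valence level as a linear combination of OPW's, as in (11.27). Let $\phi_{\mathbf{k}}^v$ be the plane-wave part of this expansion:

$$\phi_{\mathbf{k}}^v(\mathbf{r}) = \sum_{\mathbf{K}} c_{\mathbf{K}}\,e^{i(\mathbf{k}+\mathbf{K})\cdot\mathbf{r}}. \qquad (11.29)$$

Then we can rewrite the expansions (11.27) and (11.24) as

$$\psi_{\mathbf{k}}^v(\mathbf{r}) = \phi_{\mathbf{k}}^v(\mathbf{r}) - \sum_c \left( \int d\mathbf{r}'\, \psi_{\mathbf{k}}^{c*}(\mathbf{r}')\,\phi_{\mathbf{k}}^v(\mathbf{r}') \right) \psi_{\mathbf{k}}^c(\mathbf{r}). \qquad (11.30)$$

Since $\psi_{\mathbf{k}}^v$ is an exact valence wave function, it satisfies Schrödinger's equation with eigenvalue $\mathcal{E}_{\mathbf{k}}^v$:

$$H\psi_{\mathbf{k}}^v = \mathcal{E}_{\mathbf{k}}^v\,\psi_{\mathbf{k}}^v. \qquad (11.31)$$

Substitution of (11.30) into (11.31) gives

$$H\phi_{\mathbf{k}}^v - \sum_c \left( \int d\mathbf{r}'\, \psi_{\mathbf{k}}^{c*}\,\phi_{\mathbf{k}}^v \right) H\psi_{\mathbf{k}}^c = \mathcal{E}_{\mathbf{k}}^v \left( \phi_{\mathbf{k}}^v - \sum_c \left( \int d\mathbf{r}'\, \psi_{\mathbf{k}}^{c*}\,\phi_{\mathbf{k}}^v \right) \psi_{\mathbf{k}}^c \right). \qquad (11.32)$$

If we note that $H\psi_{\mathbf{k}}^c = \mathcal{E}_{\mathbf{k}}^c\,\psi_{\mathbf{k}}^c$ for the exact core levels, then we can rewrite (11.32) as

$$(H + V^R)\,\phi_{\mathbf{k}}^v = \mathcal{E}_{\mathbf{k}}^v\,\phi_{\mathbf{k}}^v, \qquad (11.33)$$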


28 E. Antoncik, J. Phys. Chem. Solids 10, 314 (1959); J. C. Phillips and L. Kleinman, Phys. Rev. 116, 287, 880 (1959).
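where we have buried some rather cumbersome terms in the operator $V^R$, which is defined by

$$V^R\psi = \sum_c \left( \mathcal{E}_{\mathbf{k}}^v - \mathcal{E}_{\mathbf{k}}^c \right) \left( \int d\mathbf{r}'\, \psi_{\mathbf{k}}^{c*}\,\psi \right) \psi_{\mathbf{k}}^c. \qquad (11.34)$$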


We have therefore arrived at an effective Schrödinger equation (11.33) satisfied by $\phi_{\mathbf{k}}^v$, the smooth part of the Bloch function. Since experience with the OPW method suggests that $\phi_{\mathbf{k}}^v$ can be approximated by a linear combination of a small number of plane waves, we might expect that the nearly free electron theory of Chapter 9 could be applied to finding the valence levels of $H + V^R$. This is the starting point for pseudopotential calculation and analysis.

The pseudopotential is defined to be the sum of the actual periodic potential U and $V^R$:

$$H + V^R = -\frac{\hbar^2}{2m}\nabla^2 + V^{\text{pseudo}}. \qquad (11.35)$$


The hope is that the pseudopotential is sufficiently small to justify a nearly free electron
calculation of the valence levels. One can see a hint that this might be so from the
fact that although the actual periodic potential is attractive near the ion cores, and
thus $(\psi, U\psi) = \int d\mathbf{r}\, \psi^{*}(\mathbf{r})U(\mathbf{r})\psi(\mathbf{r})$ is negative, the corresponding matrix element of
the potential $V^{R}$ is, according to (11.34),

$$(\psi, V^{R}\psi) = \sum_{c}\left(\mathcal{E}_{\mathbf{k}}^{v} - \mathcal{E}_{\mathbf{k}}^{c}\right)\left|\int d\mathbf{r}\, \psi_{\mathbf{k}}^{c*}\psi\right|^{2}. \qquad (11.36)$$

Since the valence energies lie above the core energies, this is always positive. Thus
adding $V^{R}$ to $U$ provides at least a partial cancellation, and one might optimistically
hope for it to lead to a potential weak enough to do nearly free electron calculations
for $\phi_{\mathbf{k}}^{v}$ (the so-called pseudo wave function), treating the pseudopotential as a weak
perturbation.

There are some peculiar features to the pseudopotential. Equation (11.34) implies
that $V^{R}$ (and hence the pseudopotential) is nonlocal; i.e., its effect on a wave function
$\psi(\mathbf{r})$ is not merely to multiply it by some function of $\mathbf{r}$. In addition, the pseudopotential
depends on the energy of the level being sought, $\mathcal{E}_{\mathbf{k}}^{v}$, which means that many of the
basic theorems one is used to applying without further thought (such as the
orthogonality of eigenfunctions belonging to different eigenvalues) are no longer applicable
to $H^{\text{pseudo}}$.

The second difficulty can be removed by setting $\mathcal{E}_{\mathbf{k}}^{v}$ in (11.34) and in $V^{\text{pseudo}}$ equal
to the energy of the levels one is most interested in (generally the Fermi energy). Of
course, once this replacement has been made, the eigenvalues of $H + V^{R}$ are no
longer exactly those of the original Hamiltonian, except for the levels at the Fermi
energy. Since these are frequently the levels of greatest interest, this need not be too
great a price to pay. For example, one can, in this way, find the set of $\mathbf{k}$ for which
$\mathcal{E}_{\mathbf{k}}^{v} = \mathcal{E}_{F}$, thereby mapping out the Fermi surface.

There turn out to be many ways other than (11.34) to define a $V^{R}$ such that
$H + V^{R}$ has the same valence eigenvalues as the actual crystal Hamiltonian $H$.
From such choices has arisen a wealth of pseudopotential lore, whose usefulness
for anything other than justifying the nearly free electron Fermi surfaces has yet
to be convincingly established.29

COMBINED  METHODS 

People have, of course, exercised considerable further ingenuity in combining the
various techniques. Thus, for example, it may be useful to treat the d-bands of
transition elements in a manner suggested by the tight-binding approximation and yet
allow for s-d mixing, not by adding tight-binding functions for the s-band as well,
but by combining one of the plane-wave methods we have described in some suitably
self-consistent way. It goes without saying that we have only scratched the surface
of several vast fields of endeavor in this survey of methods for computing the energy
bands.

This and the preceding three chapters have been concerned with the abstract
structural features of band structure. We now turn to some of the more direct
observational manifestations of the electronic energy bands. Chapters 12 and 13 discuss
the generalization of the transport theory of Drude and Sommerfeld to Bloch
electrons; Chapter 14 discusses some of the techniques for direct observation of the Fermi
surface; and Chapter 15 describes the band structures of some of the more familiar
metals.

PROBLEMS 


1. Boundary Conditions on Electron Wave Functions in Crystals
Let $\mathbf{r}$ locate a point just within the boundary of a primitive cell $C_0$, and $\mathbf{r}'$ another point
infinitesimally displaced from $\mathbf{r}$ just outside the same boundary. The continuity equations for $\psi(\mathbf{r})$ are

$$\lim_{\mathbf{r}\to\mathbf{r}'}\,[\psi(\mathbf{r}) - \psi(\mathbf{r}')] = 0,$$

$$\lim_{\mathbf{r}\to\mathbf{r}'}\,[\nabla\psi(\mathbf{r}) - \nabla\psi(\mathbf{r}')] = 0. \qquad (11.37)$$

(a) Verify that any point $\mathbf{r}$ on the surface of a primitive cell is separated by some Bravais
lattice vector $\mathbf{R}$ from another surface point and that the normals to the cell at $\mathbf{r}$ and $\mathbf{r} + \mathbf{R}$ are
oppositely directed.

(b) Using the fact that $\psi$ can be chosen to have the Bloch form, show that the continuity
conditions can equally well be written in terms of the values of $\psi$ entirely within a primitive cell:

$$\psi(\mathbf{r}) = e^{-i\mathbf{k}\cdot\mathbf{R}}\,\psi(\mathbf{r} + \mathbf{R}),$$

$$\nabla\psi(\mathbf{r}) = e^{-i\mathbf{k}\cdot\mathbf{R}}\,\nabla\psi(\mathbf{r} + \mathbf{R}), \qquad (11.38)$$

for pairs of points on the surface separated by direct lattice vectors $\mathbf{R}$.


29 A review of the pseudopotential and its applications can be found in Solid State Physics, Vol. 24,
D. Turnbull and F. Seitz, eds., Academic, New York, 1970.

(c) Show that the only information in the second of equations (11.38) not already contained
in the first is in the equation

$$\hat{\mathbf{n}}(\mathbf{r})\cdot\nabla\psi(\mathbf{r}) = -e^{-i\mathbf{k}\cdot\mathbf{R}}\,\hat{\mathbf{n}}(\mathbf{r} + \mathbf{R})\cdot\nabla\psi(\mathbf{r} + \mathbf{R}), \qquad (11.39)$$

where the vector $\hat{\mathbf{n}}$ is normal to the surface of the cell.

2. Using the fact that the APW is continuous on the surfaces defining the muffin-tin potential,
give an argument to show that the APW $\phi_{\mathbf{k}+\mathbf{K},\mathcal{E}}$ satisfies the Bloch condition with wave vector $\mathbf{k}$.

3. The integral equation for a Bloch function in a periodic potential is given by Eq. (11.21)
where, for potentials of the muffin-tin type, the region of integration is confined to $|\mathbf{r}'| < r_0$.

(a) From the definition (11.22) of $\mathcal{G}$ show that

$$\left(\frac{\hbar^{2}}{2m}\nabla'^{2} + \mathcal{E}\right)\mathcal{G}_{\mathbf{k},\mathcal{E}}(\mathbf{r} - \mathbf{r}') = \delta(\mathbf{r} - \mathbf{r}'), \qquad r, r' < r_0. \qquad (11.40)$$

(b) Show by writing

$$\mathcal{G}\nabla'^{2}\psi = \nabla'\cdot(\mathcal{G}\nabla'\psi - \psi\nabla'\mathcal{G}) + \psi\nabla'^{2}\mathcal{G},$$

that (11.21), (11.40), and the Schrödinger equation for $r' < r_0$ lead to

$$0 = \int_{r' < r_0} d\mathbf{r}'\, \nabla'\cdot\left[\mathcal{G}_{\mathbf{k},\mathcal{E}(\mathbf{k})}(\mathbf{r} - \mathbf{r}')\nabla'\psi_{\mathbf{k}}(\mathbf{r}') - \psi_{\mathbf{k}}(\mathbf{r}')\nabla'\mathcal{G}_{\mathbf{k},\mathcal{E}(\mathbf{k})}(\mathbf{r} - \mathbf{r}')\right]. \qquad (11.41)$$

(c) Use Gauss' theorem to transform (11.41) to an integral over the surface of a sphere of
radius $r' = r_0$ and show that when $r$ is also set equal to $r_0$, Eq. (11.23) results.

The Bloch theory (Chapter 8) extends the equilibrium free electron theory of Sommerfeld
(Chapter 2) to the case in which a (nonconstant) periodic potential is present. In
Table 12.1 we compare the major features of the two theories.


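Table 12.1

COMPARISON OF SOMMERFELD AND BLOCH ONE-ELECTRON EQUILIBRIUM LEVELS

QUANTUM NUMBERS (EXCLUDING SPIN)
    SOMMERFELD: $\mathbf{k}$ ($\hbar\mathbf{k}$ is the momentum.)
    BLOCH: $\mathbf{k}$, $n$ ($\hbar\mathbf{k}$ is the crystal momentum and $n$ is the band index.)

RANGE OF QUANTUM NUMBERS
    SOMMERFELD: $\mathbf{k}$ runs through all of k-space consistent with the Born-von Karman
    periodic boundary condition.
    BLOCH: For each $n$, $\mathbf{k}$ runs through all wave vectors in a single primitive cell of the
    reciprocal lattice consistent with the Born-von Karman periodic boundary condition;
    $n$ runs through an infinite set of discrete values.

ENERGY
    SOMMERFELD: $\mathcal{E}(\mathbf{k}) = \hbar^{2}k^{2}/2m$
    BLOCH: For a given band index $n$, $\mathcal{E}_n(\mathbf{k})$ has no simple explicit form. The only
    general property is periodicity in the reciprocal lattice:
    $\mathcal{E}_n(\mathbf{k} + \mathbf{K}) = \mathcal{E}_n(\mathbf{k})$.

VELOCITY
    SOMMERFELD: The mean velocity of an electron in a level with wave vector $\mathbf{k}$ is:
    $\mathbf{v} = \hbar\mathbf{k}/m = (1/\hbar)\,\partial\mathcal{E}/\partial\mathbf{k}$.
    BLOCH: The mean velocity of an electron in a level with band index $n$ and wave
    vector $\mathbf{k}$ is:
    $\mathbf{v}_n(\mathbf{k}) = (1/\hbar)\,\partial\mathcal{E}_n(\mathbf{k})/\partial\mathbf{k}$.

WAVE FUNCTION
    SOMMERFELD: The wave function of an electron with wave vector $\mathbf{k}$ is:
    $\psi_{\mathbf{k}}(\mathbf{r}) = e^{i\mathbf{k}\cdot\mathbf{r}}/V^{1/2}$.
    BLOCH: The wave function of an electron with band index $n$ and wave vector $\mathbf{k}$ is:
    $\psi_{n\mathbf{k}}(\mathbf{r}) = e^{i\mathbf{k}\cdot\mathbf{r}}u_{n\mathbf{k}}(\mathbf{r})$,
    where the function $u_{n\mathbf{k}}$ has no simple explicit form. The only general property
    is periodicity in the direct lattice:
    $u_{n\mathbf{k}}(\mathbf{r} + \mathbf{R}) = u_{n\mathbf{k}}(\mathbf{r})$.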

To discuss conduction we had to extend Sommerfeld's equilibrium theory to
nonequilibrium cases. We argued in Chapter 2 that one could calculate the dynamic
behavior of the free electron gas using ordinary classical mechanics, provided that
there was no need to localize an electron on a scale comparable to the interelectronic
distance. Thus the trajectory of each electron between collisions was calculated
according to the usual classical equations of motion for a particle of momentum $\hbar\mathbf{k}$:

$$\dot{\mathbf{r}} = \frac{\hbar\mathbf{k}}{m}, \qquad \hbar\dot{\mathbf{k}} = -e\left(\mathbf{E} + \frac{1}{c}\,\mathbf{v}\times\mathbf{H}\right). \qquad (12.1)$$

If pressed to justify this procedure from a quantum-mechanical point of view we
would argue that (12.1) actually describes the behavior of a wave packet of free
electron levels,

$$\psi(\mathbf{r}, t) = \sum_{\mathbf{k}'} g(\mathbf{k}')\exp\left[i\left(\mathbf{k}'\cdot\mathbf{r} - \frac{\hbar k'^{2}t}{2m}\right)\right],$$

$$g(\mathbf{k}') \approx 0, \qquad |\mathbf{k}' - \mathbf{k}| > \Delta k, \qquad (12.2)$$

where $\mathbf{r}$ and $\hbar\mathbf{k}$ are the mean position and momentum about which the wave packet
is localized (to within the limitation $\Delta x\,\Delta k > 1$ imposed by the uncertainty principle).
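The claim that the center of the packet (12.2) moves with the classical velocity $\hbar\mathbf{k}/m$ can be checked numerically. The following is a minimal one-dimensional sketch; the Gaussian form of the weight $g$, the units ($\hbar = m = 1$), and the function name `packet_center` are assumptions made here for illustration only.

```python
import numpy as np

# Units with hbar = m = 1 (an assumption made purely for convenience).
HBAR = 1.0
MASS = 1.0

def packet_center(k0, dk, t, x):
    """Mean position at time t of a 1D free-electron wave packet of the form (12.2).

    The weight g(k') is taken to be a Gaussian of width dk centered on k0;
    the Gaussian shape is an illustrative choice, not required by the argument.
    """
    kp = np.linspace(k0 - 6 * dk, k0 + 6 * dk, 401)
    g = np.exp(-((kp - k0) ** 2) / (2 * dk ** 2))
    # Phase k' x - hbar k'^2 t / 2m for every (x, k') pair.
    phase = np.outer(x, kp) - HBAR * kp ** 2 * t / (2 * MASS)
    psi = (g * np.exp(1j * phase)).sum(axis=1)
    prob = np.abs(psi) ** 2
    return float((x * prob).sum() / prob.sum())

x = np.linspace(-100.0, 300.0, 4001)
k0, dk, t_final = 2.0, 0.1, 50.0
x_start = packet_center(k0, dk, 0.0, x)
x_end = packet_center(k0, dk, t_final, x)
v_packet = (x_end - x_start) / t_final   # velocity of the packet center
v_classical = HBAR * k0 / MASS           # velocity from (12.1)
print(v_packet, v_classical)
```

With these parameters the packet is about ten length units wide, so the comparison is meaningful only on scales larger than that, which is the sense of the restriction $\Delta x\,\Delta k > 1$.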

This approach has a simple and elegant generalization to electrons in a general
periodic potential, which is known as the semiclassical model. Justifying the
semiclassical model in detail is a formidable task, considerably more difficult than
justifying the ordinary classical limit for free electrons. In this book we shall not offer
a systematic derivation. Our emphasis instead will be on how the semiclassical model
is used. We shall therefore simply describe the model, state the limitations on its
validity, and extract some of its major physical consequences.1

The  reader  who  is  dissatisfied  with  the  very  incomplete  and  merely  suggestive 
bases  we  shall  offer  for  the  semiclassical  model  is  urged  to  examine  the  broad  array 
of  mysteries  and  anomalies  of  free  electron  theory  that  the  model  resolves.  Perhaps 
a  suitable  attitude  to  take  is  this:  If  there  were  no  underlying  microscopic  quantum 
theory  of  electrons  in  solids,  one  could  still  imagine  a  semiclassical  mechanics  (guessed 
by  some  late  nineteenth-century  Newton  of  crystalline  spaces)  that  was  brilliantly 
confirmed  by  its  account  of  observed  electronic  behavior,  just  as  classical  mechanics 
was  confirmed  by  its  accounting  for  planetary  motion,  and  only  very  much  later  given 
a  more  fundamental  derivation  as  a  limiting  form  of  quantum  mechanics. 

As with free electrons, two questions arise in discussing conduction by Bloch
electrons2: (a) What is the nature of the collisions? (b) How do Bloch electrons move
between collisions? The semiclassical model deals entirely with the second question,
but the Bloch theory also critically affects the first. Drude assumed that the electrons
collided with the fixed heavy ions. This assumption cannot be reconciled with the
very long mean free paths possible in metals, and fails to account for their observed
temperature dependence.3 The Bloch theory excludes it on theoretical grounds as
well. Bloch levels are stationary solutions to the Schrödinger equation in the presence
of the full periodic potential of the ions. If an electron in the level $\psi_{n\mathbf{k}}$ has a mean
nonvanishing velocity (as it does unless $\partial\mathcal{E}_n(\mathbf{k})/\partial\mathbf{k}$ happens to vanish), then that
velocity persists forever.4 One cannot appeal to collisions with static ions as a
mechanism to degrade the velocity, because the interaction of the electron with the fixed
periodic array of ions has been fully taken into account ab initio in the Schrödinger
equation solved by the Bloch wave function. Thus the conductivity of a perfect
periodic crystal is infinite.

1  For  one  of  the  more  recent  efforts  at  a  systematic  derivation  see  J.  Zak,  Phys.  Rev.  168,  686  (1968). 
References  to  much  of  the  earlier  work  are  given  therein.  A  very  appealing  treatment  of  Bloch  electrons 
in  a  magnetic  field  (perhaps  the  most  difficult  area  in  which  to  derive  the  semiclassical  model)  is  given 
by  R.  G.  Chambers,  Proc.  Phys.  Soc.  89,  695  (1966),  who  explicitly  constructs  a  time-dependent  wave 
packet  whose  center  moves  along  the  orbit  determined  by  the  semiclassical  equations  of  motion. 

2 We shall use the term "Bloch electrons" to mean "electrons in a general periodic potential."

3 Page 9.

4 See page 141.

This result, so disconcerting to one's classical inclination to picture the electrons
as suffering current degrading bumps with individual ions, can be understood as a
simple manifestation of the wave nature of electrons. In a periodic array of scatterers
a wave can propagate without attenuation because of the coherent constructive
interference of the scattered waves.5

Metals have an electrical resistance because no real solid is a perfect crystal. There
are always impurities, missing ions, or other imperfections that can scatter electrons,
and at very low temperatures it is these that limit conduction. Even if imperfections
could be entirely eliminated, however, the conductivity would remain finite because
of thermal vibrations of the ions, which produce temperature-dependent distortions
from perfect periodicity in the potential the electrons experience. These deviations
from periodicity are capable of scattering electrons, and are the source of the
temperature dependence of the electronic relaxation time that was noted in Chapter 1.

We defer a full discussion of the actual scattering mechanisms to Chapters 16 and
26. Here we only note that the Bloch theory now forces us to abandon Drude's naive
picture of electron-ion scattering. We shall nevertheless continue to extract
consequences that follow from the simple assumption that some scattering mechanism
exists, irrespective of its detailed features.

Thus the main problem we face is how to describe the motion of Bloch electrons
between collisions. The fact that the mean velocity of an electron in a definite Bloch
level $\psi_{n\mathbf{k}}$ is6

$$\mathbf{v}_n(\mathbf{k}) = \frac{1}{\hbar}\frac{\partial\mathcal{E}_n(\mathbf{k})}{\partial\mathbf{k}} \qquad (12.3)$$

is very suggestive. Consider a wave packet of Bloch levels from a given band,
constructed in analogy to the free electron wave packet (12.2):

$$\psi_n(\mathbf{r}, t) = \sum_{\mathbf{k}'} g(\mathbf{k}')\,\psi_{n\mathbf{k}'}(\mathbf{r})\exp\left[-\frac{i}{\hbar}\mathcal{E}_n(\mathbf{k}')t\right], \qquad g(\mathbf{k}') \approx 0, \quad |\mathbf{k}' - \mathbf{k}| > \Delta k. \qquad (12.4)$$

Let the spread in wave vector $\Delta k$ be small compared with the dimensions of the
Brillouin zone, so that $\mathcal{E}_n(\mathbf{k})$ varies little over all levels appearing in the wave packet.
The formula for the velocity (12.3) can then be viewed as the familiar assertion that
the group velocity of a wave packet is $\partial\omega/\partial\mathbf{k} = (\partial/\partial\mathbf{k})(\mathcal{E}/\hbar)$.
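To make (12.3) concrete, the sketch below evaluates the group velocity for a one-dimensional tight-binding band, $\mathcal{E}(k) = -2t\cos ka$. This band, its parameters `t_hop` and `a`, and the units $\hbar = 1$ are assumptions chosen for illustration; the semiclassical model itself takes $\mathcal{E}_n(\mathbf{k})$ as given.

```python
import numpy as np

HBAR = 1.0  # units with hbar = 1 (assumption)

def band_energy(k, t_hop=1.0, a=1.0):
    """Illustrative 1D tight-binding band, E(k) = -2 t cos(k a)."""
    return -2.0 * t_hop * np.cos(k * a)

def band_velocity(k, dk=1e-5, **band):
    """Mean velocity (12.3), v = (1/hbar) dE/dk, by central difference."""
    slope = (band_energy(k + dk, **band) - band_energy(k - dk, **band)) / (2 * dk)
    return slope / HBAR

# Sample the velocity across one primitive cell of the reciprocal lattice.
k_values = np.linspace(-np.pi, np.pi, 9)
for k, v in zip(k_values, band_velocity(k_values)):
    print(f"k = {k:+.3f}   v = {v:+.4f}")
```

Unlike the free electron velocity $\hbar k/m$, the velocity of this band vanishes both at the band minimum and at the zone boundary, and it is periodic in the reciprocal lattice.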

The  semiclassical  model  describes  such  wave  packets  when  it  is  unnecessary  to 
specify  the  position  of  an  electron  on  a  scale  comparable  with  the  spread  of  the 
packet. 

Let us estimate how broad the wave packet (12.4) must be when the spread in
wave vector is small compared with the dimensions of the Brillouin zone. We examine
the wave packet at points separated by a Bravais lattice vector. Setting $\mathbf{r} = \mathbf{r}_0 + \mathbf{R}$,
and using the basic property (8.6) of the Bloch function, we can write (12.4) as

$$\psi_n(\mathbf{r}_0 + \mathbf{R}, t) = \sum_{\mathbf{k}'}\left[g(\mathbf{k}')\,\psi_{n\mathbf{k}'}(\mathbf{r}_0)\right]\exp\left[i\left(\mathbf{k}'\cdot\mathbf{R} - \frac{1}{\hbar}\mathcal{E}_n(\mathbf{k}')t\right)\right]. \qquad (12.5)$$

5 For a unified view of a variety of such phenomena, see L. Brillouin, Wave Propagation in Periodic
Structures, Dover, New York, 1953.

6 See page 141. The result is proved in Appendix E.

Viewed as a function of $\mathbf{R}$ for fixed $\mathbf{r}_0$, this is just a superposition of plane waves, of
the form (12.2), with a weight function $\bar{g}(\mathbf{k}) = [g(\mathbf{k})\psi_{n\mathbf{k}}(\mathbf{r}_0)]$. Thus if $\Delta k$ measures the
region within which $g$ (and hence $\bar{g}$) is appreciable,7 then $\psi_n(\mathbf{r}_0 + \mathbf{R})$, in accordance
with the usual rules for wave packets, should be appreciable within a region of
dimensions $\Delta R \approx 1/\Delta k$. Since $\Delta k$ is small compared with the zone dimensions, which
are of the order of the inverse lattice constant $1/a$, it follows that $\Delta R$ must be large
compared with $a$. This conclusion is independent of the particular value of $\mathbf{r}_0$, and
we therefore conclude that a wave packet of Bloch levels with a wave vector that is
well defined on the scale of the Brillouin zone must be spread in real space over many
primitive cells.
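The estimate $\Delta R \approx 1/\Delta k$ can be illustrated numerically. The sketch below sums plane waves $e^{ik'R}$ over one zone of a one-dimensional lattice with a Gaussian weight of width `dk` and measures the resulting spread over lattice sites. The Gaussian weight, the neglect of the factor $\psi_{n\mathbf{k}'}(\mathbf{r}_0)$ (which footnote 7 argues is nearly constant), and the function name `real_space_spread` are assumptions for illustration.

```python
import numpy as np

def real_space_spread(dk, a=1.0, n_cells=801, n_k=1001):
    """RMS width over lattice sites R of |sum_k' g(k') exp(i k' R)|^2.

    g is a Gaussian of width dk centered on k = 0 (illustrative choice);
    only the envelope in R matters for the estimate Delta R ~ 1/Delta k.
    """
    R = a * (np.arange(n_cells) - n_cells // 2)      # lattice sites
    kp = np.linspace(-np.pi / a, np.pi / a, n_k)     # one zone in k
    g = np.exp(-kp ** 2 / (2 * dk ** 2))
    amplitude = np.exp(1j * np.outer(R, kp)) @ g
    prob = np.abs(amplitude) ** 2
    prob /= prob.sum()
    return float(np.sqrt((R ** 2 * prob).sum()))

zone_width = 2 * np.pi   # zone dimension for a = 1
spreads = {}
for fraction in (0.01, 0.002):
    dk = fraction * zone_width
    spreads[fraction] = real_space_spread(dk)
    print(f"dk = {fraction} of the zone  ->  spread = {spreads[fraction]:.1f} lattice constants")
```

For a Gaussian weight the product of the two widths is a fixed number of order unity, so shrinking $\Delta k$ to a small fraction of the zone necessarily spreads the packet over many primitive cells.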

The  semiclassical  model  describes  the  response  of  the  electrons  to  externally 
applied  electric  and  magnetic  fields  that  vary  slowly  over  the  dimensions  of  such  a 
wave  packet  (Figure  12.1)  and  therefore  exceedingly  slowly  over  a  few  primitive  cells. 

Figure 12.1

Schematic view of the situation described by the semiclassical model. The length over
which the applied field (dashed line) varies is much greater than the spread in the wave
packet of the electron (solid line), which in turn is much larger than the lattice constant.
(Lengths labeled in the figure: spread of wave packet; lattice constant; wavelength of
applied field.)

In  the  semiclassical  model  such  fields  give  rise  to  ordinary  classical  forces  in  an 
equation  of  motion  describing  the  evolution  of  the  position  and  wave  vector  of  the 
packet.  The  subtlety  of  the  semiclassical  model  that  makes  it  more  complicated  than 
the  ordinary  classical  limit  of  free  electrons,  is  that  the  periodic  potential  of  the 
lattice  varies  over  dimensions  that  are  small  compared  with  the  spread  of  the  wave 
packet,  and  therefore  cannot  be  treated  classically.  Thus  the  semiclassical  model  is 
a  partial  classical  limit:  The  externally  applied  fields  are  treated  classically,  but  the 
periodic  field  of  the  ions  is  not. 

DESCRIPTION  OF  THE  SEMICLASSICAL  MODEL 

The semiclassical model predicts how, in the absence of collisions, the position $\mathbf{r}$
and wave vector $\mathbf{k}$ of each electron8 evolve in the presence of externally applied
electric and magnetic fields. This prediction is based entirely upon a knowledge of the
band structure of the metal, i.e., upon the forms of the functions $\mathcal{E}_n(\mathbf{k})$, and upon no
other explicit information about the periodic potential of the ions. The model takes the
$\mathcal{E}_n(\mathbf{k})$ as given functions, and says nothing about how to compute them. The aim of
the model is to relate the band structure to the transport properties, i.e., the response
of the electrons to applied fields or temperature gradients. One uses the model both
to deduce transport properties from a given (calculated) band structure and to deduce
features of the band structure from the observed transport properties.

7 If $g$ is appreciable only in a neighborhood of $\mathbf{k}$ small compared with the dimensions of the zone,
then $\psi_{n\mathbf{k}}(\mathbf{r}_0)$ will vary little over this range, and as a function of $\mathbf{k}$, $\bar{g}$ will differ little from a constant times
$g$.

8 Hereafter we shall speak of an electron as having both a position and a wave vector. What we are
referring to, of course, is a wave packet, as described above.

Given the functions $\mathcal{E}_n(\mathbf{k})$, the semiclassical model associates with each electron
a position $\mathbf{r}$, a wave vector $\mathbf{k}$, and a band index $n$. In the course of time and in the
presence of external electric and magnetic fields $\mathbf{E}(\mathbf{r}, t)$ and $\mathbf{H}(\mathbf{r}, t)$ the position, wave
vector, and band index are taken to evolve according to the following rules:

1.  The  band  index  n  is  a  constant  of  the  motion.  The  semiclassical  model  ignores 
the  possibility  of  “interband  transitions”. 

2. The time evolution of the position and wave vector of an electron with band
index $n$ are determined by the equations of motion:

$$\dot{\mathbf{r}} = \mathbf{v}_n(\mathbf{k}) = \frac{1}{\hbar}\frac{\partial\mathcal{E}_n(\mathbf{k})}{\partial\mathbf{k}}, \qquad (12.6a)$$

$$\hbar\dot{\mathbf{k}} = -e\left[\mathbf{E}(\mathbf{r}, t) + \frac{1}{c}\,\mathbf{v}_n(\mathbf{k})\times\mathbf{H}(\mathbf{r}, t)\right]. \qquad (12.6b)$$

3. (This rule simply restates those features of the full quantum mechanical Bloch
theory that are retained in the semiclassical model.) The wave vector of an electron
is only defined to within an additive reciprocal lattice vector $\mathbf{K}$. One cannot
have two distinct electrons with the same band index $n$ and position $\mathbf{r}$, whose
wave vectors $\mathbf{k}$ and $\mathbf{k}'$ differ by a reciprocal lattice vector $\mathbf{K}$; the labels $n, \mathbf{r}, \mathbf{k}$
and $n, \mathbf{r}, \mathbf{k} + \mathbf{K}$ are completely equivalent ways of describing the same electron.9
All distinct wave vectors for a single band therefore lie in a single primitive cell
of the reciprocal lattice. In thermal equilibrium the contribution to the electronic
density from those electrons in the nth band with wave vectors in the infinitesimal
volume element $d\mathbf{k}$ of k-space is given by the usual Fermi distribution (2.56):10

$$f(\mathcal{E}_n(\mathbf{k}))\,\frac{d\mathbf{k}}{4\pi^{3}} = \frac{d\mathbf{k}/4\pi^{3}}{e^{(\mathcal{E}_n(\mathbf{k}) - \mu)/k_BT} + 1}. \qquad (12.7)$$
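Rules 1 and 2 translate directly into a pair of ordinary differential equations that can be integrated once a band is specified. The following is a minimal sketch for a single band in one dimension with a uniform static electric field and no magnetic field. The tight-binding band $\mathcal{E}(k) = -2t\cos ka$, the units ($\hbar = e = 1$), the field strength, and the fourth-order Runge-Kutta integrator are all assumptions chosen for illustration, and collisions are ignored, as in the rules above.

```python
import numpy as np

HBAR = 1.0      # units with hbar = e = 1 (assumption)
E_CHARGE = 1.0  # magnitude of the electron charge; the electron carries -e
T_HOP, A_LAT = 1.0, 1.0  # assumed tight-binding parameters

def velocity(k):
    """Eq. (12.6a) for the illustrative band E(k) = -2 t cos(k a)."""
    return 2.0 * T_HOP * A_LAT * np.sin(k * A_LAT) / HBAR

def rhs(state, field):
    """Right-hand sides of (12.6a) and (12.6b) with H = 0; state = (x, k)."""
    _, k = state
    return np.array([velocity(k), -E_CHARGE * field / HBAR])

def integrate(state0, field, t_final, n_steps):
    """Fourth-order Runge-Kutta integration of the semiclassical equations."""
    h = t_final / n_steps
    state = np.array(state0, dtype=float)
    path = [state.copy()]
    for _ in range(n_steps):
        k1 = rhs(state, field)
        k2 = rhs(state + 0.5 * h * k1, field)
        k3 = rhs(state + 0.5 * h * k2, field)
        k4 = rhs(state + h * k3, field)
        state = state + (h / 6.0) * (k1 + 2 * k2 + 2 * k3 + k4)
        path.append(state.copy())
    return np.array(path)

field = 0.05
# Time for k to traverse one primitive cell of the reciprocal lattice.
period = 2 * np.pi * HBAR / (E_CHARGE * field * A_LAT)
path = integrate([0.0, 0.0], field, t_final=period, n_steps=4000)
x_final, k_final = path[-1]
excursion = path[:, 0].max() - path[:, 0].min()
print(x_final, k_final, excursion)
```

In this collisionless sketch the wave vector changes at a constant rate while the position oscillates: after $\mathbf{k}$ has advanced by one reciprocal lattice vector the electron is back where it started, consistent with rule 3, and the size of the excursion is the bandwidth divided by $eE$.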


COMMENTS  AND  RESTRICTIONS 
A  Many-Carrier  Theory 

Because the applied fields are assumed to cause no interband transitions, one can
consider each band to contain a fixed number of electrons of a particular type. The
properties of these types may differ considerably from band to band, since the kind
of motion electrons with band index $n$ can undergo depends on the particular form
of $\mathcal{E}_n(\mathbf{k})$. In (or near) equilibrium, bands with all energies many $k_BT$ above the Fermi
energy $\mathcal{E}_F$ will be unoccupied. Thus one need not consider infinitely many carrier
types, but only those in bands with energies within a few $k_BT$ of, or lower than, $\mathcal{E}_F$.
Furthermore, we shall see below that bands in which all energies are many $k_BT$ less
than $\mathcal{E}_F$ (i.e., bands that are completely filled in equilibrium) can also be ignored.
As a result, only a small number of bands (or carrier types) need be considered in
the description of a real metal or semiconductor.

9 The semiclassical equations of motion (12.6) preserve this equivalence as time evolves. If $\mathbf{r}(t)$, $\mathbf{k}(t)$
give a solution for the nth band, then so will $\mathbf{r}(t)$, $\mathbf{k}(t) + \mathbf{K}$ for any reciprocal lattice vector $\mathbf{K}$, as a
consequence of the periodicity of $\mathcal{E}_n(\mathbf{k})$.

10 This assumes that interactions of the electron spin with any magnetic fields are of no consequence;
if they are, then each spin population makes a contribution to $n$ given by half (12.7) where $\mathcal{E}_n(\mathbf{k})$ must
include the interaction energy of the given spin with the magnetic field.

Crystal  Momentum  Is  Not  Momentum 

Note that within each band the equations of motion (12.6) are the same as the free
electron equations (12.1) except that $\mathcal{E}_n(\mathbf{k})$ appears instead of the free electron energy
$\hbar^{2}k^{2}/2m$. Nevertheless, the crystal momentum $\hbar\mathbf{k}$ is not the momentum of a Bloch
electron, as emphasized in Chapter 8. The rate of change of an electron's momentum
is given by the total force on the electron, but the rate of change of an electron's
crystal momentum is given by Eq. (12.6), in which forces are exerted only by the
external fields and not by the periodic field of the lattice.11

Limits  of  Validity 

In the limit of zero periodic potential the semiclassical model must break down, for
in that limit the electron will be a free electron. In a uniform electric field a free
electron can continually increase its kinetic energy at the expense of electrostatic
potential energy. However, the semiclassical model forbids interband transitions,
and therefore requires that the energy of any electron remain confined within the
limits of the band in which the electron originally found itself.12 Thus there must
be some minimum strength of the periodic potential before the semiclassical model
can be applied. The conditions, stated here without proof,13 are that the amplitudes
of the slowly varying external electric and magnetic fields satisfy

$$eEa \ll \frac{[\mathcal{E}_{\text{gap}}(\mathbf{k})]^{2}}{\mathcal{E}_F}, \qquad (12.8)$$

$$\hbar\omega_c \ll \frac{[\mathcal{E}_{\text{gap}}(\mathbf{k})]^{2}}{\mathcal{E}_F}. \qquad (12.9)$$

In these inequalities the length $a$ is of the order of a lattice constant, $\mathcal{E}_{\text{gap}}(\mathbf{k})$ is the
difference between $\mathcal{E}_n(\mathbf{k})$ and the nearest energy $\mathcal{E}_{n'}(\mathbf{k})$ at the same point in k-space
but in a different band, and $\omega_c$ is the angular cyclotron frequency (see Chapter 1).

11 Although the periodic potential plays a crucial role in the semiclassical equations
(through the structure of the function $\mathcal{E}_n(\mathbf{k})$ determined by that potential), that role cannot be that of a
position-dependent force. To probe a force with the periodicity of the lattice one would have to localize
an electron within a single primitive cell. Such a localization is inconsistent with the structure of the wave
packets underlying the semiclassical model, which are spread over many lattice sites.

12 This requirement is violated every time the free electron wave vector crosses a Bragg plane, since
the electron then jumps from the lower free electron band to the higher one.

13 A rough justification is given in Appendix J.

Condition (12.8) is never close to being violated in a metal. Even with a current
density as large as $10^{2}$ amp/cm$^{2}$ and a resistivity as large as 100 $\mu$ohm-cm, the field
in the metal will only be $E = \rho j = 10^{-2}$ volt/cm. Hence for $a$ on the order of $10^{-8}$
cm, $eEa$ is of order $10^{-10}$ eV. Since $\mathcal{E}_F$ is of the order of an electron volt or more,
$\mathcal{E}_{\text{gap}}(\mathbf{k})$ must be as small as $10^{-5}$ eV before condition (12.8) is violated. In practice,
gaps this small are never encountered except near points where two bands become
degenerate, and then only in an exceedingly small region of k-space about such
points. Typical small band gaps are of the order of $10^{-1}$ eV, and therefore (12.8)
is satisfied with a factor of $10^{-8}$ to spare. The condition is of practical concern only
in insulators and in homogeneous semiconductors, where it is possible to establish
immense electric fields; when the condition is violated electrons can make an
interband transition driven by the field, a phenomenon known as electric breakdown.
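The orders of magnitude quoted for condition (12.8) follow from a few lines of arithmetic, which the sketch below simply repeats. The variable names are ours; the numbers are the round values used in the text.

```python
# Order-of-magnitude check of condition (12.8), e E a << E_gap^2 / E_F,
# using the round numbers quoted in the text.
current_density = 1e2      # amp/cm^2
resistivity = 100e-6       # ohm-cm (100 microohm-cm)
lattice_constant = 1e-8    # cm
fermi_energy = 1.0         # eV (order of magnitude)
typical_gap = 1e-1         # eV (typical small band gap)

field = resistivity * current_density   # volt/cm, from E = rho * j
e_E_a = field * lattice_constant        # eV, since e times 1 volt is 1 eV

# Gap at which the two sides of (12.8) would be equal.
critical_gap = (e_E_a * fermi_energy) ** 0.5

# Ratio of the left side of (12.8) to the right side for a typical gap.
margin = e_E_a / (typical_gap ** 2 / fermi_energy)

print(f"E = {field:.0e} volt/cm, eEa = {e_E_a:.0e} eV")
print(f"critical gap = {critical_gap:.0e} eV, margin = {margin:.0e}")
```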

The condition (12.9) on the magnetic field strength is not as difficult to violate.
The energy $\hbar\omega_c$ is of order $10^{-4}$ eV in a field of $10^{4}$ gauss, in which case (12.9) fails
for gaps as large as $10^{-2}$ eV. Although this is still a small energy gap, such gaps
are not at all uncommon, especially when the gap is entirely due to a degeneracy
split by spin-orbit coupling. When condition (12.9) fails to hold, electrons may not
follow the orbits determined by the semiclassical equations of motion (12.6), a
phenomenon known as magnetic breakthrough (or "breakdown"). The possibility of
magnetic breakthrough must always be kept in mind in interpreting electronic
properties in very strong magnetic fields.
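The estimate of $\hbar\omega_c$ can be reproduced from the fundamental constants. The sketch below works in SI units, where $\omega_c = eB/m$ (the text uses Gaussian units, in which $\omega_c = eH/mc$), and assumes the free electron mass; both choices, and the function name, are ours.

```python
# Physical constants in SI units (rounded CODATA values).
HBAR = 1.054571817e-34         # J s
E_CHARGE = 1.602176634e-19     # C
M_ELECTRON = 9.1093837015e-31  # kg

def cyclotron_energy_eV(b_tesla, mass=M_ELECTRON):
    """hbar * omega_c in eV, with omega_c = e B / m (free electron mass assumed)."""
    omega_c = E_CHARGE * b_tesla / mass
    return HBAR * omega_c / E_CHARGE

field_tesla = 1e4 * 1e-4   # 10^4 gauss expressed in tesla
hw_c = cyclotron_energy_eV(field_tesla)

fermi_energy = 1.0         # eV (order of magnitude)
# Gap at which the two sides of (12.9) would be equal.
critical_gap = (hw_c * fermi_energy) ** 0.5

print(f"hbar*omega_c = {hw_c:.2e} eV, critical gap = {critical_gap:.2e} eV")
```

A band mass smaller than the free electron mass would raise $\hbar\omega_c$ and make the condition easier to violate.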

In addition to the conditions (12.8) and (12.9) on the amplitude of the applied
fields, one must add a low-frequency condition on the fields,

$$\hbar\omega \ll \mathcal{E}_{\text{gap}}, \qquad (12.10)$$

or else a single photon could supply enough energy to produce an interband
transition. There is also the condition on the wavelength of the applied fields,

$$\lambda \gg a, \qquad (12.11)$$

that is necessary if wave packets can be meaningfully introduced at all.14

Basis  for  the  Equations  of  Motion 

As discussed above, Eq. (12.6a) is simply the statement that the velocity of a
semiclassical electron is the group velocity of the underlying wave packet. Equation
(12.6b) is considerably more difficult to justify. It is highly plausible in the presence
of a static electric field as the simplest way to guarantee conservation of energy, for
if the field is given by $\mathbf{E} = -\nabla\phi$, then we should expect each wave packet to move
so that the energy

$$\mathcal{E}_n(\mathbf{k}(t)) - e\phi(\mathbf{r}(t)) \qquad (12.12)$$

remains constant. The time derivative of this energy is

$$\frac{\partial\mathcal{E}_n}{\partial\mathbf{k}}\cdot\dot{\mathbf{k}} - e\nabla\phi\cdot\dot{\mathbf{r}}, \qquad (12.13)$$


14 It is also sometimes necessary to take into account further quantum effects due to the possibility
of closed electronic k-space orbits in a magnetic field. This can be handled by an ingenious extension of
the semiclassical model, and is therefore not a limitation in the sense of the restrictions described above.
The problem arises in the theory of the de Haas-van Alphen effect and related phenomena, and is described
in Chapter 14.




which Eq. (12.6a) permits us to write as

$$\mathbf{v}_n(\mathbf{k})\cdot[\hbar\dot{\mathbf{k}} - e\nabla\phi]. \qquad (12.14)$$

This will vanish if

$$\hbar\dot{\mathbf{k}} = e\nabla\phi = -e\mathbf{E}, \qquad (12.15)$$

which is Eq. (12.6b) in the absence of a magnetic field. However, (12.15) is not necessary
for energy to be conserved, since (12.14) vanishes if any term perpendicular to $\mathbf{v}_n(\mathbf{k})$
is added to (12.15). To justify with rigor that the only additional term should be
$[\mathbf{v}_n(\mathbf{k})/c] \times \mathbf{H}$, and that the resulting equation should hold for time-dependent fields
as well, is a most difficult matter, which we shall not pursue further. The dissatisfied
reader is referred to Appendix H for a further way of rendering the semiclassical
equations more plausible. There it is shown that they can be written in a very compact
Hamiltonian form. To find a really compelling set of arguments, however, it is
necessary to delve rather deeply into the (still growing) literature on the subject.15
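The energy argument can be checked numerically: with a static uniform electric field and a perpendicular magnetic field, the quantity (12.12) should stay constant along a semiclassical trajectory, because the magnetic term in (12.6b) is perpendicular to $\mathbf{v}_n(\mathbf{k})$. The sketch below does this for a two-dimensional square-lattice tight-binding band. The band, the units ($\hbar = e = c = 1$), the field values, the initial state, and the Runge-Kutta integrator are all assumptions made for illustration.

```python
import numpy as np

HBAR = E_CHARGE = C_LIGHT = 1.0   # illustrative units, not physical values
T_HOP, A_LAT = 1.0, 1.0           # assumed tight-binding parameters

def band_energy(k):
    """Illustrative 2D band, E(k) = -2 t [cos(kx a) + cos(ky a)]."""
    return -2.0 * T_HOP * (np.cos(k[0] * A_LAT) + np.cos(k[1] * A_LAT))

def band_velocity(k):
    """Eq. (12.6a) for the band above."""
    return 2.0 * T_HOP * A_LAT * np.sin(k * A_LAT) / HBAR

def rhs(state, e_field, h_z):
    """Eqs. (12.6) for state = (x, y, kx, ky), with H along z."""
    v = band_velocity(state[2:])
    v_cross_h = np.array([v[1] * h_z, -v[0] * h_z])
    k_dot = -E_CHARGE * (e_field + v_cross_h / C_LIGHT) / HBAR
    return np.concatenate([v, k_dot])

def total_energy(state, e_field):
    """The quantity (12.12) for a uniform static field, phi = -E . r."""
    phi = -np.dot(e_field, state[:2])
    return band_energy(state[2:]) - E_CHARGE * phi

def energy_drift(e_field, h_z, state0, dt=0.01, n_steps=5000):
    """Largest change in (12.12) along a Runge-Kutta trajectory."""
    state = np.array(state0, dtype=float)
    e0 = total_energy(state, e_field)
    worst = 0.0
    for _ in range(n_steps):
        k1 = rhs(state, e_field, h_z)
        k2 = rhs(state + 0.5 * dt * k1, e_field, h_z)
        k3 = rhs(state + 0.5 * dt * k2, e_field, h_z)
        k4 = rhs(state + dt * k3, e_field, h_z)
        state = state + (dt / 6.0) * (k1 + 2 * k2 + 2 * k3 + k4)
        worst = max(worst, abs(total_energy(state, e_field) - e0))
    return worst

e_field = np.array([0.02, 0.0])
drift = energy_drift(e_field, h_z=0.3, state0=[0.0, 0.0, 0.4, -0.7])
print(drift)
```

The same check passes for any added force that is perpendicular to the velocity, which is why energy conservation alone cannot single out the magnetic term.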

CONSEQUENCES  OF  THE  SEMICLASSICAL  EQUATIONS  OF 
MOTION 

The  rest  of  this  chapter  surveys  some  of  the  fundamental  direct  consequences  of  the 
semiclassical  equations  of  motion.  In  Chapter  13  we  shall  turn  to  a  more  systematic 
way  of  extracting  theories  of  conduction. 

In  most  of  the  discussions  that  follow  we  shall  consider  a  single  band  at  a  time, 
and  shall  therefore  drop  reference  to  the  band  index  except  when  explicitly  comparing 
the  properties  of  two  or  more  bands.  For  simplicity  we  shall  also  take  the  electronic 
equilibrium  distribution  function  to  be  that  appropriate  to  zero  temperature.  In 
metals finite temperature effects will have negligible influence on the properties discussed
below. Thermoelectric effects in metals will be discussed in Chapter 13, and
semiconductors will be treated in Chapter 28.

The  spirit  of  the  analysis  that  follows  is  quite  similar  to  that  in  which  we  discussed 
transport  properties  in  Chapters  1  and  2:  We  shall  describe  collisions  in  terms  of  a 
simple  relaxation-time  approximation,  and  focus  most  of  our  attention  on  the  motion 
of  electrons  between  collisions  as  determined  (in  contrast  to  Chapters  1  and  2)  by 
the semiclassical equations of motion (12.6).

Filled  Bands  Are  Inert 

A filled band is one in which all the energies lie below16 $\mathcal{E}_F$. Electrons in a filled band
with wave vectors in a region of k-space of volume $d\mathbf{k}$ contribute $d\mathbf{k}/4\pi^3$ to the total
electronic density (Eq. (12.7)). Thus the number of such electrons in a region of
position space of volume $d\mathbf{r}$ will be $d\mathbf{r}\,d\mathbf{k}/4\pi^3$. One can therefore characterize a filled
band semiclassically by the fact that the density of electrons in a six-dimensional
rk-space (called phase space, in analogy to the rp-space of ordinary classical mechanics)
is $1/4\pi^3$.


15  See,  for  example,  the  references  given  in  footnote  1. 

16 More generally, the energies should be so far below the chemical potential $\mu$ compared with $k_BT$
that the Fermi function is indistinguishable from unity throughout the band.

The semiclassical equations (12.6) imply that a filled band remains a filled band
at all times, even in the presence of space- and time-dependent electric and magnetic
fields. This is a direct consequence of the semiclassical analogue of Liouville’s theorem,
which asserts the following:17

Given any region of six-dimensional phase space $\Omega_t$, consider the point r', k' into
which each point r, k in $\Omega_t$ is taken by the semiclassical equations of motion between
times18 t and t'. The set of all such points r', k' constitutes a new region $\Omega_{t'}$, whose
volume is the same as the volume of $\Omega_t$ (see Figure 12.2); i.e., phase space volumes
are conserved by the semiclassical equations of motion.


Figure  12.2 

Semiclassical trajectories in rk-space. The region $\Omega_{t'}$ contains
at time t' just those points that the semiclassical motion has
carried from the region $\Omega_t$ at time t. Liouville’s theorem asserts
that $\Omega_t$ and $\Omega_{t'}$ have the same volume. (The illustration is for
a two-dimensional rk-space lying in the plane of the page, i.e.,
for semiclassical motion in one dimension.)


This immediately implies that if the phase space density is $1/4\pi^3$ at time zero, it
must remain so at all times, for consider any region $\Omega$ at time t. The electrons in $\Omega$
at time t are just those that were in some other region $\Omega_0$ at time zero where, according
to Liouville’s theorem, $\Omega_0$ has the same volume as $\Omega$. Since the two regions also have
the same number of electrons, they have the same phase space density of electrons.
Because that density was $1/4\pi^3$, independent of the region at time 0, it must also be
$1/4\pi^3$, independent of the region at time t. Thus semiclassical motion between
collisions cannot alter the configuration of a filled band, even in the presence of
space- and time-dependent external fields.19


17 See Appendix H for a proof that the theorem applies to semiclassical motion. From a quantum
mechanical point of view the inertness of filled bands is a simple consequence of the Pauli exclusion
principle: The “phase space density” cannot increase if every level contains the maximum number of
electrons allowed by the Pauli principle; furthermore, if interband transitions are prohibited, neither can
it decrease, for the number of electrons in a level can only be reduced if there are some incompletely filled
levels in the band for those electrons to move into. For logical consistency, however, it is necessary to
demonstrate that this conclusion also follows directly from the semiclassical equations of motion, without
reinvoking the underlying quantum mechanical theory that the model is meant to replace.

18 The time t' need not be greater than t; i.e., the regions from which $\Omega_t$ evolved have the same volume
as $\Omega_t$, as well as the regions into which $\Omega_t$ will evolve.

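However, a band with a constant phase space density $1/4\pi^3$ cannot contribute to
an electric or thermal current. To see this, note that an infinitesimal phase space
volume element $d\mathbf{k}$ about the point k will contribute $d\mathbf{k}/4\pi^3$ electrons per unit
volume, all with velocity $\mathbf{v}(\mathbf{k}) = (1/\hbar)\,\partial\mathcal{E}(\mathbf{k})/\partial\mathbf{k}$, to the current. Summing this over all
k in the Brillouin zone, we find that the total contribution to the electric and energy
current densities from a filled band is

$$\mathbf{j} = (-e)\int \frac{d\mathbf{k}}{4\pi^3}\,\frac{1}{\hbar}\frac{\partial\mathcal{E}}{\partial\mathbf{k}},$$

$$\mathbf{j}_{\mathcal{E}} = \int \frac{d\mathbf{k}}{4\pi^3}\,\mathcal{E}(\mathbf{k})\,\frac{1}{\hbar}\frac{\partial\mathcal{E}}{\partial\mathbf{k}} = \frac{1}{2}\int \frac{d\mathbf{k}}{4\pi^3}\,\frac{1}{\hbar}\frac{\partial}{\partial\mathbf{k}}\bigl(\mathcal{E}(\mathbf{k})\bigr)^2. \tag{12.16}$$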


But  both  of  these  vanish  as  a  consequence  of  the  theorem20  that  the  integral  over 
any  primitive  cell  of  the  gradient  of  a  periodic  function  must  vanish. 
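
The vanishing of both integrals in (12.16) can be checked numerically in a simple case. The sketch below is only an illustration under stated assumptions: it uses a hypothetical one-dimensional tight-binding band $\mathcal{E}(k) = -2\gamma\cos ka$ (not a band taken from the text), sets $\hbar = 1$, and drops the constant prefactors, which do not affect whether the integrals vanish.

```python
import numpy as np

# Assumed model: 1D tight-binding band E(k) = -2*gamma*cos(k*a), with hbar = 1.
a = 1.0        # lattice constant (illustrative value)
gamma = 1.0    # hopping energy (illustrative value)

def energy(k):
    return -2.0 * gamma * np.cos(k * a)

def velocity(k):
    # v(k) = (1/hbar) dE/dk, with hbar = 1
    return 2.0 * gamma * a * np.sin(k * a)

# Uniform midpoint grid over one primitive cell of k-space, [-pi/a, pi/a).
n = 2000
dk = 2.0 * np.pi / (a * n)
k = -np.pi / a + (np.arange(n) + 0.5) * dk

# One-dimensional analogues of the two integrals in (12.16), without prefactors.
charge_current = np.sum(velocity(k)) * dk
energy_current = np.sum(energy(k) * velocity(k)) * dk

print(charge_current, energy_current)
```

Both sums are zero to rounding error, as expected for the integral over a primitive cell of the derivative of a periodic function.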

Thus  only  partially  filled  bands  need  be  considered  in  calculating  the  electronic 
properties  of  a  solid.  This  explains  how  that  mysterious  parameter  of  free  electron 
theory,  the  number  of  conduction  electrons,  is  to  be  arrived  at:  Conduction  is  due 
only to those electrons that are found in partially filled bands. The reason Drude’s
assignment  to  each  atom  of  a  number  of  conduction  electrons  equal  to  its  valence  is 
often  successful  is  that  in  many  cases  those  bands  derived  from  the  atomic  valence 
electrons  are  the  only  ones  that  are  partially  filled. 

Evidently  a  solid  in  which  all  bands  are  completely  filled  or  empty  will  be  an 
electrical  and  (at  least  as  far  as  electronic  transport  of  heat  is  concerned)  thermal 
insulator.  Since  the  number  of  levels  in  each  band  is  just  twice  the  number  of  primitive 
cells  in  the  crystal,  all  bands  can  be  filled  or  empty  only  in  solids  with  an  even  number 
of  electrons  per  primitive  cell.  Note  that  the  converse  is  not  true:  Solids  with  an  even 
number  of  electrons  per  primitive  cell  may  be  (and  frequently  are)  conductors,  since 
the  overlap  of  band  energies  can  lead  to  a  ground  state  in  which  several  bands  are 
partially filled (see, for example, Figure 12.3). We have thus derived a necessary, but
by  no  means  sufficient,  condition  for  a  substance  to  be  an  insulator. 

It  is  a  reassuring  exercise  to  go  through  the  periodic  table  looking  up  the  crystal 
structure  of  all  insulating  solid  elements.  They  will  all  be  found  to  have  either  even 
valence  or  (e.g.,  the  halogens)  a  crystal  structure  that  can  be  characterized  as  a  lattice 
with  a  basis  containing  an  even  number  of  atoms,  thereby  confirming  this  very 
general  rule. 


19  Collisions  cannot  alter  this  stability  of  filled  bands  either,  provided  that  we  retain  our  basic 
assumption  (Chapter  1,  page  6  and  Chapter  13,  page  245)  that  whatever  else  they  do,  the  collisions  cannot 
alter  the  distribution  of  electrons  when  it  has  its  thermal  equilibrium  form.  For  a  distribution  function 
with the constant value $1/4\pi^3$ is precisely the zero temperature equilibrium form for any band all of whose
energies  lie  below  the  Fermi  energy. 

20 The theorem is proved in Appendix I. The periodic functions in this case are $\mathcal{E}(\mathbf{k})$ in the case of $\mathbf{j}$,
and $\mathcal{E}(\mathbf{k})^2$ in the case of $\mathbf{j}_{\mathcal{E}}$.

Figure 12.3

A two-dimensional illustration of why a divalent solid
can be a conductor. A free electron circle, whose area
equals that of the first Brillouin zone (I) of a square
Bravais lattice, extends into the second zone (II), thus
producing two partially filled bands. Under the influence
of a sufficiently strong periodic potential the
pockets of first-zone holes and second-zone electrons
might shrink to zero. Quite generally, however, a weak
periodic potential will always lead to this kind of
overlap (except in one dimension).


Semiclassical  Motion  in  an  Applied  DC  Electric  Field 

In a uniform static electric field the semiclassical equation of motion for k (Eq. (12.6))
has the general solution

$$\mathbf{k}(t) = \mathbf{k}(0) - \frac{e\mathbf{E}t}{\hbar}. \tag{12.17}$$

Thus  in  a  time  t  every  electron  changes  its  wave  vector  by  the  same  amount.  This  is 
consistent  with  our  observation  that  applied  fields  can  have  no  effect  on  a  filled  band 
in  the  semiclassical  model,  for  a  uniform  shift  in  the  wave  vector  of  every  occupied 
level  does  not  alter  the  phase  space  density  of  electrons  when  that  density  is  constant, 
as  it  is  for  a  filled  band.  However,  it  is  somewhat  jarring  to  one’s  classical  intuition 
that  by  shifting  the  wave  vector  of  every  electron  by  the  same  amount  we  nevertheless 
fail  to  bring  about  a  current-carrying  configuration. 

To  understand  this,  one  must  remember  that  the  current  carried  by  an  electron  is 
proportional  to  its  velocity,  which  is  not  proportional  to  k  in  the  semiclassical  model. 
The  velocity  of  an  electron  at  time  t  will  be 

$$\mathbf{v}(\mathbf{k}(t)) = \mathbf{v}\!\left(\mathbf{k}(0) - \frac{e\mathbf{E}t}{\hbar}\right). \tag{12.18}$$

Since v(k) is periodic in the reciprocal lattice, the velocity (12.18) is a bounded function
of time and, when the field E is parallel to a reciprocal lattice vector, oscillatory.
This is in striking contrast to the free electron case, where v is proportional to k and
grows linearly in time.
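
A short numerical sketch makes the bounded, oscillatory character of (12.18) concrete. It assumes a hypothetical one-dimensional tight-binding band $\mathcal{E}(k) = -2\gamma\cos ka$ and units with $\hbar = e = 1$; the field strength and the name `period` are illustrative choices, not quantities from the text. In this model the wave vector crosses one zone width $2\pi/a$ in a time $2\pi\hbar/eEa$, which follows from (12.17).

```python
import numpy as np

# Assumed units and model parameters (illustrative, not from the text).
hbar = 1.0
e = 1.0
a = 1.0          # lattice constant
gamma = 1.0      # hopping energy of the assumed band E(k) = -2*gamma*cos(k*a)
E_field = 0.1    # uniform static field

def velocity(k):
    # v(k) = (1/hbar) dE/dk for the assumed tight-binding band
    return (2.0 * gamma * a / hbar) * np.sin(k * a)

def k_of_t(k0, t):
    # Eq. (12.17) in one dimension
    return k0 - e * E_field * t / hbar

# Time for k to traverse one full zone width 2*pi/a.
period = 2.0 * np.pi * hbar / (e * E_field * a)

t = np.linspace(0.0, 3.0 * period, 3001)
v = velocity(k_of_t(0.0, t))
v_max = 2.0 * gamma * a / hbar

print(np.max(np.abs(v)), v_max)
```

The velocity never exceeds `v_max` and repeats with the period computed above, in contrast to the linear growth of the free electron velocity.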

The k dependence (and, to within a scale factor, the t dependence) of the velocity
is illustrated in Figure 12.4, where both $\mathcal{E}(k)$ and $v(k)$ are plotted in one dimension.
Although the velocity is linear in k near the band minimum, it reaches a maximum
as the zone boundary is approached, and then drops back down, going to zero at the
zone edge. In the region between the maximum of v and the zone edge the velocity
actually decreases with increasing k, so that the acceleration of the electron is opposite
to the externally applied electric force!

This extraordinary behavior is a consequence of the additional force exerted by
the periodic potential, which, though no longer explicit in the semiclassical model,
lies buried in it (through the functional form of $\mathcal{E}(\mathbf{k})$). As an electron approaches a
Bragg plane, the external electric field moves it toward levels in which it is increasingly
likely to be Bragg-reflected back in the opposite direction.21

Figure 12.4

$\mathcal{E}(k)$ and $v(k)$ vs. k (or vs. time, via Eq. (12.17)) in one
dimension (or three dimensions, in a direction parallel
to a reciprocal lattice vector that determines one of the
first-zone faces).

Thus if an electron could travel between collisions a distance in k-space larger
than the dimensions of the zone, it would be possible for a DC field to induce an
alternating current. Collisions, however, quite emphatically exclude this possibility.
For reasonable values of the field and relaxation time the change in wave vector
between two collisions, given by (12.17), is a minute fraction of the zone dimensions.22
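
The orders of magnitude quoted in footnote 22 are easy to reproduce. The following sketch evaluates $eE\tau/\hbar$ in SI units and converts to cm⁻¹; the field and relaxation time are the values of footnote 22, while the lattice constant of 1 Å is an assumed round number chosen only to set the scale of the zone.

```python
# Physical constants in SI units.
e = 1.602176634e-19       # elementary charge, C
hbar = 1.054571817e-34    # reduced Planck constant, J s

E_field = 1.0             # V/m, i.e. 1e-2 volt/cm as in footnote 22
tau = 1.0e-14             # s, relaxation time as in footnote 22
a = 1.0e-10               # m, assumed lattice constant of about 1 angstrom

# Change in wave vector between collisions, from Eq. (12.17) with t = tau.
delta_k = e * E_field * tau / hbar      # in m^-1
delta_k_per_cm = delta_k / 100.0        # in cm^-1

# Typical zone dimension 1/a, in cm^-1.
zone_per_cm = (1.0 / a) / 100.0

fraction = delta_k_per_cm / zone_per_cm
print(delta_k_per_cm, zone_per_cm, fraction)
```

With these inputs the shift in wave vector is of order 10⁻¹ cm⁻¹ against a zone dimension of order 10⁸ cm⁻¹, so the electron traverses roughly a billionth of the zone between collisions.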

But although the hypothetical effects of periodic motion in a DC field are inaccessible
to observation, effects dominated by electrons that are close enough to the
zone boundary to be decelerated by an applied field are readily observable through
the curious behavior of “holes.”

Holes 

One  of  the  most  impressive  achievements  of  the  semiclassical  model  is  its  explanation 
for  phenomena  that  free  electron  theory  can  account  for  only  if  the  carriers  have  a 
positive  charge.  The  most  notable  of  these  is  the  anomalous  sign  of  the  Hall  coefficient 
in some metals (see page 58). There are three important points to grasp in understanding
how the electrons in a band can contribute to currents in a manner suggestive
of  positively  charged  carriers: 

1. Since electrons in a volume element $d\mathbf{k}$ about k contribute $-e\mathbf{v}(\mathbf{k})\,d\mathbf{k}/4\pi^3$ to
the current density, the contribution of all the electrons in a given band to the current
density will be

$$\mathbf{j} = (-e)\int_{\text{occupied}} \frac{d\mathbf{k}}{4\pi^3}\,\mathbf{v}(\mathbf{k}), \tag{12.19}$$

where the integral is over all occupied levels in the band.23 By exploiting the fact
that a completely filled band carries no current,

$$0 = \int_{\text{zone}} \frac{d\mathbf{k}}{4\pi^3}\,\mathbf{v}(\mathbf{k}) = \int_{\text{occupied}} \frac{d\mathbf{k}}{4\pi^3}\,\mathbf{v}(\mathbf{k}) + \int_{\text{unoccupied}} \frac{d\mathbf{k}}{4\pi^3}\,\mathbf{v}(\mathbf{k}), \tag{12.20}$$

we can equally well write (12.19) in the form:

$$\mathbf{j} = (+e)\int_{\text{unoccupied}} \frac{d\mathbf{k}}{4\pi^3}\,\mathbf{v}(\mathbf{k}). \tag{12.21}$$


21 For example, it is just in the vicinity of Bragg planes that plane-wave levels with different wave
vectors are most strongly mixed in the nearly free electron approximation (Chapter 9).

22 With an electric field of order $10^{-2}$ volt/cm and a relaxation time of order $10^{-14}$ sec, $eE\tau/\hbar$ is
of order $10^{-1}$ cm$^{-1}$. Zone dimensions are of order $1/a \sim 10^{8}$ cm$^{-1}$.

Thus  the  current  produced  by  occupying  with  electrons  a  specified  set  of  levels  is 
precisely  the  same  as  the  current  that  would  be  produced  if  (a)  the  specified  levels  were 
unoccupied and (b) all other levels in the band were occupied but with particles of charge
+e (opposite to the electronic charge).

Thus,  even  though  the  only  charge  carriers  are  electrons,  we  may,  whenever  it  is 
convenient,  consider  the  current  to  be  carried  entirely  by  fictitious  particles  of  positive 
charge  that  fill  all  those  levels  in  the  band  that  are  unoccupied  by  electrons.24  The 
fictitious  particles  are  called  holes. 
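
The equivalence of the electron and hole pictures can be checked numerically for a nonequilibrium configuration. The sketch below assumes a hypothetical one-dimensional tight-binding band with $\hbar = e = 1$ and a rigidly shifted occupied region; the names `k_F` and `k_shift` and all numerical values are illustrative, and the density-of-levels prefactor is dropped because it is common to both pictures.

```python
import numpy as np

# Assumed model and units (illustrative): E(k) = -2*gamma*cos(k*a), hbar = e = 1.
a = 1.0
gamma = 1.0
e = 1.0

def velocity(k):
    return 2.0 * gamma * a * np.sin(k * a)

# Midpoint grid over the first zone.
n = 4000
dk = 2.0 * np.pi / (a * n)
k = -np.pi / a + (np.arange(n) + 0.5) * dk

# A current-carrying configuration: occupied levels |k - k_shift| < k_F.
k_F = 0.6 * np.pi / a
k_shift = 0.05 / a
occupied = np.abs(k - k_shift) < k_F

# Electron picture, analogue of (12.19): charge -e on occupied levels.
j_electrons = (-e) * np.sum(velocity(k[occupied])) * dk
# Hole picture, analogue of (12.21): charge +e on unoccupied levels.
j_holes = (+e) * np.sum(velocity(k[~occupied])) * dk

print(j_electrons, j_holes)
```

The two numbers agree to rounding error while being clearly nonzero, which is the content of (12.20): the sum over the whole zone vanishes, so either subset determines the current.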

When  one  chooses  to  regard  the  current  as  being  carried  by  positive  holes  rather 
than  negative  electrons,  the  electrons  are  best  regarded  as  merely  the  absence  of  holes; 
i.e.,  levels  occupied  by  electrons  are  to  be  considered  unoccupied  by  holes.  It  must 
be  emphasized  that  pictures  cannot  be  mixed  within  a  given  band.  If  one  wishes  to 
regard electrons as carrying the current, then the unoccupied levels make no contribution;
if one wishes to regard the holes as carrying the current, then the electrons
make  no  contribution.  One  may,  however,  regard  some  bands  in  the  electron  picture 
and  other  bands  in  the  hole  picture,  as  suits  one’s  convenience. 

To  complete  the  theory  of  holes  we  must  consider  the  manner  in  which  the  set  of 
unoccupied  levels  changes  under  the  influence  of  applied  fields: 

2.  The  unoccupied  levels  in  a  band  evolve  in  time  under  the  influence  of  applied 
fields  precisely  as  they  would  if  they  were  occupied  by  real  electrons  (of  charge  —e). 

This  is  because,  given  the  values  of  k  and  r  at  t  =  0,  the  semiclassical  equations, 
being  six  first-order  equations  in  six  variables,  uniquely  determine  k  and  r  at  all 
subsequent  (and  all  prior)  times,  just  as  in  ordinary  classical  mechanics  the  position 
and  momentum  of  a  particle  at  any  instant  determine  its  entire  orbit  in  the  presence 
of specified external fields. In Figure 12.5 we indicate schematically the orbits determined
by the semiclassical equations, as lines in a seven-dimensional rkt-space.
Because  any  point  on  an  orbit  uniquely  specifies  the  entire  orbit,  two  distinct  orbits 
can  have  no  points  in  common.  We  can  therefore  separate  the  orbits  into  occupied 
and  unoccupied  orbits,  according  to  whether  they  contain  occupied  or  unoccupied 
points  at  t  =  0.  At  any  time  after  t  =  0  the  unoccupied  levels  will  lie  on  unoccupied 
orbits, and the occupied levels on occupied orbits. Thus the evolution of both
occupied and unoccupied levels is completely determined by the structure of the
orbits. But this structure depends only on the form of the semiclassical equations
(12.6), and not on whether an electron happens actually to be following a particular
orbit.


23 This need not be the set of levels with energies less than $\mathcal{E}_F$, since we are interested in the nonequilibrium
configurations brought about by applied fields.

24 Note that this contains as a special case the fact that a filled band can carry no current, for a
filled band has no unoccupied levels and therefore no fictitious positive carriers.

Figure 12.5

Schematic illustration of the time evolution
of orbits in semiclassical phase space (here
r and k are each indicated by a single coordinate).
The occupied region at time t is
determined by the orbits that lie in the
occupied region at time t = 0.

3. It therefore suffices to examine how electrons respond to applied fields to
learn how holes do. The motion of an electron is determined by the semiclassical
equation:

$$\hbar\dot{\mathbf{k}} = (-e)\left(\mathbf{E} + \frac{1}{c}\mathbf{v}\times\mathbf{H}\right). \tag{12.22}$$

Whether or not the orbit of the electron resembles that of a free particle of negative
charge depends on whether the acceleration, $d\mathbf{v}/dt$, is or is not parallel to $\dot{\mathbf{k}}$. Should
the acceleration be opposite to $\dot{\mathbf{k}}$, then the electron would respond more like a
positively charged free particle. As it happens, it is often the case that $d\mathbf{v}(\mathbf{k})/dt$ is
indeed directed opposite to $\dot{\mathbf{k}}$ when k is the wave vector of an unoccupied level,
for the following reason:

In  equilibrium  and  in  configurations  that  do  not  deviate  substantially  from 
equilibrium  (as  is  generally  the  case  for  nonequilibrium  electronic  configurations  of 
interest)  the  unoccupied  levels  often  lie  near  the  top  of  the  band.  If  the  band 
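energy $\mathcal{E}(\mathbf{k})$ has its maximum value at $\mathbf{k}_0$, then if k is sufficiently close to $\mathbf{k}_0$, we may
expand $\mathcal{E}(\mathbf{k})$ about $\mathbf{k}_0$. The linear term in $\mathbf{k} - \mathbf{k}_0$ vanishes at a maximum, and if we
assume, for the moment, that $\mathbf{k}_0$ is a point of sufficiently high symmetry (e.g., cubic),
then the quadratic term will be proportional to $(\mathbf{k} - \mathbf{k}_0)^2$. Thus

$$\mathcal{E}(\mathbf{k}) \approx \mathcal{E}(\mathbf{k}_0) - A(\mathbf{k} - \mathbf{k}_0)^2, \tag{12.23}$$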


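where A is positive (since $\mathcal{E}$ is maximum at $\mathbf{k}_0$). It is conventional to define a positive
quantity $m^*$ with dimensions of mass by:

$$\frac{\hbar^2}{2m^*} = A. \tag{12.24}$$

For levels with wave vectors near $\mathbf{k}_0$,

$$\mathbf{v}(\mathbf{k}) = \frac{1}{\hbar}\frac{\partial\mathcal{E}}{\partial\mathbf{k}} \approx -\frac{\hbar(\mathbf{k} - \mathbf{k}_0)}{m^*}, \tag{12.25}$$

and hence

$$\mathbf{a} = \frac{d}{dt}\mathbf{v}(\mathbf{k}) = -\frac{\hbar}{m^*}\dot{\mathbf{k}}, \tag{12.26}$$

i.e., the acceleration is opposite to $\dot{\mathbf{k}}$.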

Substituting the acceleration-wave vector relation (12.26) into the equation of
motion (12.22) we find that as long as an electron’s orbit is confined to levels close
enough to the band maximum for the expansion (12.23) to be accurate, the (negatively
charged) electron responds to driving fields as if it had a negative mass $-m^*$. By
simply  changing  the  sign  of  both  sides,  we  can  equally  well  (and  far  more  intuitively) 
regard  Eq.  (12.22)  as  describing  the  motion  of  a  positively  charged  particle  with  a 
positive  mass  m*. 

Since the response of a hole is the same as the response an electron would have
if the electron were in the unoccupied level (point 2 above), this completes the demonstration
that holes behave in all respects like ordinary positively charged particles.

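The requirement that the unoccupied levels lie sufficiently close to a highly symmetrical
band maximum can be relaxed to a considerable degree.25 We might expect
dynamical behavior suggestive of positively or negatively charged particles, according
to whether the angle between $\dot{\mathbf{k}}$ and the acceleration is greater than or less than 90°
($\dot{\mathbf{k}}\cdot\mathbf{a}$ negative or positive). Since

$$\dot{\mathbf{k}}\cdot\mathbf{a} = \dot{\mathbf{k}}\cdot\frac{d\mathbf{v}}{dt} = \dot{\mathbf{k}}\cdot\frac{d}{dt}\,\frac{1}{\hbar}\frac{\partial\mathcal{E}}{\partial\mathbf{k}} = \frac{1}{\hbar}\sum_{ij}\dot{k}_i\,\frac{\partial^2\mathcal{E}}{\partial k_i\,\partial k_j}\,\dot{k}_j, \tag{12.27}$$

a sufficient condition that $\dot{\mathbf{k}}\cdot\mathbf{a}$ be negative is that

$$\sum_{ij}\Delta_i\,\frac{\partial^2\mathcal{E}}{\partial k_i\,\partial k_j}\,\Delta_j < 0 \quad \text{(for any vector } \boldsymbol{\Delta}\text{)}. \tag{12.28}$$

When k is at a local maximum of $\mathcal{E}(\mathbf{k})$, (12.28) must hold, for if the inequality were
reversed for any vector $\boldsymbol{\Delta}_0$, then the energy would increase as k moved from the
“maximum” in the direction of $\boldsymbol{\Delta}_0$. By continuity, (12.28) will therefore hold in some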
neighborhood  of  the  maximum,  and  we  can  expect  an  electron  to  respond  in  a 
manner  suggestive  of  positive  charge,  provided  that  its  wave  vector  remains  in  that 
neighborhood. 

The quantity $m^*$ determining the dynamics of holes near band maxima of high
symmetry is known as the “hole effective mass.” More generally, one defines an
“effective mass tensor”:

$$[\mathbf{M}^{-1}(\mathbf{k})]_{ij} = \pm\frac{1}{\hbar^2}\frac{\partial^2\mathcal{E}(\mathbf{k})}{\partial k_i\,\partial k_j} = \pm\frac{1}{\hbar}\frac{\partial v_i}{\partial k_j}, \tag{12.29}$$

where the sign is − or + according to whether k is near a band maximum (holes)
or minimum (electrons). Since

$$\mathbf{a} = \frac{d\mathbf{v}}{dt} = \pm\mathbf{M}^{-1}(\mathbf{k})\,\hbar\dot{\mathbf{k}}, \tag{12.30}$$

the equation of motion (12.22) takes on the form

$$\mathbf{M}(\mathbf{k})\,\mathbf{a} = \mp e\left(\mathbf{E} + \frac{1}{c}\mathbf{v}(\mathbf{k})\times\mathbf{H}\right). \tag{12.31}$$


25 If the geometry of the unoccupied region of k-space becomes too complicated, however, the picture
of holes becomes of limited usefulness.

The  mass  tensor  plays  an  important  role  in  determining  the  dynamics  of  holes 
located  about  anisotropic  maxima  (or  electrons  located  about  anisotropic  minima). 
If  the  pocket  of  holes  (or  electrons)  is  small  enough,  one  can  replace  the  mass  tensor 
by  its  value  at  the  maximum  (minimum),  leading  to  a  linear  equation  only  slightly 
more complicated than that for free particles. Such equations describe quite accurately
the dynamics of electrons and holes in semiconductors (Chapter 28).
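
As an illustration of the definition (12.29), the effective mass tensor can be evaluated by finite differences for a model band. The sketch below assumes a hypothetical two-dimensional anisotropic tight-binding band with $\hbar = 1$; the hopping energies `gx`, `gy` and the helper names `hessian` and `inverse_mass` are inventions for this example only.

```python
import numpy as np

# Assumed units and model (illustrative): hbar = 1 and
# E(k) = -2*gx*cos(kx*a) - 2*gy*cos(ky*a), an anisotropic 2D band.
hbar = 1.0
a = 1.0
gx, gy = 1.0, 0.25

def energy(k):
    kx, ky = k
    return -2.0 * gx * np.cos(kx * a) - 2.0 * gy * np.cos(ky * a)

def hessian(f, k, h=1e-4):
    """Central-difference matrix of second derivatives of f at the point k."""
    k = np.asarray(k, dtype=float)
    n = k.size
    H = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            di = np.zeros(n); di[i] = h
            dj = np.zeros(n); dj[j] = h
            H[i, j] = (f(k + di + dj) - f(k + di - dj)
                       - f(k - di + dj) + f(k - di - dj)) / (4.0 * h * h)
    return H

def inverse_mass(k, sign):
    # Eq. (12.29): sign = +1 near a band minimum, -1 near a band maximum.
    return sign * hessian(energy, k) / hbar**2

M_inv_electrons = inverse_mass([0.0, 0.0], +1)            # band minimum
M_inv_holes = inverse_mass([np.pi / a, np.pi / a], -1)    # band maximum
M_electrons = np.linalg.inv(M_inv_electrons)

print(M_inv_electrons)
print(M_inv_holes)
print(M_electrons)
```

With the sign convention of (12.29) both inverse mass tensors come out positive definite, and the anisotropy of the hoppings appears directly as unequal diagonal masses.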


Semiclassical  Motion  in  a  Uniform  Magnetic  Field 

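A wealth of important information on the electronic properties of metals and semiconductors
comes from measurements of their response to various probes in the
presence of a uniform magnetic field. In such a field the semiclassical equations are

$$\dot{\mathbf{r}} = \mathbf{v}(\mathbf{k}) = \frac{1}{\hbar}\frac{\partial\mathcal{E}(\mathbf{k})}{\partial\mathbf{k}}, \tag{12.32}$$

$$\hbar\dot{\mathbf{k}} = (-e)\,\frac{1}{c}\,\mathbf{v}(\mathbf{k})\times\mathbf{H}. \tag{12.33}$$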


It  follows  immediately  from  these  equations  that  the  component  of  k  along  the 
magnetic field and the electronic energy $\mathcal{E}(\mathbf{k})$ are both constants of the motion. These
two  conservation  laws  completely  determine  the  electronic  orbits  in  k-space:  Electrons 
move  along  curves  given  by  the  intersection  of  surfaces  of  constant  energy  with  planes 
perpendicular  to  the  magnetic  field  (Figure  12.6). 
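
The two conservation laws can be verified by integrating (12.33) directly. The following is a minimal sketch, assuming a hypothetical simple cubic tight-binding band, units with $\hbar = e = c = 1$, a field along z, and a hand-written fourth-order Runge-Kutta step; none of these choices comes from the text.

```python
import numpy as np

# Assumed model and units (illustrative): hbar = e = c = 1, lattice constant 1,
# E(k) = -2*gamma*(cos kx + cos ky + cos kz).
gamma = 1.0
H = np.array([0.0, 0.0, 1.0])   # uniform field along z

def energy(k):
    return -2.0 * gamma * np.sum(np.cos(k))

def velocity(k):
    # v = dE/dk with hbar = 1
    return 2.0 * gamma * np.sin(k)

def k_dot(k):
    # Eq. (12.33) with hbar = e = c = 1
    return -np.cross(velocity(k), H)

def rk4_step(k, dt):
    k1 = k_dot(k)
    k2 = k_dot(k + 0.5 * dt * k1)
    k3 = k_dot(k + 0.5 * dt * k2)
    k4 = k_dot(k + dt * k3)
    return k + (dt / 6.0) * (k1 + 2.0 * k2 + 2.0 * k3 + k4)

k = np.array([0.8, 0.0, 0.3])
E0, kz0 = energy(k), k[2]
dt = 2.0e-3
max_energy_drift = 0.0
for _ in range(5000):
    k = rk4_step(k, dt)
    max_energy_drift = max(max_energy_drift, abs(energy(k) - E0))

print(max_energy_drift, abs(k[2] - kz0))
```

Up to integration error the energy and the component of k along the field stay fixed, so the computed trajectory traces the intersection of a constant-energy surface with a plane of constant $k_z$.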

Figure  12.6 

Intersection  of  a  surface  of  constant  energy  with  a  plane 
perpendicular  to  the  magnetic  field.  The  arrow  indicates 
the  direction  of  motion  along  the  orbit  if  the  levels 
enclosed  by  the  surface  have  lower  energy  than  those 
outside. 


The sense in which the orbit is traversed follows from observing that v(k), being
proportional to the k-gradient of $\mathcal{E}$, points in k-space from lower to higher energies.
In conjunction with (12.33) this implies that if one were to imagine walking in k-space
along the orbit in the direction of electronic motion with the magnetic field pointing
from one’s feet to one’s head, then the high-energy side of the orbit would be on
one’s right. In particular, closed k-space orbits surrounding levels with energies
higher than those on the orbit (hole orbits) are traversed in the opposite sense to
closed orbits surrounding levels of lower energy (electron orbits). This is consistent
with, but slightly more general than, the conclusions reached in our discussion of
holes.

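The projection of the real space orbit in a plane perpendicular to the field, $\mathbf{r}_\perp =
\mathbf{r} - \hat{\mathbf{H}}(\hat{\mathbf{H}}\cdot\mathbf{r})$, can be found by taking the vector product of both sides of (12.33) with
a unit vector parallel to the field. This yields

$$\hat{\mathbf{H}}\times\hbar\dot{\mathbf{k}} = -\frac{eH}{c}\bigl(\dot{\mathbf{r}} - \hat{\mathbf{H}}(\hat{\mathbf{H}}\cdot\dot{\mathbf{r}})\bigr) = -\frac{eH}{c}\,\dot{\mathbf{r}}_\perp, \tag{12.34}$$

which integrates to

$$\mathbf{r}_\perp(t) - \mathbf{r}_\perp(0) = -\frac{\hbar c}{eH}\,\hat{\mathbf{H}}\times\bigl(\mathbf{k}(t) - \mathbf{k}(0)\bigr). \tag{12.35}$$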


Since the cross product of a unit vector with a perpendicular vector is simply the
second vector rotated through 90° about the unit vector, we conclude that the projection
of the real space orbit in a plane perpendicular to the field is simply the k-space
orbit, rotated through 90° about the field direction and scaled by the factor $\hbar c/eH$
(Figure 12.7).26
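
The construction in (12.35) amounts to a few lines of vector algebra. The sketch below applies it to an assumed elliptical k-space orbit; the function name `real_space_projection`, the ellipse, and the numerical value of `scale` (standing for $\hbar c/eH$) are all hypothetical choices made for illustration.

```python
import numpy as np

def real_space_projection(k_orbit, H_hat, scale):
    """Displacement r_perp(t) - r_perp(0) from Eq. (12.35).

    k_orbit : array of shape (N, 3), points k(t) along a k-space orbit
    H_hat   : unit vector along the magnetic field
    scale   : the factor hbar*c/(e*H), in whatever units are in use
    """
    dk = k_orbit - k_orbit[0]
    return -scale * np.cross(H_hat, dk)

# Assumed example: an elliptical orbit in the plane kz = 0, field along z.
theta = np.linspace(0.0, 2.0 * np.pi, 200)
k_orbit = np.stack([1.0 * np.cos(theta),
                    0.5 * np.sin(theta),
                    np.zeros_like(theta)], axis=1)
H_hat = np.array([0.0, 0.0, 1.0])
scale = 2.0

r_orbit = real_space_projection(k_orbit, H_hat, scale)
print(r_orbit[:3])
```

Each real space displacement is perpendicular to the corresponding k-space displacement and `scale` times as long, so the ellipse reappears in real space turned through 90° and magnified.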


Figure 12.7

The projection of the r-space orbit (b) in a plane perpendicular to the field is obtained from the
k-space orbit (a) by scaling with the factor $\hbar c/eH$ and rotation through 90° about the axis determined
by H.


26 The component of the real space orbit parallel to the field is not so simply described. Taking the
field to be along the z-axis, we have

$$z(t) = z(0) + \int_0^t v_z(t)\,dt; \qquad v_z = \frac{1}{\hbar}\frac{\partial\mathcal{E}}{\partial k_z}.$$

In contrast to the free electron case, $v_z$ need not be constant (even though $k_z$ is). Therefore the motion
of the electron along the field need not be uniform.

Note that in the free electron case ($\mathcal{E} = \hbar^2k^2/2m$) the constant energy surfaces are
spheres, whose intersections with planes are circles. A circle rotated through 90°
remains a circle, and we recover the familiar result that a free electron moves along
a circle when its motion is projected in a plane perpendicular to the field. In the
semiclassical generalization the orbits need not be circular, and in many cases
(Figure 12.8) they need not even be closed curves.


Figure  12.8 

Representation  in  the  repeated-zone  scheme  of  a  constant-energy  surface  with  simple 
cubic  symmetry,  capable  of  giving  rise  to  open  orbits  in  suitably  oriented  magnetic  fields. 
One  such  orbit  is  shown  for  a  magnetic  field  parallel  to  [101].  For  another  example 
occurring  in  real  metals,  see  page  292. 


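We can also express the rate at which the orbit is traversed in terms of certain
geometrical features of the band structure. Consider an orbit of energy $\mathcal{E}$ in a particular
plane perpendicular to the field (Figure 12.9a). The time taken to traverse that portion
of the orbit lying between $\mathbf{k}_1$ and $\mathbf{k}_2$ is

$$t_2 - t_1 = \int_{t_1}^{t_2} dt = \int_{\mathbf{k}_1}^{\mathbf{k}_2}\frac{dk}{|\dot{\mathbf{k}}|}. \tag{12.36}$$

Eliminating $\dot{\mathbf{k}}$ through Eqs. (12.32) and (12.33) we have

$$t_2 - t_1 = \frac{\hbar^2 c}{eH}\int_{\mathbf{k}_1}^{\mathbf{k}_2}\frac{dk}{|(\partial\mathcal{E}/\partial\mathbf{k})_\perp|}, \tag{12.37}$$

where $(\partial\mathcal{E}/\partial\mathbf{k})_\perp$ is the component of $\partial\mathcal{E}/\partial\mathbf{k}$ perpendicular to the field, i.e., its projection
in the plane of the orbit.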

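The quantity $|(\partial\mathcal{E}/\partial\mathbf{k})_\perp|$ has the following geometrical interpretation: Let $\boldsymbol{\Delta}(\mathbf{k})$ be a
vector in the plane of the orbit that is perpendicular to the orbit at point k and that
joins the point k to a neighboring orbit in the same plane of energy $\mathcal{E} + \Delta\mathcal{E}$ (Figure
12.9b). When $\Delta\mathcal{E}$ is very small we have

$$\Delta\mathcal{E} = \frac{\partial\mathcal{E}}{\partial\mathbf{k}}\cdot\boldsymbol{\Delta}(\mathbf{k}) = \left(\frac{\partial\mathcal{E}}{\partial\mathbf{k}}\right)_{\!\perp}\cdot\boldsymbol{\Delta}(\mathbf{k}). \tag{12.38}$$

Figure 12.9

The geometry of orbit dynamics. The magnetic field H is along the z-axis. (a) Portions of
two orbits with the same $k_z$, lying on the constant-energy surfaces $\mathcal{E}(\mathbf{k}) = \mathcal{E}$ and $\mathcal{E}(\mathbf{k}) = \mathcal{E} + \Delta\mathcal{E}$.
The time of flight between $\mathbf{k}_1$ and $\mathbf{k}_2$ is given by (12.41). (b) A section of (a) in a plane
perpendicular to H containing the orbits. The line element $dk$ and vector $\boldsymbol{\Delta}(\mathbf{k})$ are indicated.
The shaded area is $(\partial A_{1,2}/\partial\mathcal{E})\,\Delta\mathcal{E}$.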


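Furthermore, since $\partial\mathcal{E}/\partial\mathbf{k}$ is perpendicular to surfaces of constant energy, the vector
$(\partial\mathcal{E}/\partial\mathbf{k})_\perp$ is perpendicular to the orbit, and hence parallel to $\boldsymbol{\Delta}(\mathbf{k})$. We can therefore
replace (12.38) by

$$\Delta\mathcal{E} = \left|\left(\frac{\partial\mathcal{E}}{\partial\mathbf{k}}\right)_{\!\perp}\right|\,\Delta(\mathbf{k}) \tag{12.39}$$

and rewrite (12.37):

$$t_2 - t_1 = \frac{\hbar^2 c}{eH}\,\frac{1}{\Delta\mathcal{E}}\int_{\mathbf{k}_1}^{\mathbf{k}_2}\Delta(\mathbf{k})\,dk. \tag{12.40}$$

The integral in Eq. (12.40) just gives the area of the plane between the two neighboring
orbits from $\mathbf{k}_1$ to $\mathbf{k}_2$ (Figure 12.9). Hence if we take the limit of (12.40) as $\Delta\mathcal{E} \to 0$,
we have

$$t_2 - t_1 = \frac{\hbar^2 c}{eH}\,\frac{\partial A_{1,2}}{\partial\mathcal{E}}, \tag{12.41}$$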


where $\partial A_{1,2}/\partial\mathcal{E}$ is the rate at which the portion of the orbit between $\mathbf{k}_1$ and $\mathbf{k}_2$ starts
to sweep out area in the given plane as $\mathcal{E}$ is increased.

The result (12.41) is most frequently encountered in the case in which the orbit is
a simple closed curve, and $\mathbf{k}_1$ and $\mathbf{k}_2$ are chosen to give a single complete circuit
($\mathbf{k}_1 = \mathbf{k}_2$). The quantity $t_2 - t_1$ is then the period $T$ of the orbit. If $A$ is the k-space
area enclosed by the orbit in its plane, then (12.41) gives27

$$T(\mathcal{E}, k_z) = \frac{\hbar^2 c}{eH}\,\frac{\partial}{\partial\mathcal{E}}A(\mathcal{E}, k_z). \tag{12.42}$$

To make this resemble the free electron result28

$$T = \frac{2\pi}{\omega_c} = \frac{2\pi mc}{eH}, \tag{12.43}$$

it is customary to define a cyclotron effective mass $m^*(\mathcal{E}, k_z)$:

$$m^*(\mathcal{E}, k_z) = \frac{\hbar^2}{2\pi}\,\frac{\partial A(\mathcal{E}, k_z)}{\partial\mathcal{E}}. \tag{12.44}$$


27 The quantities $A$ and $T$ depend on the energy $\mathcal{E}$ of the orbit and its plane, which is specified by
$k_z$ where the z-axis is taken to be along the field.


We  emphasize  that  this  effective  mass  is  not  necessarily  the  same  as  other  effective 
masses  that  it  is  convenient  to  define  in  other  contexts,  such  as  the  specific  heat 
effective  mass.  (See  Problem  2.) 
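
For free electrons the definition (12.44) should return the ordinary electron mass, and this is simple to confirm numerically. The sketch below is an assumed illustration: it uses the free electron orbit area $A = \pi(2m\mathcal{E}/\hbar^2 - k_z^2)$ and a central difference for $\partial A/\partial\mathcal{E}$; the function names and the sample energy and $k_z$ are arbitrary choices.

```python
import math

hbar = 1.054571817e-34    # J s
m = 9.1093837015e-31      # electron mass, kg
eV = 1.602176634e-19      # J

def orbit_area(energy, kz):
    """k-space area of a free electron orbit of given energy in the plane kz."""
    k_perp_sq = 2.0 * m * energy / hbar**2 - kz**2
    if k_perp_sq < 0.0:
        raise ValueError("no orbit at this energy and kz")
    return math.pi * k_perp_sq

def cyclotron_mass(area, energy, kz, d_energy):
    """Eq. (12.44), with dA/dE estimated by a central difference."""
    dA_dE = (area(energy + d_energy, kz) - area(energy - d_energy, kz)) / (2.0 * d_energy)
    return hbar**2 / (2.0 * math.pi) * dA_dE

# Illustrative values: a 5 eV orbit in the plane kz = 5e9 m^-1.
m_star = cyclotron_mass(orbit_area, 5.0 * eV, 5.0e9, 1.0e-3 * eV)
print(m_star / m)
```

The ratio printed is unity to numerical precision, independent of the energy and plane chosen, which is the free electron limit referred to in footnote 28. For a general band the same `cyclotron_mass` routine could be given a numerically computed orbit area instead.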


Semiclassical  Motion  in  Perpendicular  Uniform  Electric  and  Magnetic  Fields 

When  a  uniform  electric  field  E  is  present  in  addition  to  the  static  magnetic  field  H, 
the  equation  (12.35)  for  the  projection  of  the  real  space  orbit  in  a  plane  perpendicular 
to  H  acquires  an  additional  term: 

$$\mathbf{r}_\perp(t) - \mathbf{r}_\perp(0) = -\frac{\hbar c}{eH}\,\hat{\mathbf{H}} \times [\mathbf{k}(t) - \mathbf{k}(0)] + \mathbf{w}t, \tag{12.45}$$

where

$$\mathbf{w} = c\,\frac{E}{H}\,(\hat{\mathbf{E}} \times \hat{\mathbf{H}}). \tag{12.46}$$

Thus the motion in real space perpendicular to H is the superposition of (a) the k-space orbit rotated and scaled just as it would be if only the magnetic field were present and (b) a uniform drift with velocity w.29

To determine the k-space orbit we note that when E and H are perpendicular, the equation of motion (12.6b) can be written in the form

$$\hbar\dot{\mathbf{k}} = -\frac{e}{c}\,\frac{1}{\hbar}\,\frac{\partial\bar{\mathcal{E}}}{\partial\mathbf{k}} \times \mathbf{H}, \tag{12.47}$$

where30

$$\bar{\mathcal{E}}(\mathbf{k}) = \mathcal{E}(\mathbf{k}) - \hbar\mathbf{k}\cdot\mathbf{w}. \tag{12.48}$$

Equation (12.47) is the equation of motion an electron would have if only the magnetic field H were present, and if the band structure were given by $\bar{\mathcal{E}}(\mathbf{k})$ rather than $\mathcal{E}(\mathbf{k})$ (cf. Eq. (12.33)). We can therefore conclude from the analysis of that case that the k-space orbits are given by intersections of surfaces of constant $\bar{\mathcal{E}}$ with planes perpendicular to the magnetic field.


28 See page 14 and Eq. (1.18). Equation (12.43) is derived from the general result (12.42) in Problem 1.

29 The reader versed in electromagnetic theory will recognize w as the velocity of the frame of reference in which the electric field vanishes.

30 For a free electron $\bar{\mathcal{E}}$ is simply the electron's energy in the frame moving with velocity w (to within a k-independent additive constant).

We have thus found an explicit geometrical construction for the semiclassical orbits in crossed electric and magnetic fields.
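
The role of the drift velocity (12.46) can be illustrated with a short numerical check: for perpendicular fields, w is exactly the velocity at which the Lorentz force cancels, $\mathbf{E} + (1/c)\,\mathbf{w}\times\mathbf{H} = 0$, which is the step that turns (12.6b) into (12.47). This is a sketch under stated assumptions: the helper names and the field values are illustrative, not taken from the text, and Gaussian units are used.

```python
C_LIGHT = 2.99792458e10  # speed of light in cm/s (Gaussian units)

def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def drift_velocity(e_field, h_field, c=C_LIGHT):
    """Eq. (12.46) written as w = c (E x H) / H^2, valid for E perpendicular to H."""
    h_sq = sum(h * h for h in h_field)
    return tuple(c * x / h_sq for x in cross(e_field, h_field))

# Illustrative fields: E along x (statvolt/cm), H along z (gauss).
E = (1.0e-5, 0.0, 0.0)
H = (0.0, 0.0, 1.0e4)

w = drift_velocity(E, H)
w_cross_h = cross(w, H)
# Residual Lorentz field E + (1/c) w x H seen by a particle moving at w.
residual = tuple(e + x / C_LIGHT for e, x in zip(E, w_cross_h))
print(w, residual)
```

The drift is along $-y$, perpendicular to both fields, and the residual vanishes to rounding error.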


High-Field  Hall  Effect  and  Magnetoresistance31 

We pursue further the analysis in crossed electric and magnetic fields when (a) the magnetic field is very large (typically $10^4$ gauss or more; the pertinent physical condition depends on the band structure and will emerge below), and (b) $\bar{\mathcal{E}}(\mathbf{k})$ differs only slightly from $\mathcal{E}(\mathbf{k})$. The second assumption is almost certain to hold when the first does, since a typical wave vector k is at most of order $1/a_0$. Consequently

$$\hbar\mathbf{k}\cdot\mathbf{w} < \frac{\hbar}{a_0}\,c\,\frac{E}{H} = \left(\frac{e^2}{a_0}\right)\left(\frac{eEa_0}{\hbar\omega_c}\right). \tag{12.49}$$

Since $eEa_0$ is at most32 of order $10^{-10}$ eV and $\hbar\omega_c$ is of order $10^{-4}$ eV in a field of $10^4$ gauss, $\hbar\mathbf{k}\cdot\mathbf{w}$ is of order $10^{-6}$ Ry. Since $\mathcal{E}(\mathbf{k})$ is typically an appreciable fraction of a rydberg, $\bar{\mathcal{E}}(\mathbf{k})$ (Eq. (12.48)) is indeed close to $\mathcal{E}(\mathbf{k})$.
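
The orders of magnitude quoted after (12.49) are easy to reproduce. The sketch below assumes the free electron mass in $\omega_c = eH/mc$ and takes the upper bound $eEa_0 \sim 10^{-10}$ eV directly from the text; the constant and function names are illustrative.

```python
HBAR = 1.054571817e-27      # erg s
E_CHARGE = 4.80320471e-10   # electron charge in esu
M_E = 9.1093837015e-28      # electron mass in g
C_LIGHT = 2.99792458e10     # cm/s
ERG_PER_EV = 1.602176634e-12

def hbar_omega_c_ev(h_gauss, mass=M_E):
    """hbar * omega_c in eV for a field in gauss, with omega_c = eH/(mc)."""
    return HBAR * E_CHARGE * h_gauss / (mass * C_LIGHT) / ERG_PER_EV

hw_c = hbar_omega_c_ev(1.0e4)   # field of 10^4 gauss
e_field_energy = 1.0e-10        # eV, the bound on e*E*a0 quoted in the text

# Right side of (12.49) in rydbergs, using e^2/a0 = 2 Ry.
bound_ry = 2.0 * e_field_energy / hw_c
print(hw_c, bound_ry)
```

This gives $\hbar\omega_c \approx 1.2\times10^{-4}$ eV and a bound on $\hbar\mathbf{k}\cdot\mathbf{w}$ of roughly $2\times10^{-6}$ Ry, in line with the estimates above.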

The limiting behavior of the current induced by the electric field in high magnetic fields is quite different depending on whether (a) all occupied (or all unoccupied) electronic levels lie on orbits that are closed curves or (b) some of the occupied and unoccupied levels lie on orbits that do not close on themselves, but are extended or “open” in k-space. Because $\bar{\mathcal{E}}(\mathbf{k})$ is very close to $\mathcal{E}(\mathbf{k})$, we shall assume that whichever of these criteria is satisfied by the orbits determined by $\bar{\mathcal{E}}$ is also satisfied by the orbits determined by $\mathcal{E}$.

Case 1 When all occupied (or all unoccupied) orbits are closed, we shall take the high magnetic field condition to mean that these orbits can be traversed many times between successive collisions. In the free electron case this reduces to the condition $\omega_c\tau \gg 1$, and we shall take this as indicating the order of magnitude required of the field in the general case. To satisfy the condition requires not only very large fields ($10^4$ gauss or more) but also very pure single crystals at very low temperatures, to guarantee long relaxation times. With some effort values of $\omega_c\tau$ as large as 100 or more have been achieved.

Suppose then that the period $T$ is small compared with the relaxation time $\tau$, for every orbit containing occupied levels.33 To calculate the current density at time t = 0 we note34 that $\mathbf{j} = -ne\mathbf{v}$, where v is the mean velocity acquired by an electron since its last collision, averaged over all occupied levels. Since the mean time back


31  In  applying  the  preceding  analysis  of  crossed  E  and  H  fields  to  the  theory  of  the  Hall  effect  and 
magnetoresistance,  we  restrict  ourselves  to  geometries  symmetrically  enough  disposed  with  respect  to 
the  crystal  axes  that  both  Hall  and  applied  electric  fields  are  perpendicular  to  the  magnetic  field.  However, 
similar  conclusions  can  be  reached  by  the  more  elaborate  methods  of  Chapter  13,  in  the  general  case. 

32  See  the  paragraph  following  Eq.  (12.9). 

33 For concreteness we assume the occupied levels all lie on closed orbits. If it is the unoccupied levels, we can appeal to the discussion of holes to justify essentially the same analysis, except that the current density will be given by $\mathbf{j} = +n_h e\mathbf{v}$, where $n_h$ is the density of holes, and v is the mean velocity that an electron would acquire in time $\tau$ averaged over all unoccupied levels.

34 See Eq. (1.4). The analysis that follows is quite similar in spirit to the Drude derivation of the DC conductivity in Chapter 1. We shall calculate the current carried by a single band, since the contributions of more than one band simply add together.

to the last collision is $\tau$, we can conclude from (12.45) that the component of this velocity perpendicular to the magnetic field for a particular electron is just

$$\frac{\mathbf{r}_\perp(0) - \mathbf{r}_\perp(-\tau)}{\tau} = -\frac{\hbar c}{eH}\,\hat{\mathbf{H}} \times \frac{\mathbf{k}(0) - \mathbf{k}(-\tau)}{\tau} + \mathbf{w}. \tag{12.50}$$

Since all occupied orbits are closed, $\Delta\mathbf{k} = \mathbf{k}(0) - \mathbf{k}(-\tau)$ is bounded in time, so for sufficiently large $\tau$ the drift velocity w gives the dominant contribution to (12.50) and we have35

$$\lim_{\tau/T\to\infty} \mathbf{j}_\perp = -ne\mathbf{w} = -\frac{nec}{H}\,(\mathbf{E} \times \hat{\mathbf{H}}). \tag{12.51}$$

If it is the unoccupied levels that all lie on closed orbits, the corresponding result is36

$$\lim_{\tau/T\to\infty} \mathbf{j}_\perp = +\frac{n_h ec}{H}\,(\mathbf{E} \times \hat{\mathbf{H}}). \tag{12.52}$$

Equations (12.51) and (12.52) assert that when all relevant orbits are closed, the deflection of the Lorentz force is so effective in preventing electrons from acquiring energy from the electric field that the uniform drift velocity w perpendicular to E gives the dominant contribution to the current. The result embodied in Eqs. (12.51) and (12.52) is usually stated in terms of the Hall coefficient, defined (see Eq. (1.15)) as the component of the electric field perpendicular to the current, divided by the product of the magnetic field and the current density. If the entire current is carried by electrons from a single band, for which (12.51) or (12.52) is valid, then the high-field Hall coefficient is simply37

$$R_\infty = -\frac{1}{nec}, \ \text{electrons}; \qquad R_\infty = +\frac{1}{n_h ec}, \ \text{holes}. \tag{12.53}$$


35 In writing the high-field limit in this form, we are interpreting the limit to mean large $\tau$ at fixed H rather than large H at fixed $\tau$. To demonstrate that the same leading term emerges in the latter case, or to estimate the value of $\omega_c\tau$ at which the leading term begins to dominate, requires a somewhat deeper analysis. We first note that if the electric field were zero, then the net contribution to the mean current of the term in (12.50) proportional to $\Delta\mathbf{k}$ would vanish when averaged over occupied orbits (since both j and w must vanish when E = 0). When E ≠ 0, $\Delta\mathbf{k}$ no longer vanishes when averaged over orbits, because the replacement of $\mathcal{E}$ by $\bar{\mathcal{E}}$ (Eq. (12.48)) displaces all the k-space orbits in the same general direction. This is easily seen in the free electron case, since if $\mathcal{E}(\mathbf{k}) = \hbar^2k^2/2m$, then $\bar{\mathcal{E}}(\mathbf{k})$ is given (to within a dynamically irrelevant additive constant) by $\bar{\mathcal{E}}(\mathbf{k}) = \hbar^2(\mathbf{k} - m\mathbf{w}/\hbar)^2/2m$. Consequently, when averaged over all orbits $\Delta\mathbf{k}$ will no longer give zero, but $m\mathbf{w}/\hbar$. It follows from (12.50) that the size of the contribution of $\Delta\mathbf{k}$ to the mean velocity v, when averaged over orbits, will be $(mw/\hbar)(\hbar c/eH)(1/\tau) = w/(\omega_c\tau)$. This is smaller than the leading term w by $1/\omega_c\tau$. Thus the limiting form (12.51) does indeed become valid when the orbits can be traversed many times between collisions. For a general band structure the average of $\Delta\mathbf{k}$ will be more complicated (for example, it will depend on the particular orbit), but we can expect the free electron estimate to give the right order of magnitude if m is replaced by a suitably defined effective mass.

36 Since (12.51) and (12.52) are manifestly different, there can be no band in which all orbits (occupied and unoccupied) are closed curves. The topologically minded reader is invited to deduce this directly from the periodicity of $\mathcal{E}(\mathbf{k})$.

37 This remarkably general result is nothing but a compact way of expressing the dominance in the current of the drift velocity w in the high-field limit. It holds for quite general band structure precisely because the semiclassical equations preserve the fundamental role that w plays in free electron theory. It fails (see below) when some electron and hole orbits are open, because w then no longer dominates the high-field current.

This is just the elementary result (1.21) of free electron theory, reappearing under remarkably more general circumstances provided that (a) all occupied (or all unoccupied) orbits are closed, (b) the field is large enough that each orbit is traversed many times between collisions, and (c) the carriers are taken to be holes if it is the unoccupied orbits that are closed. Thus the semiclassical theory can account for the “anomalous” sign of some measured Hall coefficients,38 as well as preserving, under fairly general conditions, the very valuable information about the carrier density that measured (high-field) Hall coefficients yield.
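
As an illustration of how a measured high-field Hall coefficient yields the carrier sign and density, the sketch below inverts (12.53). It is a sketch under stated assumptions: it uses the SI form $R_\infty = \mp 1/(ne)$ rather than the Gaussian form $\mp 1/(nec)$ of the text, and the density is an illustrative number, not a value taken from this chapter.

```python
E_CHARGE = 1.602176634e-19  # elementary charge in coulombs (SI)

def hall_coefficient(density, carrier="electron"):
    """High-field Hall coefficient in m^3/C for a single band, SI form of (12.53)."""
    if density <= 0.0:
        raise ValueError("density must be positive")
    if carrier not in ("electron", "hole"):
        raise ValueError("carrier must be 'electron' or 'hole'")
    sign = -1.0 if carrier == "electron" else 1.0
    return sign / (density * E_CHARGE)

def carriers_from_hall(r_hall):
    """Invert (12.53): return (carrier type, density in m^-3)."""
    if r_hall == 0.0:
        raise ValueError("a vanishing Hall coefficient has no single-band reading")
    kind = "electron" if r_hall < 0.0 else "hole"
    return kind, 1.0 / (abs(r_hall) * E_CHARGE)

# Illustrative density of 8.5e28 electrons per cubic meter.
r_inf = hall_coefficient(8.5e28, "electron")
kind, density = carriers_from_hall(r_inf)
print(r_inf, kind, density)
```

A positive coefficient is read as a band of holes, which is how the semiclassical theory accommodates the “anomalous” sign. The inversion is only meaningful under conditions (a) to (c) above.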

If several bands contribute to the current density, each of which has only closed electron (or hole) orbits, then (12.51) or (12.52) holds separately for each band, and the total current density in the high-field limit will be

$$\lim_{\tau/T\to\infty} \mathbf{j}_\perp = -\frac{n_{\text{eff}}\,ec}{H}\,(\mathbf{E} \times \hat{\mathbf{H}}), \tag{12.54}$$

where $n_{\text{eff}}$ is the total density of electrons minus the total density of holes. The high-field Hall coefficient will then be

$$R_\infty = -\frac{1}{n_{\text{eff}}\,ec}. \tag{12.55}$$

Further  aspects  of  the  many-band  case,  including  the  question  of  how  (12.55)  must 
be  modified  when  the  density  of  electrons  is  equal  to  the  density  of  holes  (so-called 
compensated  materials),  are  explored  in  Problem  4. 

One can also verify (Problem 5) that since the corrections to the high-field current densities (12.51) and (12.52) are smaller by a factor of order $1/\omega_c\tau$, the transverse magnetoresistance39 approaches a field-independent constant (“saturates”) in the high-field limit40 provided that the current is carried by a single band with closed electron (or hole) orbits. The many-band case is investigated in Problem 4, where it is shown that if all electron or hole orbits are closed in each band, the magnetoresistance continues to saturate unless the material is compensated, in which case it grows without limit with increasing magnetic field.

Case 2 The above conclusions change drastically if there is at least one band in which neither all occupied nor all unoccupied orbits are closed. This will be the case if at least some orbits at the Fermi energy are open, unbounded curves (Figure 12.8). Electrons in such orbits are no longer forced by the magnetic field to undergo a periodic motion along the direction of the electric field, as they are in closed orbits. Consequently the magnetic field is no longer effective in frustrating the ability of such electrons to acquire energy from the driving electric field. If the unbounded orbit stretches in a real space direction $\hat{\mathbf{n}}$, one would therefore expect to find a contribution to the current that does not vanish in the high-field limit, directed along $\hat{\mathbf{n}}$, and proportional to the projection of E along $\hat{\mathbf{n}}$:

$$\mathbf{j} = \sigma^{(0)}\,\hat{\mathbf{n}}\,(\hat{\mathbf{n}}\cdot\mathbf{E}) + \boldsymbol{\sigma}^{(1)}\cdot\mathbf{E}, \qquad \sigma^{(0)} \to \text{constant as } H \to \infty, \quad \boldsymbol{\sigma}^{(1)} \to 0 \text{ as } H \to \infty. \tag{12.56}$$


38  See  Table  1.4,  Figure  1.4,  and  p.  58. 

39  See  page  12. 

40 In free electron theory the magnetoresistance is independent of magnetic field strength (page 14).

Figure 12.10 (panels (a) and (b))

A section of constant-energy surfaces in a plane perpendicular to the magnetic field H, showing occupied open (shaded) and closed (unshaded) orbits. In (a) no electric field is present and the currents carried by open orbits in opposite directions cancel. In (b) an electric field E is present, leading in the steady state to an imbalance in oppositely directed populated open orbits, and hence a net current. (This follows from the fact that $\bar{\mathcal{E}}$ (Eq. (12.48)) is conserved by the semiclassical motion between collisions.)

This expectation is confirmed by the semiclassical equations, for the growth in wave vector ($\Delta\mathbf{k}$) of an electron emerging from a collision on an open orbit is not bounded in time, but grows at a rate41 directly proportional to H, leading to a contribution to the average velocity (12.50) that is independent of magnetic field strength and directed along the real space direction of the open orbit in the high-field limit.42

The  high-field  form  (12.56)  is  radically  different  from  the  forms  (12.51)  or  (12.52) 
appropriate  to  carriers  all  of  whose  orbits  are  closed.  As  a  result  the  Hall  coefficient 
no  longer  has  the  simple  high-field  limit  (12.53).  Furthermore,  the  conclusion  that  the 
high-field  magnetoresistance  approaches  a  constant  is  no  longer  valid;  indeed,  the 
failure  of  the  magnetoresistance  to  saturate  is  one  of  the  characteristic  signals  that  a 
Fermi  surface  can  support  open  orbits. 

To understand the implications of the limiting behavior (12.56) on the high-field magnetoresistance, consider an experiment (Figure 12.11) in which the direction of current flow (determined by the geometry of the specimen) does not lie along the direction $\hat{\mathbf{n}}$ of the open orbit in real space. Because of (12.56) this is possible in the


41  Since  the  rate  at  which  the  orbit  is  traversed  is  proportional  to  H  (Eq.  (12.41)). 

42 Note that when E = 0 this contribution must still give zero when averaged over all open orbits (since j and w are zero) so there must be oppositely directed open orbits whose contributions cancel. When an electric field is present, however, the orbits directed so as to extract energy from the field become more heavily populated at the expense of those directed so as to lose energy (Figure 12.10). This population difference is proportional to the projection of the drift velocity w along the k-space direction of the orbit, or, equivalently, to the projection of E along the real space direction of the orbit. This is the source of the $\hat{\mathbf{n}}\cdot\mathbf{E}$ dependence in (12.56).

Figure 12.11

Schematic picture of the current j in a wire perpendicular to a magnetic field H, when an open orbit lies in a real-space direction $\hat{\mathbf{n}}$ perpendicular to the field. In the high-field limit the total electric field E becomes perpendicular to $\hat{\mathbf{n}}$. Since the component $E_x$ parallel to j is determined by the applied potential, this is brought about by the appearance of the transverse field $E_y$, due to the charge that accumulates on the surfaces of the specimen. Thus the Hall angle (the angle between j and E) is just the complement of the angle between j and the open-orbit direction. It therefore fails (in contrast to the free electron case, page 14) to approach 90° in the high-field limit.


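high-field limit only if the projection of the electric field on $\hat{\mathbf{n}}$, $\mathbf{E}\cdot\hat{\mathbf{n}}$, vanishes.43 The electric field therefore has the form (see Figure 12.11)

$$\mathbf{E} = E^{(0)}\,\hat{\mathbf{n}}' + E^{(1)}\,\hat{\mathbf{n}}, \tag{12.57}$$

where $\hat{\mathbf{n}}'$ is a unit vector perpendicular to both $\hat{\mathbf{n}}$ and $\hat{\mathbf{H}}$ ($\hat{\mathbf{n}}' = \hat{\mathbf{n}} \times \hat{\mathbf{H}}$), $E^{(0)}$ is independent of H in the high-field limit, and $E^{(1)}$ vanishes as $H \to \infty$.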

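The magnetoresistance is the ratio of the component of E along j, to j:

$$\rho = \frac{\mathbf{E}\cdot\hat{\mathbf{j}}}{j}. \tag{12.58}$$

When the current is not parallel to the direction $\hat{\mathbf{n}}$ of the open orbit, this gives in the high-field limit

$$\rho = \left(\frac{E^{(0)}}{j}\right)\hat{\mathbf{n}}'\cdot\hat{\mathbf{j}}. \tag{12.59}$$

To find $E^{(0)}/j$ we first substitute the electric field (12.57) into the field-current relation (12.56) to find, in the high-field limit, the leading behavior

$$\mathbf{j} = \sigma^{(0)}\,\hat{\mathbf{n}}\,E^{(1)} + \boldsymbol{\sigma}^{(1)}\cdot\hat{\mathbf{n}}'\,E^{(0)}. \tag{12.60}$$

Since $\hat{\mathbf{n}}'$ is perpendicular to $\hat{\mathbf{n}}$, this implies that

$$\hat{\mathbf{n}}'\cdot\mathbf{j} = E^{(0)}\,\hat{\mathbf{n}}'\cdot\boldsymbol{\sigma}^{(1)}\cdot\hat{\mathbf{n}}', \tag{12.61}$$
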

43 In the experiment (Figure 12.11) the component of E along j is specified. However, in the steady state there will also be a Hall field perpendicular to j (see p. 12), which makes it possible for $\mathbf{E}\cdot\hat{\mathbf{n}}$ to vanish in the high-field limit.

or

$$\frac{E^{(0)}}{j} = \frac{\hat{\mathbf{n}}'\cdot\hat{\mathbf{j}}}{\hat{\mathbf{n}}'\cdot\boldsymbol{\sigma}^{(1)}\cdot\hat{\mathbf{n}}'}. \tag{12.62}$$

Substituting this result into Eq. (12.59), we find that in the high-field limit the leading term in the magnetoresistance is

$$\rho = \frac{(\hat{\mathbf{n}}'\cdot\hat{\mathbf{j}})^2}{\hat{\mathbf{n}}'\cdot\boldsymbol{\sigma}^{(1)}\cdot\hat{\mathbf{n}}'}. \tag{12.63}$$

Since $\boldsymbol{\sigma}^{(1)}$ vanishes in the high-field limit, this gives a magnetoresistance that grows without limit with increasing field, and is proportional to the square of the sine of the angle between the current and the real-space direction of the open orbit.
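
A small numerical sketch makes the angular and field dependence of (12.63) concrete. The text only states that $\boldsymbol{\sigma}^{(1)}$ vanishes at high field; the form $\hat{\mathbf{n}}'\cdot\boldsymbol{\sigma}^{(1)}\cdot\hat{\mathbf{n}}' = s/H^2$ used here, with a hypothetical constant `s`, is an assumption made purely for illustration, as are the function name and the numbers.

```python
import math

def open_orbit_magnetoresistance(theta, h, s):
    """Leading high-field term (12.63), assuming n'.sigma1.n' = s / h**2.

    theta is the angle between the current and the open-orbit direction n,
    so that |n' . j_hat| = sin(theta) for a current in the plane normal to H.
    """
    if s <= 0.0 or h <= 0.0:
        raise ValueError("s and h must be positive")
    n_prime_dot_j = math.sin(theta)
    return n_prime_dot_j ** 2 * h ** 2 / s

# Illustrative values in arbitrary units.
rho_90 = open_orbit_magnetoresistance(math.pi / 2, 10.0, 1.0)
rho_30 = open_orbit_magnetoresistance(math.pi / 6, 10.0, 1.0)
rho_90_double_field = open_orbit_magnetoresistance(math.pi / 2, 20.0, 1.0)
print(rho_30 / rho_90, rho_90_double_field / rho_90)
```

Under this assumption the ratio at 30° to 90° is $\sin^2 30° = 1/4$, and doubling the field quadruples the result. The expression applies only when the current is not parallel to the open-orbit direction.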

Thus  the  semiclassical  model  resolves  another  anomaly  of  free  electron  theory, 
providing  two  possible  mechanisms44  by  which  the  magnetoresistance  can  grow 
without  limit  with  increasing  magnetic  field. 


We  defer  until  Chapter  15  some  illustrations  of  how  these  predictions  are  confirmed 
by  the  behavior  of  real  metals,  and  turn  next  to  a  more  systematic  method  of  extracting 
transport  coefficients  from  the  semiclassical  model. 


PROBLEMS 

1. For free electrons $\mathcal{E}(\mathbf{k}) = \hbar^2k^2/2m$. Calculate $\partial A(\mathcal{E}, k_z)/\partial\mathcal{E}$ and show that the general expression (12.42) for the period in a magnetic field reduces to the free electron result (12.43).

2. For electrons near a band minimum (or maximum) $\mathcal{E}(\mathbf{k})$ has the form

$$\mathcal{E}(\mathbf{k}) = \text{constant} + \frac{\hbar^2}{2}\,(\mathbf{k} - \mathbf{k}_0)\cdot\mathbf{M}^{-1}\cdot(\mathbf{k} - \mathbf{k}_0), \tag{12.64}$$

where the matrix M is independent of k. (Electrons in semiconductors are almost always treated in this approximation.)

(a) Calculate the cyclotron effective mass from (12.44) and show that it is independent of $\mathcal{E}$ and $k_z$, and given by

$$m^* = \left(\frac{|\mathbf{M}|}{M_{zz}}\right)^{1/2}, \quad \text{(cyclotron)} \tag{12.65}$$

where $|\mathbf{M}|$ is the determinant of the matrix M.

(b) Calculate the electronic specific heat (2.80) resulting from the band structure (12.64), and by comparing it with the corresponding free electron result, show that the band structure contribution to the specific heat effective mass (page 48) is given by

$$m^* = |\mathbf{M}|^{1/3}. \quad \text{(specific heat)} \tag{12.66}$$

3. When $\mathcal{E}(\mathbf{k})$ has the form (12.64), the semiclassical equations of motion are linear, and therefore easily solved.

(a) Generalize the analysis in Chapter 1 to show that for such electrons the DC conductivity is given by

$$\boldsymbol{\sigma} = ne^2\tau\,\mathbf{M}^{-1}. \tag{12.67}$$

(b) Rederive the result (12.65) for the cyclotron effective mass by finding explicitly the time-dependent solutions to Eq. (12.31):

$$\mathbf{M}\cdot\frac{d\mathbf{v}}{dt} = -e\left(\mathbf{E} + \frac{\mathbf{v}}{c} \times \mathbf{H}\right), \tag{12.68}$$

and noting that the angular frequency is related to $m^*$ by $\omega = eH/m^*c$.

44 Open orbits or (Problem 4) compensation. In free electron theory the magnetoresistance is field-independent.

4. The free electron result (1.19) for the current induced by an electric field perpendicular to a uniform magnetic field can be written in the form

$$\mathbf{E} = \boldsymbol{\rho}\cdot\mathbf{j}, \tag{12.69}$$

where the resistivity tensor $\boldsymbol{\rho}$ has the form

$$\boldsymbol{\rho} = \begin{pmatrix} \rho & -RH \\ RH & \rho \end{pmatrix}. \tag{12.70}$$

(It follows from the definitions (1.14) and (1.15) that $\rho$ is the magnetoresistance and R the Hall coefficient.)

(a) Consider a metal with several partially filled bands, in each of which the induced current is related to the field by $\mathbf{E} = \boldsymbol{\rho}_n\cdot\mathbf{j}_n$, where the $\boldsymbol{\rho}_n$ have the form (12.70):45

$$\boldsymbol{\rho}_n = \begin{pmatrix} \rho_n & -R_nH \\ R_nH & \rho_n \end{pmatrix}. \tag{12.71}$$

Show that the total induced current is given by $\mathbf{E} = \boldsymbol{\rho}\cdot\mathbf{j}$, with

$$\boldsymbol{\rho} = \left(\sum_n \boldsymbol{\rho}_n^{-1}\right)^{-1}. \tag{12.72}$$

(b) Show that if there are only two bands, then the Hall coefficient and magnetoresistance are given by:

$$R = \frac{R_1\rho_2^2 + R_2\rho_1^2 + R_1R_2(R_1 + R_2)H^2}{(\rho_1 + \rho_2)^2 + (R_1 + R_2)^2H^2}, \tag{12.73}$$

$$\rho = \frac{\rho_1\rho_2(\rho_1 + \rho_2) + (\rho_1R_2^2 + \rho_2R_1^2)H^2}{(\rho_1 + \rho_2)^2 + (R_1 + R_2)^2H^2}. \tag{12.74}$$

Note the explicit dependence on magnetic field strength even if the $R_i$ and $\rho_i$ are field-independent (as they are for free electron bands).

(c) Deduce directly from (12.73) the form (12.55) of the high-field Hall coefficient when both bands have closed orbits, and discuss the limiting high-field behavior in the case $n_{\text{eff}} = 0$ (i.e., for a compensated two-band metal). Show that in this case the magnetoresistance increases as $H^2$ with increasing field.


45 The form (12.71) does not, in general, require that each band be free-electron-like, but only that the magnetic field lie along an axis of sufficient symmetry.

5. Since the correction to the high-field result (12.51) is of order $1/H^2$, the general form for the current induced in an electron band with closed orbits is

$$\mathbf{j} = -\frac{nec}{H}\,(\mathbf{E} \times \hat{\mathbf{H}}) + \boldsymbol{\sigma}^{(1)}\cdot\mathbf{E}, \tag{12.75}$$

where

$$\lim_{H\to\infty} H^2\boldsymbol{\sigma}^{(1)} < \infty. \tag{12.76}$$

Show that the high-field magnetoresistance is given by

$$\rho = \frac{1}{(nec)^2}\,\lim_{H\to\infty} H^2\sigma^{(1)}_{yy}, \tag{12.77}$$

where the y-axis is perpendicular to the magnetic field and the direction of current flow. Note that Eq. (12.76) requires the magnetoresistance to saturate.


6. The validity of the semiclassical result $\mathbf{k}(t) = \mathbf{k}(0) - e\mathbf{E}t/\hbar$ for an electron in a uniform electric field is strongly supported by the following theorem (which also provides a useful starting point for a rigorous theory of electric breakdown):

Consider the time-dependent Schrödinger equation for an electron in a periodic potential $U(\mathbf{r})$ and a uniform electric field:

$$i\hbar\frac{\partial\psi}{\partial t} = \left[-\frac{\hbar^2}{2m}\nabla^2 + U(\mathbf{r}) + e\mathbf{E}\cdot\mathbf{r}\right]\psi = H\psi. \tag{12.78}$$

Suppose that at time t = 0, $\psi(\mathbf{r}, 0)$ is a linear combination of Bloch levels, all of which have the same wave vector k. Then at time t, $\psi(\mathbf{r}, t)$ will be a linear combination of Bloch levels,46 all of which have the wave vector $\mathbf{k} - e\mathbf{E}t/\hbar$.

Prove this theorem by noting that the formal solution to the Schrödinger equation is

$$\psi(\mathbf{r}, t) = e^{-iHt/\hbar}\,\psi(\mathbf{r}, 0), \tag{12.79}$$

and by expressing the assumed property of the initial level and the property to be proved of the final level in terms of the effect on the wave function of translations through Bravais lattice vectors.

7.  (a)  Does  the  orbit  in  Figure  12.7  enclose  occupied  or  unoccupied  levels? 

(b)  Do  the  closed  orbits  in  Figure  12.10  enclose  occupied  or  unoccupied  levels? 

46 The semiclassical theory of an electron in a uniform electric field is not exact in spite of this theorem, because the coefficients in the linear combination of Bloch levels will in general depend on time; thus interband transitions may occur.

Our discussion of electronic conduction in Chapters 1, 2, and 12 was often somewhat qualitative and frequently depended on simplifying features of the particular case being examined. In this chapter we describe a more systematic method of calculating conductivities, applicable to general semiclassical motion in the presence of space- and time-dependent perturbing fields and temperature gradients. The physical approximations underlying this analysis are no more rigorous or sophisticated than those used in Chapter 12, merely more precisely stated. However, the method by which the currents are calculated from the basic physical assumptions is more general and systematic, and of such a form that comparison with more accurate theories can easily be made (Chapter 16).

The description of conduction in this chapter will employ a nonequilibrium distribution function $g_n(\mathbf{r}, \mathbf{k}, t)$ defined so that $g_n(\mathbf{r}, \mathbf{k}, t)\,d\mathbf{r}\,d\mathbf{k}/4\pi^3$ is the number of electrons in the nth band at time t in the semiclassical phase space volume $d\mathbf{r}\,d\mathbf{k}$ about the point r, k. In equilibrium g reduces to the Fermi function,

$$g_n(\mathbf{r}, \mathbf{k}, t) \equiv f(\mathcal{E}_n(\mathbf{k})), \qquad f(\mathcal{E}) = \frac{1}{e^{(\mathcal{E} - \mu)/k_BT} + 1}, \tag{13.1}$$

but in the presence of applied fields and/or temperature gradients it will differ from its equilibrium form.
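
For reference, the equilibrium form (13.1) is straightforward to evaluate numerically. The sketch below is illustrative only: the function name, the choice of electron volts and kelvin as units, and the sample values are assumptions, and the two-branch form is simply a way of avoiding overflow in the exponential far from the chemical potential.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K

def fermi(energy, mu, temperature):
    """Fermi function (13.1) for energies in eV and temperature in K."""
    if temperature <= 0.0:
        raise ValueError("temperature must be positive")
    x = (energy - mu) / (K_B_EV * temperature)
    if x > 0.0:
        # Rewrite 1/(e^x + 1) as e^-x/(1 + e^-x) so large x cannot overflow.
        ex = math.exp(-x)
        return ex / (1.0 + ex)
    return 1.0 / (1.0 + math.exp(x))

# Illustrative values: chemical potential of 5 eV at room temperature.
f_at_mu = fermi(5.0, 5.0, 300.0)
f_above = fermi(5.1, 5.0, 300.0)
f_below = fermi(4.9, 5.0, 300.0)
print(f_at_mu, f_above, f_below)
```

The occupation is exactly 1/2 at $\mathcal{E} = \mu$, and levels placed symmetrically about $\mu$ have occupations that sum to 1.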

In  this  chapter  we  shall  derive  a  closed  expression  for  g,  based  on  (a)  the  assumption 
that  between  collisions  the  electronic  motion  is  determined  by  the  semiclassical 
equations  (12.6),  and  (b)  a  particularly  simple  treatment  of  collisions,  known  as  the 
relaxation-time  approximation,  that  gives  a  precise  content  to  the  qualitative  view 
of  collisions  we  have  put  forth  in  earlier  chapters.  We  shall  then  use  the  nonequilibrium 
distribution  function  to  calculate  the  electric  and  thermal  currents  in  several  cases 
of  interest  beyond  those  considered  in  Chapter  12. 


THE  RELAXATION-TIME  APPROXIMATION 

Our fundamental picture of collisions retains the general features described in Chapter 1, which we now formulate more precisely in a set of assumptions known as the relaxation-time approximation. We continue to assume that an electron experiences a collision in an infinitesimal time interval dt with probability $dt/\tau$, but now allow for the possibility that the collision rate depends on the position, wave vector, and band index of the electron: $\tau = \tau_n(\mathbf{r}, \mathbf{k})$. We express the fact that collisions drive the electronic system toward local thermodynamic equilibrium in the following additional assumptions:

1. The distribution of electrons emerging from collisions at any time does not depend on the structure of the nonequilibrium distribution function $g_n(\mathbf{r}, \mathbf{k}, t)$ just prior to the collision.

2. If the electrons in a region about r have the equilibrium distribution appropriate to a local temperature1 $T(\mathbf{r})$,

$$g_n(\mathbf{r}, \mathbf{k}, t) = g_n^0(\mathbf{r}, \mathbf{k}) \equiv \frac{1}{e^{(\mathcal{E}_n(\mathbf{k}) - \mu(\mathbf{r}))/k_BT(\mathbf{r})} + 1}, \tag{13.2}$$

then collisions will not alter the form of the distribution function.

Assumption 1 asserts that collisions are completely effective in obliterating any information about the nonequilibrium configuration that the electrons may be carrying. This almost certainly overestimates the efficacy of collisions in restoring equilibrium (see Chapter 16).

Assumption 2 is a particularly simple way of representing quantitatively the fact that it is the role of collisions to maintain thermodynamic equilibrium at whatever local temperature is imposed by the conditions of the experiment.2

These two assumptions completely determine the form $dg_n(\mathbf{r}, \mathbf{k}, t)$ of the distribution function describing just those electrons that have emerged from a collision near point r in the time interval dt about t. According to assumption (1) dg cannot depend on the particular form of the full nonequilibrium distribution function $g_n(\mathbf{r}, \mathbf{k}, t)$. It therefore suffices to determine dg for any particular form of g. The simplest case is when g has the local equilibrium form (13.2), for according to assumption (2) the effect of collisions is then to leave this form unaltered. We know, however, that in the time interval dt a fraction $dt/\tau_n(\mathbf{r}, \mathbf{k})$ of the electrons in band n with wave vector k near position r will suffer a collision that does alter their band index and/or wave vector. If the form (13.2) of the distribution function is nevertheless to be unaltered, then the distribution of those electrons that emerge from collisions into band n with wave vector k during the same interval must precisely compensate for this loss. Thus:

$$dg_n(\mathbf{r}, \mathbf{k}, t) = \frac{dt}{\tau_n(\mathbf{r}, \mathbf{k})}\,g_n^0(\mathbf{r}, \mathbf{k}). \tag{13.3}$$

Equation (13.3) is the precise mathematical formulation of the relaxation-time approximation.3
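
To see what (13.3) implies in the simplest setting, consider a single level in a spatially uniform system with no applied fields and a constant relaxation time. These restrictions are assumptions of this sketch, not of the text. In each interval dt a fraction $dt/\tau$ of the electrons in the level is scattered out, while (13.3) feeds in $g^0\,dt/\tau$, so the occupation should relax exponentially toward $g^0$. The names and numbers below are illustrative.

```python
import math

def relax(g_start, g_eq, tau, t_final, steps):
    """Step the occupation of one level using loss g*dt/tau and gain g_eq*dt/tau."""
    dt = t_final / steps
    g = g_start
    for _ in range(steps):
        loss = g * dt / tau        # electrons scattered out of the level
        gain = g_eq * dt / tau     # electrons scattered in, from Eq. (13.3)
        g += gain - loss
    return g

# Illustrative values: start at occupation 0.9, local equilibrium value 0.1.
g_numeric = relax(0.9, 0.1, tau=1.0, t_final=5.0, steps=5000)
g_expected = 0.1 + (0.9 - 0.1) * math.exp(-5.0)
print(g_numeric, g_expected)
```

The stepped result agrees with $g^0 + (g(0) - g^0)\,e^{-t/\tau}$ to within the discretization error, which is the sense in which $\tau$ is a relaxation time.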


1 The only case we shall discuss in which the local equilibrium distribution is not the uniform equilibrium distribution (13.1) (with constant T and $\mu$) is when the spatially varying temperature $T(\mathbf{r})$ is imposed by suitable application of sources and/or sinks of heat, as in a thermal conductivity measurement. In that case, since the electronic density n is constrained (electrostatically) to be constant, the local chemical potential will also depend on position, so that $\mu(\mathbf{r}) = \mu_{\text{equilib}}(n, T(\mathbf{r}))$. In the most general case the local temperature and chemical potential may depend on time as well as position. See, for example, Problem 4 at the end of this chapter, and Problem 1b in Chapter 16.

2  A  more  fundamental  theory  would  derive  the  fact  that  collisions  play  such  a  role,  rather  than 
assuming  it. 

3  The  relaxation-time  approximation  is  critically  reexamined  in  Chapter  16,  where  it  is  compared 
with  a  more  accurate  treatment  of  collisions. 
Given these assumptions, we can compute the nonequilibrium distribution function
in the presence of external fields and temperature gradients.4


CALCULATION  OF  THE  NONEQUILIBRIUM 
DISTRIBUTION  FUNCTION 

Consider  the  group  of  electrons  in  the  nth  band  that  find  themselves  at  time  t  in  the 
volume  element  dr  d k  about  r,  k.  The  number  of  electrons  in  this  group  is  given  in 
terms  of  the  distribution  function  by 


$$ dN = g_n(\mathbf{r}, \mathbf{k}, t)\, \frac{d\mathbf{r}\, d\mathbf{k}}{4\pi^3}. \tag{13.4} $$


We can compute this number by grouping the electrons according to when they had
their last collision. Let $\mathbf{r}_n(t')$, $\mathbf{k}_n(t')$ be the solution to the semiclassical equations of
motion for the $n$th band that passes through the point $\mathbf{r}$, $\mathbf{k}$ when $t' = t$:

$$ \mathbf{r}_n(t) = \mathbf{r}, \qquad \mathbf{k}_n(t) = \mathbf{k}. \tag{13.5} $$

Electrons in the volume element $d\mathbf{r}\, d\mathbf{k}$ about $\mathbf{r}$, $\mathbf{k}$ at time $t$, whose last collision prior
to $t$ was in the interval $dt'$ about $t'$, must have emerged from that last collision in
a phase space volume element $d\mathbf{r}'\, d\mathbf{k}'$ about $\mathbf{r}_n(t')$, $\mathbf{k}_n(t')$, since after $t'$ their motion
is determined entirely by the semiclassical equations, which must get them to $\mathbf{r}$, $\mathbf{k}$
at time $t$. According to the relaxation-time approximation (13.3), the total number
of electrons emerging from collisions at $\mathbf{r}_n(t')$, $\mathbf{k}_n(t')$ into the volume element $d\mathbf{r}'\, d\mathbf{k}'$
in the interval $dt'$ about $t'$ is just:


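$$ \frac{dt'}{\tau_n(\mathbf{r}_n(t'), \mathbf{k}_n(t'))}\, g_n^0(\mathbf{r}_n(t'), \mathbf{k}_n(t'))\, \frac{d\mathbf{r}\, d\mathbf{k}}{4\pi^3}, \tag{13.6} $$

where we have exploited Liouville's theorem5 to make the replacement

$$ d\mathbf{r}'\, d\mathbf{k}' = d\mathbf{r}\, d\mathbf{k}. \tag{13.7} $$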

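Of this number, only a fraction $P_n(\mathbf{r}, \mathbf{k}, t; t')$ (which we shall compute below) actually
survive from time $t'$ to time $t$ without suffering any further collisions. Therefore $dN$
is given by multiplying (13.6) by this probability and summing over all possible times
$t'$ for the last collision prior to $t$:

$$ dN = \frac{d\mathbf{r}\, d\mathbf{k}}{4\pi^3} \int_{-\infty}^{t} \frac{dt'\, g_n^0(\mathbf{r}_n(t'), \mathbf{k}_n(t'))\, P_n(\mathbf{r}, \mathbf{k}, t; t')}{\tau_n(\mathbf{r}_n(t'), \mathbf{k}_n(t'))}. \tag{13.8} $$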


4 Note the different roles played by the fields, which determine the electronic motion between
collisions, and the temperature gradient, which determines the form (13.3) taken by the distribution of electrons
emerging from collisions.

5 The theorem states that volumes in rk-space are preserved by the semiclassical equations of motion.
It is proved in Appendix H.
Comparing this with (13.4) gives

$$ g_n(\mathbf{r}, \mathbf{k}, t) = \int_{-\infty}^{t} \frac{dt'\, g_n^0(\mathbf{r}_n(t'), \mathbf{k}_n(t'))\, P_n(\mathbf{r}, \mathbf{k}, t; t')}{\tau_n(\mathbf{r}_n(t'), \mathbf{k}_n(t'))}. \tag{13.9} $$

The structure of the result (13.9) is somewhat obscured by the notation, which
reminds us explicitly that the distribution function is for the $n$th band evaluated at
the point $\mathbf{r}$, $\mathbf{k}$, and that the $t'$ dependence of the integrand is determined by evaluating
$g_n^0$ and $\tau_n$ at the point $\mathbf{r}_n(t')$, $\mathbf{k}_n(t')$ of the semiclassical trajectory that passes through
$\mathbf{r}$, $\mathbf{k}$ at time $t$. To avoid obscuring the simplicity of some further manipulations on
(13.9) we temporarily adopt an abbreviated notation in which the band index $n$, the
point $\mathbf{r}$, $\mathbf{k}$, and the trajectories $\mathbf{r}_n$, $\mathbf{k}_n$ are fixed and left implicit. Thus

$$ g_n(\mathbf{r}, \mathbf{k}, t) \to g(t), \qquad g_n^0(\mathbf{r}_n(t'), \mathbf{k}_n(t')) \to g^0(t'), $$

$$ \tau_n(\mathbf{r}_n(t'), \mathbf{k}_n(t')) \to \tau(t'), \qquad P_n(\mathbf{r}, \mathbf{k}, t; t') \to P(t, t'), \tag{13.10} $$

so  that  (13.9)  can  be  written:6 


$$ g(t) = \int_{-\infty}^{t} \frac{dt'}{\tau(t')}\, g^0(t')\, P(t, t'). \tag{13.11} $$


We  emphasize  the  simple  structure  of  this  formula.  The  electrons  in  a  given  phase 
space  element  at  time  t  are  grouped  according  to  the  time  of  their  last  collision.  The 
number  of  electrons  whose  last  collisions  were  in  the  time  interval  dt'  about  t'  is  the 
product  of  two  factors: 


1. The total number emerging from collisions in that time interval that are aimed
so that if no further collision intervenes they will reach the given phase space
element at time $t$. This number is determined by the relaxation-time
approximation (13.3).

2.  The  fraction  P(t,  t')  of  those  electrons  specified  in  (1)  that  actually  do  survive 
from  t'  to  t  without  collisions. 


It remains to compute $P(t, t')$, the fraction of electrons in band $n$ that traverse the
trajectory passing through $\mathbf{r}$, $\mathbf{k}$ at time $t$, without suffering collisions between $t'$ and
$t$. The fraction that survive from $t'$ to $t$ is less than the fraction that survive from
$t' + dt'$ to $t$ by the factor $[1 - dt'/\tau(t')]$, which gives the probability of an electron
not colliding between $t'$ and $t' + dt'$. Thus

$$ P(t, t') = P(t, t' + dt') \left[ 1 - \frac{dt'}{\tau(t')} \right]. \tag{13.12} $$

In the limit as $dt' \to 0$, this gives the differential equation

$$ \frac{\partial}{\partial t'} P(t, t') = \frac{P(t, t')}{\tau(t')}, \tag{13.13} $$


6  This  result  and  the  method  by  which  it  was  constructed  are  associated  with  the  name  of  R.  G. 
Chambers,  Proc.  Phys.  Soc.  (London)  81,  877  (1963). 
whose solution, subject to the boundary condition

$$ P(t, t) = 1, \tag{13.14} $$

is

$$ P(t, t') = \exp\left( -\int_{t'}^{t} \frac{d\bar{t}}{\tau(\bar{t})} \right). \tag{13.15} $$
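As a quick check (a worked step added for illustration, with $\bar{t}$ a dummy integration variable), differentiating the exponential solution of (13.15) with respect to its lower limit $t'$ reproduces (13.13), and setting $t' = t$ makes the integral vanish, which gives (13.14):

```latex
\frac{\partial}{\partial t'} \exp\left( -\int_{t'}^{t} \frac{d\bar{t}}{\tau(\bar{t})} \right)
  = \frac{1}{\tau(t')} \exp\left( -\int_{t'}^{t} \frac{d\bar{t}}{\tau(\bar{t})} \right)
  = \frac{P(t, t')}{\tau(t')},
\qquad
P(t, t) = e^{0} = 1 .
```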


We  may  use  Eq.  (13.13)  to  rewrite  the  distribution  function  (13.11)  in  the  form 


$$ g(t) = \int_{-\infty}^{t} dt'\, g^0(t')\, \frac{\partial}{\partial t'} P(t, t'). \tag{13.16} $$


It is convenient to integrate Eq. (13.16) by parts, using Eq. (13.14) and the physical
condition that no electron can survive infinitely long without a collision: $P(t, -\infty) = 0$.
The result is

$$ g(t) = g^0(t) - \int_{-\infty}^{t} dt'\, P(t, t')\, \frac{d}{dt'} g^0(t'), \tag{13.17} $$


which  expresses  the  distribution  function  as  the  local  equilibrium  distribution  plus 
a  correction. 
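The equivalence of the two forms (13.11) and (13.17) can be checked numerically. The following is a minimal sketch, not part of the original text: the functions chosen for $\tau(t')$ and $g^0(t')$ along the trajectory are arbitrary smooth stand-ins, the lower limit $-\infty$ is replaced by a cutoff many relaxation times in the past, and the helper names `trapezoid` and `chambers_two_forms` are hypothetical.

```python
import numpy as np


def trapezoid(y, x):
    """Composite trapezoid rule for samples y on the grid x."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))


def chambers_two_forms(t=0.0, span=60.0, n=200_001):
    """Evaluate g(t) from (13.11) and from (13.17) on the same t' grid."""
    tp = np.linspace(t - span, t, n)      # candidate last-collision times t'
    tau = 1.0 + 0.5 * np.sin(tp)          # assumed tau(t'), always positive
    g0 = 0.5 + 0.3 * np.cos(0.7 * tp)     # assumed g0(t') along the trajectory
    dg0 = -0.21 * np.sin(0.7 * tp)        # exact t' derivative of g0

    # Running integral of 1/tau from tp[0] up to each t'.
    inv_tau = 1.0 / tau
    steps = 0.5 * (inv_tau[1:] + inv_tau[:-1]) * np.diff(tp)
    running = np.concatenate(([0.0], np.cumsum(steps)))

    # (13.15): P(t, t') = exp(-integral from t' to t of ds / tau(s)).
    P = np.exp(-(running[-1] - running))

    g_from_11 = trapezoid(g0 * P / tau, tp)        # (13.11)
    g_from_17 = g0[-1] - trapezoid(P * dg0, tp)    # (13.17)
    return g_from_11, g_from_17


if __name__ == "__main__":
    a, b = chambers_two_forms()
    print(f"(13.11): {a:.8f}   (13.17): {b:.8f}   |difference|: {abs(a - b):.2e}")
```

The two numbers differ only by discretization error and by the neglected boundary term $g^0 P$ at the cutoff, which is exponentially small when the cutoff lies many relaxation times before $t$.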

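To evaluate the time derivative of $g^0$, note (see Eqs. (13.10) and (13.2)) that it
depends on time only through $\varepsilon_n(\mathbf{k}_n(t'))$, $T(\mathbf{r}_n(t'))$, and $\mu(\mathbf{r}_n(t'))$, so that7

$$ \frac{dg^0(t')}{dt'} = \frac{\partial g^0}{\partial \varepsilon_n} \frac{\partial \varepsilon_n}{\partial \mathbf{k}} \cdot \frac{d\mathbf{k}_n}{dt'} + \frac{\partial g^0}{\partial T} \frac{\partial T}{\partial \mathbf{r}} \cdot \frac{d\mathbf{r}_n}{dt'} + \frac{\partial g^0}{\partial \mu} \frac{\partial \mu}{\partial \mathbf{r}} \cdot \frac{d\mathbf{r}_n}{dt'}. \tag{13.18} $$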


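If we use the semiclassical equations of motion (12.6) to eliminate $d\mathbf{r}_n/dt'$ and $d\mathbf{k}_n/dt'$
from Eq. (13.18), then Eq. (13.17) can be written as

$$ g(t) = g^0 + \int_{-\infty}^{t} dt'\, P(t, t') \left[ \left( -\frac{\partial f}{\partial \varepsilon} \right) \mathbf{v} \cdot \left( -e\mathbf{E} - \nabla\mu - \left( \frac{\varepsilon - \mu}{T} \right) \nabla T \right) \right], \tag{13.19} $$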


where $f$ is the Fermi function (13.1) (evaluated at the local temperature and chemical
potential), and all the quantities in the brackets8 depend on $t'$ through their arguments
$\mathbf{r}_n(t')$ and $\mathbf{k}_n(t')$.


SIMPLIFICATION OF THE NONEQUILIBRIUM
DISTRIBUTION FUNCTION IN SPECIAL CASES

Equation (13.19) gives the semiclassical distribution function in the relaxation-time
approximation under very general conditions, and is therefore applicable to a great
variety of problems. In many cases, however, special circumstances permit
considerably further simplification:


7 If we were interested in applications in which the local temperature and chemical potential had an
explicit time dependence, we should have to add terms in $\partial T/\partial t$ and $\partial\mu/\partial t$ to (13.18). An example is given
in Problem 4.

8 Note that a magnetic field $\mathbf{H}$ will not appear explicitly in (13.19) since the Lorentz force is
perpendicular to $\mathbf{v}$. (It will, of course, appear implicitly through the time dependence of $\mathbf{r}_n(t')$ and $\mathbf{k}_n(t')$.)
1. Weak Electric Fields and Temperature Gradients. As noted in Chapter 1, the
electric fields and temperature gradients commonly applied to metals are almost
invariably weak enough to permit calculation of the induced currents to linear order.9
Since the second term in (13.19) is explicitly linear10 in $\mathbf{E}$ and $\nabla T$, the $t'$ dependence
of the integrand can be calculated at zero electric field and constant $T$.

2. Spatially Uniform Electromagnetic Fields and Temperature Gradients, and
Position-Independent Relaxation Times.11 In this case the entire integrand in (13.19)
will be independent of $\mathbf{r}_n(t')$. The only $t'$ dependence (aside from possible explicit
dependence of $\mathbf{E}$ and $\nabla T$ on time) will be through $\mathbf{k}_n(t')$, which will be time-dependent
if a magnetic field is present. Since the Fermi function $f$ depends on $\mathbf{k}$ only through
$\varepsilon_n(\mathbf{k})$, which is conserved in a magnetic field, the $t'$ dependence of the integrand in
(13.19) will be entirely contained in $P(t, t')$, $\mathbf{v}(\mathbf{k}_n(t'))$, and (if they are time-dependent)
$\mathbf{E}$ and $\nabla T$.

3. Energy-Dependent Relaxation Time. If $\tau$ depends on wave vector only through
$\varepsilon_n(\mathbf{k})$, then since $\varepsilon_n(\mathbf{k})$ is conserved in a magnetic field, $\tau(t')$ will not depend on $t'$, and
(13.15) reduces to

$$ P(t, t') = e^{-(t - t')/\tau_n(\mathbf{k})}. \tag{13.20} $$

There is no compelling reason why $\tau$ should depend on $\mathbf{k}$ only through $\varepsilon_n(\mathbf{k})$ in
anisotropic systems, but when the nature of the electronic scattering does depend
significantly on wave vector, the entire relaxation-time approximation is probably of
questionable validity (see Chapter 16). Therefore most calculations in the
relaxation-time approximation do make this additional simplification, and often even use
a constant (energy-independent) $\tau$. Since the distribution function (13.19) contains
a factor $\partial f/\partial\varepsilon$, which is negligible except within $O(k_B T)$ of the Fermi energy, only the
energy dependence of $\tau(\varepsilon)$ in the neighborhood of $\varepsilon_F$ is significant in metals.

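Under these conditions we can rewrite (13.19) as

$$ g(\mathbf{k}, t) = g^0(\mathbf{k}) + \int_{-\infty}^{t} dt'\, e^{-(t - t')/\tau(\varepsilon(\mathbf{k}))} \left( -\frac{\partial f}{\partial \varepsilon} \right) \mathbf{v}(\mathbf{k}(t')) \cdot \left[ -e\mathbf{E}(t') - \nabla\mu(t') - \frac{\varepsilon(\mathbf{k}) - \mu}{T}\, \nabla T(t') \right], \tag{13.21} $$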


9 The linearization can be justified directly from (13.19) by first observing that the probability of an
electron suffering no collisions in an interval of given length prior to $t$ becomes negligibly small when the
length of the interval is appreciably greater than $\tau$. Consequently, only times $t'$ within order $\tau$ of $t$ contribute
appreciably to the integral in (13.19). However (see page 225), within such a time an electric field perturbs
the electronic k-vector by an amount that is minute compared with the dimensions of the zone. This
immediately implies that the $\mathbf{E}$ dependence of all terms in (13.19) is very weak. In a similar way, one can
justify linearizing in the temperature gradient provided that the change in temperature over a mean free
path is a small fraction of the prevailing temperature. However, one cannot linearize in the magnetic field,
since it is entirely possible to produce magnetic fields in metals so strong that an electron can move distances
in k-space comparable to the size of the zone within a relaxation time.

10 The chemical potential varies in space only because the temperature does (see footnote 1), so $\nabla\mu$
is of order $\nabla T$.

11 In general, one might wish to allow $\tau$ to depend on position to take into account, for example,
inhomogeneous distributions of impurities, special scattering effects associated with surfaces, etc.
where we have continued to suppress explicit reference to the band index $n$, but have
reintroduced explicitly the $\mathbf{k}$ and $t$ dependence.12

We  conclude  this  chapter  with  applications  of  (13.21)  to  several  cases  of  interest. 


DC  ELECTRICAL  CONDUCTIVITY 

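If $\mathbf{H} = 0$, the $\mathbf{k}(t')$ appearing in (13.21) reduces to $\mathbf{k}$, and the time integration is
elementary for static $\mathbf{E}$ and $\nabla T$. If the temperature is uniform, we find:

$$ g(\mathbf{k}) = g^0(\mathbf{k}) - e\mathbf{E} \cdot \mathbf{v}(\mathbf{k})\, \tau(\varepsilon(\mathbf{k})) \left( -\frac{\partial f}{\partial \varepsilon} \right). \tag{13.22} $$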


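Since the number of electrons per unit volume in the volume element $d\mathbf{k}$ is
$g(\mathbf{k})\, d\mathbf{k}/4\pi^3$, the current density in a band is13

$$ \mathbf{j} = -e \int \frac{d\mathbf{k}}{4\pi^3}\, \mathbf{v}(\mathbf{k})\, g. \tag{13.23} $$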


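Each partially filled band makes such a contribution to the current density; the total
current density is the sum of these contributions over all bands. From (13.22) and
(13.23) it can be written as $\mathbf{j} = \boldsymbol{\sigma}\mathbf{E}$, where the conductivity tensor $\boldsymbol{\sigma}$ is a sum of
contributions from each band:14

$$ \boldsymbol{\sigma} = \sum_n \boldsymbol{\sigma}^{(n)}, \tag{13.24} $$

$$ \boldsymbol{\sigma}^{(n)} = e^2 \int \frac{d\mathbf{k}}{4\pi^3}\, \tau_n(\varepsilon_n(\mathbf{k}))\, \mathbf{v}_n(\mathbf{k}) \mathbf{v}_n(\mathbf{k}) \left( -\frac{\partial f}{\partial \varepsilon} \right)_{\varepsilon = \varepsilon_n(\mathbf{k})}. \tag{13.25} $$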

The  following  properties  of  the  conductivity  are  worthy  of  note: 

1. Anisotropy. In free electron theory $\mathbf{j}$ is parallel to $\mathbf{E}$, i.e. the tensor $\boldsymbol{\sigma}$ is diagonal:
$\sigma_{\mu\nu} = \sigma\delta_{\mu\nu}$. In a general crystal structure $\mathbf{j}$ need not be parallel to $\mathbf{E}$, and the
conductivity will be a tensor. In a crystal of cubic symmetry, however, $\mathbf{j}$ remains parallel
to $\mathbf{E}$, for if the x-, y-, and z-axes are taken along the cubic axes then $\sigma_{xx} = \sigma_{yy} = \sigma_{zz}$.
Furthermore, if a field in the x-direction induced any current in the y-direction, by
exploiting the cubic symmetry one could equally well predict that the same current
must arise in the −y-direction. The only consistent possibility is zero current, so that
$\sigma_{xy}$ must vanish (and, by symmetry, so must the other off-diagonal components).
Hence $\sigma_{\mu\nu} = \sigma\delta_{\mu\nu}$ in crystals of cubic symmetry.

2. Irrelevance of Filled Bands. The Fermi function has negligible derivative
except when $\varepsilon$ is within $k_B T$ of $\varepsilon_F$. Hence filled bands make no contribution to the
conductivity, in accordance with the general discussion on pages 221-223.


12 The quantity $\mathbf{k}(t')$ is the solution to the semiclassical equation of motion for band $n$ in a uniform
magnetic field $\mathbf{H}$ that is equal to $\mathbf{k}$ when $t' = t$.

13 Here and elsewhere in this chapter integrations over $\mathbf{k}$ are assumed to be over a primitive cell
unless otherwise specified.

14 Since no current flows in equilibrium, the leading term in the distribution function, $g^0$, makes no
contribution to (13.23). We are using a tensor notation in which $\mathbf{A} = \mathbf{b}\mathbf{c}$ stands for $A_{\mu\nu} = b_\mu c_\nu$.
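3. Equivalence of Particle and Hole Pictures in Metals. In a metal, to an accuracy
of order $(k_B T/\varepsilon_F)^2$ we can evaluate15 (13.25) at $T = 0$. Since $(-\partial f/\partial\varepsilon) = \delta(\varepsilon - \varepsilon_F)$,
the relaxation time can be evaluated at $\varepsilon_F$ and taken outside of the integral.
Furthermore, since16

$$ \mathbf{v}(\mathbf{k}) \left( -\frac{\partial f}{\partial \varepsilon} \right)_{\varepsilon = \varepsilon(\mathbf{k})} = -\frac{1}{\hbar} \frac{\partial}{\partial \mathbf{k}} f(\varepsilon(\mathbf{k})), \tag{13.26} $$

we may integrate by parts17 to find

$$ \boldsymbol{\sigma} = e^2 \tau(\varepsilon_F) \int_{\text{occupied levels}} \frac{d\mathbf{k}}{4\pi^3}\, \mathbf{M}^{-1}(\mathbf{k}). \tag{13.27} $$

Since $\mathbf{M}^{-1}(\mathbf{k})$ is the derivative of a periodic function, its integral over the entire
primitive cell must vanish,18 and we may write (13.27) in the alternative form

$$ \boldsymbol{\sigma} = e^2 \tau(\varepsilon_F) \int_{\text{unoccupied levels}} \frac{d\mathbf{k}}{4\pi^3} \left( -\mathbf{M}^{-1}(\mathbf{k}) \right). \tag{13.28} $$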

Comparing  these  two  forms,  we  find  that  the  contribution  to  the  current  can  be 
regarded  as  coming  from  the  unoccupied  rather  than  the  occupied  levels,  provided 
that  the  sign  of  the  effective  mass  tensor  is  changed.  This  result  was  already  implied 
by  our  discussion  of  holes  (pages  225-229),  but  is  repeated  here  to  emphasize  that 
it  emerges  from  a  more  formal  analysis  as  well. 

4. Recovery of the Free Electron Result. If $M^{-1}_{\mu\nu} = (1/m^*)\delta_{\mu\nu}$ independent of
$\mathbf{k}$ for all occupied levels in the band, then (13.27) reduces to the Drude form (Eq. (1.6)):

$$ \sigma_{\mu\nu} = \frac{ne^2\tau}{m^*}\, \delta_{\mu\nu}, \tag{13.29} $$

with an effective mass. If $M^{-1}_{\mu\nu} = -(1/m^*)\delta_{\mu\nu}$ independent of $\mathbf{k}$ for all unoccupied
levels,19 then (13.28) reduces to

$$ \sigma_{\mu\nu} = \frac{n_h e^2\tau}{m^*}\, \delta_{\mu\nu}, \tag{13.30} $$

where $n_h$ is the number of unoccupied levels per unit volume; i.e., the conductivity
of the band is of the Drude form, with $m$ replaced by the effective mass $m^*$ and the
electronic density replaced by the density of holes.
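Properties 1 and 3 can be illustrated numerically. The sketch below is not from the text: it assumes a toy simple cubic tight-binding band $\varepsilon(\mathbf{k}) = -2(\cos k_x + \cos k_y + \cos k_z)$ in units where $\hbar$, the lattice constant, and the hopping energy are all 1, takes $\tau$ constant, and works at finite temperature, so that the sharp restriction to occupied (or unoccupied) levels in (13.27) and (13.28) is replaced by the weights $f$ and $1 - f$. The function name `cubic_band_conductivity` and its parameters are hypothetical.

```python
import numpy as np


def cubic_band_conductivity(mu=-3.0, kT=0.4, n=64):
    """Conductivity tensors in units of e^2 * tau for an assumed toy band.

    Returns (kinetic, occupied, holes):
      kinetic  ~ (13.25): integral of v v (-df/d eps)
      occupied ~ (13.27): integral of f * M^{-1}
      holes    ~ (13.28): integral of (1 - f) * (-M^{-1})
    """
    k1 = -np.pi + 2.0 * np.pi * np.arange(n) / n       # uniform grid over the cell
    ks = np.meshgrid(k1, k1, k1, indexing="ij")
    eps = -2.0 * sum(np.cos(k) for k in ks)
    x = (eps - mu) / kT
    f = 1.0 / (np.exp(x) + 1.0)                          # Fermi function
    minus_dfde = 1.0 / (4.0 * kT * np.cosh(0.5 * x) ** 2)
    measure = (2.0 * np.pi / n) ** 3 / (4.0 * np.pi ** 3)  # dk / (4 pi^3)

    v = [2.0 * np.sin(k) for k in ks]                    # v = d eps / dk
    inv_mass = [2.0 * np.cos(k) for k in ks]             # diagonal of M^{-1}

    kinetic = np.zeros((3, 3))
    occupied = np.zeros((3, 3))
    holes = np.zeros((3, 3))
    for a in range(3):
        for b in range(3):
            kinetic[a, b] = measure * np.sum(v[a] * v[b] * minus_dfde)
        # M^{-1} is diagonal for this band, so only a == b contributes.
        occupied[a, a] = measure * np.sum(f * inv_mass[a])
        holes[a, a] = measure * np.sum((1.0 - f) * (-inv_mass[a]))
    return kinetic, occupied, holes


if __name__ == "__main__":
    kinetic, occupied, holes = cubic_band_conductivity()
    np.set_printoptions(precision=6, suppress=True)
    print("from (13.25):\n", kinetic)
    print("from occupied levels:\n", occupied)
    print("from unoccupied levels:\n", holes)
```

In this sketch the three tensors should agree, with vanishing off-diagonal elements and equal diagonal elements, as the cubic symmetry argument requires.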


15  See  Appendix  C. 

16  We  again  drop  explicit  reference  to  the  band  index.  The  formulas  below  give  the  conductivity  for 
a  solid  with  a  single  band  of  carriers.  If  there  is  more  than  one  band  of  carriers,  one  must  sum  on  n  to 
get  the  full  conductivity. 

17  See  Appendix  I. 

18 This follows from the identity (I.1) of Appendix I, with one of the periodic functions taken to be
unity. The mass tensor $\mathbf{M}^{-1}(\mathbf{k})$ is defined in Eq. (12.29).

19  Since  the  mass  tensor  is  negative  definite  at  the  band  maximum,  m*  will  be  positive.  For  the  relation 
between  k-space  volumes  and  particle  densities  see  page  35. 
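AC ELECTRICAL CONDUCTIVITY


If the electric field is not static, but has the time dependence

$$ \mathbf{E}(t) = \text{Re}\left[ \mathbf{E}(\omega) e^{-i\omega t} \right], \tag{13.31} $$

then the derivation of the conductivity from (13.21) proceeds just as in the DC case,
except for an additional factor $e^{-i\omega t'}$ in the integrand. One finds

$$ \mathbf{j}(t) = \text{Re}\left[ \mathbf{j}(\omega) e^{-i\omega t} \right], \tag{13.32} $$

where

$$ \mathbf{j}(\omega) = \boldsymbol{\sigma}(\omega) \cdot \mathbf{E}(\omega), \qquad \boldsymbol{\sigma}(\omega) = \sum_n \boldsymbol{\sigma}^{(n)}(\omega), \tag{13.33} $$

and

$$ \boldsymbol{\sigma}^{(n)}(\omega) = e^2 \int \frac{d\mathbf{k}}{4\pi^3}\, \frac{\mathbf{v}_n(\mathbf{k}) \mathbf{v}_n(\mathbf{k}) \left( -\partial f/\partial \varepsilon \right)_{\varepsilon = \varepsilon_n(\mathbf{k})}}{\left[ 1/\tau_n(\varepsilon_n(\mathbf{k})) \right] - i\omega}. \tag{13.34} $$

Thus, as in the free electron case (Eq. (1.29)), the AC conductivity is just the DC
conductivity divided by $1 - i\omega\tau$, except that we must now allow for the possibility
that the relaxation time may differ from band to band.20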
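The frequency-dependent denominator can be traced to a single elementary integral. As a worked step (added for illustration, assuming $\mathbf{H} = 0$, uniform temperature, and a relaxation time $\tau$ that does not vary along the trajectory), substitute $s = t - t'$ in the time integral of (13.21):

```latex
\int_{-\infty}^{t} dt'\, e^{-(t - t')/\tau}\, e^{-i\omega t'}
  = e^{-i\omega t} \int_{0}^{\infty} ds\, e^{-s/\tau}\, e^{i\omega s}
  = \frac{e^{-i\omega t}}{(1/\tau) - i\omega}
  = \frac{\tau\, e^{-i\omega t}}{1 - i\omega\tau} .
```

Setting $\omega = 0$ recovers the factor $\tau$ that appears in the static result (13.22).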

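The form (13.34) permits a simple direct test of the validity of the semiclassical
model in the limit $\omega\tau \gg 1$, where it reduces to

$$ \boldsymbol{\sigma}^{(n)}(\omega) = -\frac{e^2}{i\omega} \int \frac{d\mathbf{k}}{4\pi^3}\, \mathbf{v}_n(\mathbf{k}) \mathbf{v}_n(\mathbf{k}) \left( -\frac{\partial f}{\partial \varepsilon} \right)_{\varepsilon = \varepsilon_n(\mathbf{k})}, \tag{13.35} $$

or, equivalently (as derived in the DC case),

$$ \sigma^{(n)}_{\mu\nu}(\omega) = -\frac{e^2}{i\omega} \int \frac{d\mathbf{k}}{4\pi^3}\, f(\varepsilon_n(\mathbf{k}))\, \frac{1}{\hbar^2} \frac{\partial^2 \varepsilon_n(\mathbf{k})}{\partial k_\mu\, \partial k_\nu}. \tag{13.36} $$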


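Equation (13.36) determines the current induced to linear order in the AC electric
field in the absence of collisions, since the high $\omega\tau$ limit can be interpreted to mean
$\tau \to \infty$ for fixed $\omega$. In the absence of collisions, however, it is an elementary
quantum-mechanical calculation to compute exactly21 the change to linear order in the Bloch
wave functions induced by the electric field. Given these wave functions, one can
calculate the expectation value of the current operator to linear order in the field,
thereby arriving at a fully quantum-mechanical form for $\boldsymbol{\sigma}(\omega)$ that is not based on
the approximations of the semiclassical model. Such a calculation is a straightforward
exercise in first-order time-dependent perturbation theory. It is slightly too lengthy
to include here, and we only quote the result:22

$$ \sigma^{(n)}_{\mu\nu}(\omega) = -\frac{e^2}{i\omega} \int \frac{d\mathbf{k}}{4\pi^3}\, \frac{f(\varepsilon_n(\mathbf{k}))}{\hbar^2} \left[ \frac{\hbar^2}{m}\, \delta_{\mu\nu} - \frac{\hbar^4}{m^2} \sum_{n' \neq n} \left( \frac{\langle n\mathbf{k}|\nabla_\mu|n'\mathbf{k}\rangle \langle n'\mathbf{k}|\nabla_\nu|n\mathbf{k}\rangle}{\hbar\omega + \varepsilon_n(\mathbf{k}) - \varepsilon_{n'}(\mathbf{k})} + \frac{\langle n\mathbf{k}|\nabla_\nu|n'\mathbf{k}\rangle \langle n'\mathbf{k}|\nabla_\mu|n\mathbf{k}\rangle}{-\hbar\omega + \varepsilon_n(\mathbf{k}) - \varepsilon_{n'}(\mathbf{k})} \right) \right]. \tag{13.37} $$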


20 Within each band $\tau_n(\varepsilon)$ can be replaced by $\tau_n(\varepsilon_F)$ with negligible error in metals.

21  Within  the  independent  electron  approximation. 

22  The  notation  for  the  matrix  elements  of  the  gradient  operator  is  as  in  Appendix  E. 
In general, this is quite different from (13.36). If, however, $\hbar\omega$ is small compared
with the band gap for all occupied levels, then the frequencies in the denominators
of (13.37) can be ignored, and the quantity in brackets reduces to the expression for
$\partial^2 \varepsilon_n(\mathbf{k})/\partial k_\mu\, \partial k_\nu$ derived in Appendix E (Eq. (E.11)). Equation (13.37) then reduces to
the semiclassical result (13.36), confirming the assertion made in Chapter 12 that the
semiclassical analysis should be valid provided that $\hbar\omega \ll \varepsilon_{\text{gap}}$ (Eq. (12.10)).23

THERMAL  CONDUCTIVITY 

In  Chapters  1  and  2  we  described  the  thermal  current  density  as  analogous  to  the 
electrical  current  density,  with  thermal  energy  being  carried  rather  than  electric 
charge.  We  can  now  give  a  more  precise  definition  of  the  thermal  current. 

Consider a small fixed region of the solid within which the temperature is effectively
constant. The rate at which heat appears in the region is just $T$ times the rate at which
the entropy of the electrons within the region changes ($dQ = T\,dS$). Thus24 the
thermal current density $\mathbf{j}^q$ is just the product of the temperature with the entropy
current density, $\mathbf{j}^s$:

$$ \mathbf{j}^q = T\mathbf{j}^s. \tag{13.38} $$


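Since the volume of the region is fixed, changes in the entropy in the region are
related to changes in the internal energy and number of electrons by the
thermodynamic identity:

$$ T\,dS = dU - \mu\,dN, \tag{13.39} $$

or, in terms of current densities,

$$ T\mathbf{j}^s = \mathbf{j}^\varepsilon - \mu\,\mathbf{j}^n, \tag{13.40} $$

where the energy and number current densities are given by25

$$ \mathbf{j}^\varepsilon = \sum_n \int \frac{d\mathbf{k}}{4\pi^3}\, \varepsilon_n(\mathbf{k})\, \mathbf{v}_n(\mathbf{k})\, g_n(\mathbf{k}), \qquad \mathbf{j}^n = \sum_n \int \frac{d\mathbf{k}}{4\pi^3}\, \mathbf{v}_n(\mathbf{k})\, g_n(\mathbf{k}). \tag{13.41} $$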

23 As long as $\hbar\omega$ is small enough that no denominators in (13.37) vanish, the more general result
simply provides quantitative corrections to the semiclassical approximation, which could, for example,
be cast in the form of a power series in $\hbar\omega/\varepsilon_{\text{gap}}$. However, when $\hbar\omega$ becomes large enough for denominators
to vanish (i.e., when the photon energy is large enough to cause interband transitions), then the semiclassical
result fails qualitatively as well, for the detailed derivation of (13.37) includes the stipulation that when
the denominator is singular, the result is to be interpreted in the limit as $\omega$ approaches the real axis of
the complex frequency plane from above. (When no denominator vanishes, the result is independent of
any infinitesimal imaginary part $\omega$ may have.) This introduces a real part into the conductivity, providing
a mechanism for absorption in the absence of collisions, which the semiclassical model is incapable of
producing. This additional real part is of critical importance in understanding the properties of metals
at optical frequencies (see Chapter 15), in which interband transitions play a crucial role.

24 This assumes that the entropy within the region changes only because electrons carry it in or out.
Entropy can also be generated within the region through collisions. However, such entropy production
can be shown to be a second-order effect in the applied temperature gradient and electric field (Joule
heating, the "$I^2R$ loss," being the most familiar example) and can therefore be ignored in a linear theory.

25 Note that these have the same form as the electrical current density except that the quantity carried
by each electron is no longer its charge $(-e)$ but its energy $(\varepsilon_n(\mathbf{k}))$ or its number (unity). Note also that
the number current is just the electric current divided by the charge: $\mathbf{j} = -e\mathbf{j}^n$. (Do not confuse the
superscript $n$, indicating that $\mathbf{j}^n$ is the number current density, with the band index $n$.)
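Combining (13.40) and (13.41) we find a thermal current density26

$$ \mathbf{j}^q = \sum_n \int \frac{d\mathbf{k}}{4\pi^3} \left[ \varepsilon_n(\mathbf{k}) - \mu \right] \mathbf{v}_n(\mathbf{k})\, g_n(\mathbf{k}). \tag{13.42} $$

The distribution function appearing in (13.42) is given by (13.21), evaluated at $\mathbf{H} = 0$
in the presence of a uniform static electric field and temperature gradient27:

$$ g(\mathbf{k}) = g^0(\mathbf{k}) + \tau(\varepsilon(\mathbf{k})) \left( -\frac{\partial f}{\partial \varepsilon} \right) \mathbf{v}(\mathbf{k}) \cdot \left[ -e\boldsymbol{\mathcal{E}} + \frac{\varepsilon(\mathbf{k}) - \mu}{T} (-\nabla T) \right], \tag{13.43} $$

where

$$ \boldsymbol{\mathcal{E}} = \mathbf{E} + \frac{\nabla\mu}{e}. \tag{13.44} $$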


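We can construct the electrical current density (13.23) and the thermal current density
(13.42) from this distribution function:

$$ \mathbf{j} = \mathbf{L}^{11} \boldsymbol{\mathcal{E}} + \mathbf{L}^{12} (-\nabla T), $$

$$ \mathbf{j}^q = \mathbf{L}^{21} \boldsymbol{\mathcal{E}} + \mathbf{L}^{22} (-\nabla T), \tag{13.45} $$

where the matrices $\mathbf{L}^{ij}$ are defined28 in terms of

$$ \boldsymbol{\mathcal{L}}^{(\alpha)} = e^2 \int \frac{d\mathbf{k}}{4\pi^3} \left( -\frac{\partial f}{\partial \varepsilon} \right) \tau(\varepsilon(\mathbf{k}))\, \mathbf{v}(\mathbf{k}) \mathbf{v}(\mathbf{k}) \left( \varepsilon(\mathbf{k}) - \mu \right)^\alpha \tag{13.46} $$

by

$$ \mathbf{L}^{11} = \boldsymbol{\mathcal{L}}^{(0)}, $$

$$ \mathbf{L}^{21} = T\mathbf{L}^{12} = -\frac{1}{e} \boldsymbol{\mathcal{L}}^{(1)}, $$

$$ \mathbf{L}^{22} = \frac{1}{e^2 T} \boldsymbol{\mathcal{L}}^{(2)}. \tag{13.47} $$

The structure of these results is simplified by defining29

$$ \boldsymbol{\sigma}(\varepsilon) = e^2 \tau(\varepsilon) \int \frac{d\mathbf{k}}{4\pi^3}\, \delta(\varepsilon - \varepsilon(\mathbf{k}))\, \mathbf{v}(\mathbf{k}) \mathbf{v}(\mathbf{k}), \tag{13.48} $$

in terms of which

$$ \boldsymbol{\mathcal{L}}^{(\alpha)} = \int d\varepsilon \left( -\frac{\partial f}{\partial \varepsilon} \right) (\varepsilon - \mu)^\alpha\, \boldsymbol{\sigma}(\varepsilon). \tag{13.49} $$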


26  Since  thermal  conductivities  are  normally  measured  under  conditions  in  which  no  electric  current 
flows,  it  is  often  enough  to  identify  the  thermal  current  with  the  energy  current  (as  we  did  in  Chapter  1). 
When,  however,  heat  and  electric  charge  are  carried  simultaneously  (as  in  the  Peltier  effect,  described 
below), it is essential to use (13.42).

27  See  the  discussion  on  pages  23  and  24  for  why,  in  general,  a  temperature  gradient  will  be 
accompanied  by  an  electric  field. 

28  To  keep  the  notation  as  simple  as  possible  the  results  that  follow  are  given  for  the  case  in  which 
all  carriers  lie  in  a  single  band,  and  the  band  index  is  suppressed.  In  the  many-band  case,  each  L  must 
be  replaced  by  the  sum  of  the  L’s  for  all  the  partially  filled  bands.  This  generalization  does  not  affect  the 
validity  of  the  Wiedemann-Franz  law,  but  it  can  complicate  the  structure  of  the  thermopower. 

29 Since $(-\partial f/\partial\varepsilon) = \delta(\varepsilon - \varepsilon_F)$ to an accuracy of order $(k_B T/\varepsilon_F)^2$ in metals, the notation is meant
to remind one that the DC conductivity of a metal (13.25) is essentially $\boldsymbol{\sigma}(\varepsilon_F)$.
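To evaluate (13.49) for metals we can exploit the fact that $(-\partial f/\partial\varepsilon)$ is negligible
except within $O(k_B T)$ of $\mu \approx \varepsilon_F$. Since the integrands in $\boldsymbol{\mathcal{L}}^{(1)}$ and $\boldsymbol{\mathcal{L}}^{(2)}$ have factors
that vanish when $\varepsilon = \mu$, to evaluate them one must retain the first temperature
correction in the Sommerfeld expansion.30 When this is done, one finds with an
accuracy of order $(k_B T/\varepsilon_F)^2$

$$ \mathbf{L}^{11} = \boldsymbol{\sigma}(\varepsilon_F) = \boldsymbol{\sigma}, \tag{13.50} $$

$$ \mathbf{L}^{21} = T\mathbf{L}^{12} = -\frac{\pi^2}{3e} (k_B T)^2\, \boldsymbol{\sigma}', \tag{13.51} $$

$$ \mathbf{L}^{22} = \frac{\pi^2}{3} \frac{k_B^2 T}{e^2}\, \boldsymbol{\sigma}, \tag{13.52} $$

where

$$ \boldsymbol{\sigma}' = \left. \frac{\partial}{\partial \varepsilon} \boldsymbol{\sigma}(\varepsilon) \right|_{\varepsilon = \varepsilon_F}. \tag{13.53} $$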
Equations (13.45) and (13.50) to (13.53) are the basic results of the theory of
electronic contributions to the thermoelectric effects. They remain valid when more
than one band is partially occupied, provided only that we interpret $\boldsymbol{\sigma}(\varepsilon)$ to be the
sum of (13.48) over all partially occupied bands.

To deduce the thermal conductivity from these results, we note that it relates the
thermal current to the temperature gradient under conditions in which no electric
current flows (as discussed in Chapter 1). The first of Eqs. (13.45) determines that if
zero current flows, then

$$ \boldsymbol{\mathcal{E}} = -(\mathbf{L}^{11})^{-1} \mathbf{L}^{12} (-\nabla T). \tag{13.54} $$


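Substituting this into the second of Eqs. (13.45), we find that

$$ \mathbf{j}^q = \mathbf{K} (-\nabla T), \tag{13.55} $$

where $\mathbf{K}$, the thermal conductivity tensor, is given by

$$ \mathbf{K} = \mathbf{L}^{22} - \mathbf{L}^{21} (\mathbf{L}^{11})^{-1} \mathbf{L}^{12}. \tag{13.56} $$

It follows from Eqs. (13.50) to (13.52) and the fact that $\boldsymbol{\sigma}'$ is typically of order
$\boldsymbol{\sigma}/\varepsilon_F$, that in metals the first term in (13.56) exceeds the second by a factor of order
$(\varepsilon_F/k_B T)^2$. Thus

$$ \mathbf{K} = \mathbf{L}^{22} + O(k_B T/\varepsilon_F)^2. \tag{13.57} $$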

This  is  what  one  would  have  found  if  one  had  ignored  the  thermoelectric  field  from 
the  start.  We  emphasize  that  its  validity  requires  degenerate  Fermi  statistics.  In 
semiconductors  (13.57)  is  not  a  good  approximation  to  the  correct  result  (13.56). 

If Eq. (13.57) is evaluated using (13.52), we find that

$$ \mathbf{K} = \frac{\pi^2}{3} \left( \frac{k_B}{e} \right)^2 T \boldsymbol{\sigma}. \tag{13.58} $$

This  is  nothing  but  the  Wiedemann-Franz  law  (see  Eq.  (2.93))  with  a  vastly  more 
general  range  of  validity.  For  arbitrary  band  structure,  component  by  component, 
the  thermal  conductivity  tensor  is  proportional  to  the  electrical  conductivity  tensor 


30 See Appendix C or Eq. (2.70), page 46.
with the universal constant of proportionality $\pi^2 k_B^2 T/3e^2$. Thus this remarkable
experimental observation, made over a century ago, continues to reemerge from
successively more refined theoretical models in an essentially unaltered form.
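The size of the corrections to the Wiedemann-Franz ratio can be estimated numerically from (13.47), (13.49), and (13.56). The sketch below is not from the text and rests on stated assumptions: a scalar $\sigma(\varepsilon) \propto \varepsilon^{3/2}$ (free electrons with an energy-independent $\tau$), units in which $\varepsilon_F = e = k_B = 1$, and a chemical potential held fixed at $\varepsilon_F$, which ignores its small temperature shift. The helper names are hypothetical.

```python
import numpy as np


def trapezoid(y, x):
    """Composite trapezoid rule for samples y on the grid x."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))


def transport_integrals(kT, mu=1.0, n=400_001):
    """L^(alpha) of (13.49), alpha = 0, 1, 2, for the assumed sigma(eps) = eps**1.5."""
    eps = np.linspace(max(0.0, mu - 40.0 * kT), mu + 40.0 * kT, n)
    x = (eps - mu) / kT
    minus_dfde = 1.0 / (4.0 * kT * np.cosh(0.5 * x) ** 2)   # -df/d eps
    sigma = eps ** 1.5
    return [trapezoid(minus_dfde * (eps - mu) ** a * sigma, eps) for a in range(3)]


def lorenz_ratio(kT):
    """K / (sigma T) in units of (k_B / e)^2, using (13.47) and (13.56)."""
    T = kT                        # k_B = 1
    l0, l1, l2 = transport_integrals(kT)
    L11 = l0
    L21 = -l1                     # e = 1
    L12 = L21 / T
    L22 = l2 / T
    K = L22 - L21 * L12 / L11     # scalar form of (13.56)
    return K / (L11 * T)


if __name__ == "__main__":
    target = np.pi ** 2 / 3.0
    for kT in (0.02, 0.01, 0.005):
        ratio = lorenz_ratio(kT)
        print(f"kT/eps_F = {kT:5.3f}  K/(sigma T) = {ratio:.6f}  "
              f"relative deviation from pi^2/3 = {ratio / target - 1.0:+.2e}")
```

Under these assumptions the ratio approaches $\pi^2/3$ as $k_B T/\varepsilon_F \to 0$, with a deviation that shrinks roughly as $(k_B T/\varepsilon_F)^2$, in line with (13.57) and (13.58).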

While  rejoicing  that  the  semiclassical  model  preserves  this  elegant  result,  one 
must  not  forget  that  deviations  from  the  Wiedemann-Franz  law  are  observed.31  We 
shall  see  in  Chapter  16  that  this  is  a  failure  not  of  the  semiclassical  method,  but  of 
the  relaxation-time  approximation. 


THE  THERMOELECTRIC  POWER 

When  a  temperature  gradient  is  maintained  in  a  metal  and  no  electric  current  is 
allowed  to  flow,  there  will  be  a  steady-state  electrostatic  potential  difference  between 
the  high-  and  low-temperature  regions  of  the  specimen.32  Measuring  this  potential 
drop  is  not  completely  straightforward  for  several  reasons: 

1. To measure voltages accurately enough to detect a thermoelectric voltage, it is
essential that the voltmeter connect points of the specimen at the same temperature.
Otherwise, since the leads to the meter are in thermal equilibrium with the
specimen at the contact points, there would be a temperature gradient within the
circuitry of the meter itself, accompanied by an additional thermoelectric voltage.
Since no thermoelectric voltage develops between points of a single metal at the
same temperature, one must use a circuit of two different metals (Figure 13.1),
connected so that one junction is at a temperature $T_1$ and the other (bridged only
by the voltmeter) at a temperature $T_0 \neq T_1$. Such a measurement yields the
difference in the thermoelectric voltages developed in the two metals.

Figure 13.1

Circuit for measuring the difference in thermoelectric voltages
developed in two different metals, in each of which the temperature
varies from $T_0$ to $T_1$.

2.  To  measure  the  absolute  thermoelectric  voltage  in  a  metal,  one  can  exploit  the 
fact  that  no  thermoelectric  voltage  develops  across  a  superconducting  metal.33 
Hence  when  one  of  the  metals  in  the  bimetallic  circuit  is  superconducting,  the 
measurement  yields  directly  the  thermoelectric  voltage  across  the  other.34 

3.  The  points  in  the  circuit  joined  by  the  voltmeter  have  different  electrostatic 
potentials  and  different  chemical  potentials.35  If,  as  in  most  such  devices,  the 


31  See  Chapter  3. 

32  The  Seebeck  effect.  A  crude  but  elementary  discussion  of  the  underlying  physics  is  given  on  pages 
23-25. 

33  See  page  730. 

34 This makes it possible to measure the absolute thermoelectric voltage in a metal at temperatures
up to 20 K (currently the highest temperature at which superconductivity has been observed). One can
deduce it at higher temperatures from measurements of the Thomson effect (Problem 5).

35 Although electrons will flow from one metal to the other so as to equalize the chemical potentials
at the point of contact (see Chapter 18), there is still a chemical potential difference at the points joined
by the voltmeter, because the temperature variation of the chemical potential differs in the two metals.
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voltmeter  reading  is  actually  IR,  where  I  is  the  small  current  flowing  through  a 
large  resistance  R,  then  it  is  essential  to  realize  that  the  current  is  driven  not  just 
by  the  electric  field  E,  but  by  £  =  E  +  (l/ejV^i.  This  is  because  the  chemical 
potential  gradient  leads  to  a  diffusion  current,  in  addition  to  the  current  driven 
mechanically  by  the  electric  field.36  As  a  result,  the  voltmeter  reading  will  not 
be  —  JE  •  dt,  but  — JE  •  dt. 

The thermoelectric power (or thermopower) of a metal, $Q$, is defined as the
proportionality constant between the contribution of the metal to the reading of such a
voltmeter, and the temperature change:

$$-\int \boldsymbol{\mathcal{E}} \cdot d\boldsymbol{\ell} = Q\,\Delta T \tag{13.59}$$

or

$$\boldsymbol{\mathcal{E}} = Q\,\nabla T. \tag{13.60}$$

Since negligible current flows when the thermoelectric voltage is measured, Eq. (13.45)
gives37

$$Q = \frac{L^{12}}{L^{11}}, \tag{13.61}$$

or, from Eqs. (13.50) and (13.51),

$$Q = -\frac{\pi^2}{3}\,\frac{k_B^2 T}{e}\,\frac{\sigma'}{\sigma}. \tag{13.62}$$

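This has a considerably more complex structure than the free electron estimate
(2.94), which is independent38 of the relaxation time $\tau$. We can cast $\sigma'$ into a more
useful form, by differentiating (13.48):

$$\boldsymbol{\sigma}'(\varepsilon) = \frac{\tau'(\varepsilon)}{\tau(\varepsilon)}\,\boldsymbol{\sigma}(\varepsilon) + e^2\tau(\varepsilon)\int\frac{d\mathbf{k}}{4\pi^3}\,\delta'(\varepsilon - \varepsilon(\mathbf{k}))\,\mathbf{v}(\mathbf{k})\mathbf{v}(\mathbf{k}). \tag{13.63}$$

Since

$$\mathbf{v}(\mathbf{k})\,\delta'(\varepsilon - \varepsilon(\mathbf{k})) = -\frac{1}{\hbar}\frac{\partial}{\partial\mathbf{k}}\,\delta(\varepsilon - \varepsilon(\mathbf{k})), \tag{13.64}$$

an integration by parts gives39

$$\boldsymbol{\sigma}' = \frac{\tau'}{\tau}\,\boldsymbol{\sigma} + \frac{e^2\tau}{4\pi^3}\int d\mathbf{k}\,\delta(\varepsilon_F - \varepsilon(\mathbf{k}))\,\mathbf{M}^{-1}(\mathbf{k}). \tag{13.65}$$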

36  That  it  is  this  particular  combination  of  field  and  chemical  potential  gradient  that  drives  the  electric 
current  follows  from  Eq.  (13.45).  This  phenomenon  is  often  summarized  in  the  assertion  that  a  voltmeter 
measures  not  the  electric  potential  but  the  “electrochemical  potential”. 

37 For simplicity we limit the discussion to cubic metals, for which the tensors $L^{ij}$ are diagonal.

38 If we take $\tau$ to be independent of energy, then in the free electron limit $\sigma'/\sigma = 3/(2\varepsilon_F)$, and
(13.62) reduces to $Q = -(\pi^2/2e)(k_B^2 T/\varepsilon_F)$. This is a factor of 3 larger than the rough estimate (2.94).
The disparity is due to the very crude way in which thermal averages of energies and velocities were
treated in Chapters 1 and 2. It indicates that it is largely a lucky accident that the analogous derivation of
the thermal conductivity gave the correct numerical factor.

39 Although it is tempting to try to interpret $\sigma'(\varepsilon_F)$ as the variation of the physically measured DC
conductivity with some suitably controlled parameters, this cannot be justified. The quantity $\sigma'(\varepsilon_F)$ means
(within the relaxation-time approximation) no more (or less) than (13.65).

If the energy dependence of the relaxation time is unimportant, then the sign of
the thermopower is determined by the sign of the effective mass, averaged over the
Fermi surface, i.e., by whether the carriers are electrons or holes. This is consistent
with the general theory of holes described in Chapter 12, and also provides a possible
explanation for another of the anomalies of free electron theory.40

However,  the  thermopower  is  not  a  very  valuable  probe  of  the  fundamental 
electronic properties of a metal; the energy dependence of $\tau$ is not well understood,
the  validity  of  the  form  (13.65)  depends  on  the  relaxation-time  approximation,  and, 
most  important,  vibrations  of  the  lattice  can  affect  the  transport  of  thermal  energy  in 
a  way  that  makes  it  very  difficult  to  achieve  an  accurate  theory  of  the  thermopower. 
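To get a feeling for the size of the thermopower, the free electron limit quoted in footnote 38, $Q = -(\pi^2/2e)(k_B^2 T/\varepsilon_F)$, can be evaluated numerically. The sketch below is only an illustration: the function name, the choice $T = 300$ K, and the Fermi energy of 7 eV are assumptions made for the example, and the formula itself assumes an energy-independent relaxation time.

```python
import math

# Boltzmann constant in eV/K; numerically this is also k_B/|e| in V/K.
K_B_OVER_E = 8.617333e-5

def free_electron_thermopower(temperature_k, fermi_energy_ev):
    """Free electron thermopower Q = -(pi^2/2)(k_B/e)(k_B T/E_F), in V/K.

    Assumes an energy-independent relaxation time (footnote 38).
    """
    kt_over_ef = K_B_OVER_E * temperature_k / fermi_energy_ev
    return -(math.pi ** 2 / 2.0) * K_B_OVER_E * kt_over_ef

# Illustrative (assumed) inputs: room temperature and a 7 eV Fermi energy.
q_room = free_electron_thermopower(300.0, 7.0)
print(f"Q(300 K) = {q_room * 1e6:.2f} microvolt/K")
```

With these inputs the result is of order a microvolt per kelvin and negative, as expected for electron-like carriers; since $Q$ is linear in $T$ in this limit, halving the temperature halves the estimate.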


OTHER  THERMOELECTRIC  EFFECTS 


There is a variety of other thermoelectric effects. The Thomson effect is described in
Problem 5, and we mention here only the Peltier effect.41 If an electric current is
driven in a bimetallic circuit that is maintained at a uniform temperature, then heat
will be evolved at one junction and absorbed at the other (Figure 13.2). This is because
an isothermal electric current in a metal is accompanied by a thermal current,

$$\mathbf{j}^q = \Pi\,\mathbf{j}, \tag{13.66}$$

where $\Pi$ is known as the Peltier coefficient. Because the electric current is uniform in
the closed circuit and the Peltier coefficient differs from metal to metal, the thermal
currents in the two metals will not be equal, and the difference must be evolved at
one junction and supplied to the other if the uniform temperature is to be maintained.

Figure 13.2

The Peltier effect. A current $\mathbf{j}$ is
driven in a bimetallic circuit at
uniform temperature $T_0$. To
maintain the uniform temperature
it is necessary to supply
heat (via a thermal current $\mathbf{j}^q$) at
one junction, and extract it at
the other.

If we set the temperature gradient in (13.45) equal to zero, then we find that the
Peltier coefficient is given by

$$\Pi = \frac{L^{21}}{L^{11}}. \tag{13.67}$$

Because of the identity (13.51), the Peltier coefficient is simply related to the
thermopower (13.61) by

$$\Pi = TQ, \tag{13.68}$$

a relation first deduced by Lord Kelvin.
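The bookkeeping behind the Peltier effect can be sketched in a few lines: each metal carries a thermal current $\Pi I$ with $\Pi = TQ$, and the mismatch between the two metals is what must be exchanged with the surroundings at a junction. In the sketch below the function names, the two thermopower values, and the sign convention (positive means heat deposited at the junction) are assumptions chosen for illustration, not data for real metals.

```python
def peltier_coefficient(temperature_k, thermopower_v_per_k):
    """Kelvin relation (13.68): Pi = T * Q, in volts."""
    return temperature_k * thermopower_v_per_k

def junction_heat_rate(current_a, temperature_k, q_in, q_out):
    """Heat per unit time (watts) deposited at a junction.

    The current enters through the metal with thermopower q_in and leaves
    through the metal with thermopower q_out. Convention assumed here:
    a positive result means heat is evolved at the junction.
    """
    pi_in = peltier_coefficient(temperature_k, q_in)
    pi_out = peltier_coefficient(temperature_k, q_out)
    return (pi_in - pi_out) * current_a

# Hypothetical thermopowers (V/K) for two metals A and B at T0 = 300 K.
T0 = 300.0
Q_A = -1.5e-6
Q_B = 2.0e-6

heat_ab = junction_heat_rate(1.0, T0, Q_A, Q_B)  # junction where current goes A -> B
heat_ba = junction_heat_rate(1.0, T0, Q_B, Q_A)  # junction where current goes B -> A
print(heat_ab, heat_ba)
```

The two junction terms are equal and opposite, which is the statement in the text that heat evolved at one junction is absorbed at the other when the circuit is held at uniform temperature.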


40  See  Chapter  3. 

41 When a magnetic field, as well as a temperature gradient, is present, the number of possible
measurements is further multiplied. The various thermomagnetic effects (Nernst, Ettingshausen,
Righi-Leduc) are compactly summarized in H. B. Callen, Thermodynamics, Wiley, New York (1960), Chapter 17.

SEMICLASSICAL CONDUCTIVITY IN A UNIFORM MAGNETIC FIELD

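The DC electrical conductivity at uniform temperature in a uniform magnetic field
$\mathbf{H}$ can be cast in a form quite similar to the $\mathbf{H} = 0$ result (13.25). In a magnetic field
$\mathbf{v}(\mathbf{k}(t'))$ does depend on $t'$, and the integral occurring in the nonequilibrium
distribution function (13.21) can no longer be explicitly evaluated in the general case. Instead,
the zero-field result (13.25) must be replaced by

$$\boldsymbol{\sigma}^{(n)} = e^2\int\frac{d\mathbf{k}}{4\pi^3}\,\tau_n(\varepsilon_n(\mathbf{k}))\,\mathbf{v}_n(\mathbf{k})\,\bar{\mathbf{v}}_n(\mathbf{k})\left(-\frac{\partial f}{\partial\varepsilon}\right)_{\varepsilon=\varepsilon_n(\mathbf{k})}, \tag{13.69}$$

where $\bar{\mathbf{v}}_n(\mathbf{k})$ is a weighted average of the velocity over the past history of the electron
orbit42 passing through $\mathbf{k}$:

$$\bar{\mathbf{v}}_n(\mathbf{k}) = \int_{-\infty}^{0}\frac{dt}{\tau_n(\mathbf{k})}\,e^{t/\tau_n(\mathbf{k})}\,\mathbf{v}_n(\mathbf{k}_n(t)). \tag{13.70}$$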

In  the  low-field  limit  the  orbit  is  traversed  very  slowly,  only  points  in  the  immediate 
vicinity  of  k  contribute  appreciably  to  the  average  in  (13.70),  and  the  zero  field  result 
is  recovered.  In  the  general  case,  and  even  in  the  high  field  limit,  one  must  resort  to 
some  rather  elaborate  analysis,  even  to  extract  the  information  we  arrived  at  in 
Chapter  12  from  a  direct  examination  of  the  semiclassical  equations  of  motion.  We 
shall  not  pursue  such  computations  further  here,  but  some  of  the  applications  of 
(13.70)  are  illustrated  in  Problem  6. 
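The low-field and high-field behavior of the average (13.70) can be made concrete for the simplest case, a free electron on a circular orbit. The sketch below is an illustration under stated assumptions: time is measured in units of $\tau$, the speed is set to 1, the velocity is encoded as the complex number $v_x + i v_y = e^{i\omega_c \tau t}$ (the sense of rotation is an arbitrary choice here), and the function name and numerical parameters are hypothetical. For this orbit the integral has the closed form $1/(1 + i\omega_c\tau)$, which the trapezoidal estimate is compared against.

```python
import cmath
import math

def history_averaged_velocity(omega_c_tau, cutoff=40.0, steps=100_000):
    """Trapezoidal estimate of (13.70) for a circular free electron orbit.

    Units: time in units of tau, velocity in units of the constant speed.
    The instantaneous velocity is u(t) = exp(i * omega_c_tau * t), with
    v_x = Re u and v_y = Im u. The lower limit -infinity is replaced by
    -cutoff, where the weight exp(t) is negligible.
    """
    dt = cutoff / steps
    total = 0j
    for i in range(steps + 1):
        t = -cutoff + i * dt
        edge = 0.5 if i in (0, steps) else 1.0  # trapezoid end-point weights
        total += edge * math.exp(t) * cmath.exp(1j * omega_c_tau * t)
    return total * dt

for w in (0.01, 1.0, 10.0):
    numeric = history_averaged_velocity(w)
    closed_form = 1.0 / (1.0 + 1j * w)
    print(w, abs(numeric), abs(closed_form))
```

For $\omega_c\tau \ll 1$ the average is essentially the velocity at $t = 0$, so the zero-field result is recovered; for $\omega_c\tau \gg 1$ its magnitude falls off like $1/\omega_c\tau$, because the velocity averages out over many revolutions within a relaxation time.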


PROBLEMS 


1.  On  page  250  we  argued  that  in  a  metal  with  cubic  symmetry  the  conductivity  tensor  is  a 
constant  times  the  unit  matrix,  i.e.,  that  j  is  always  parallel  to  E.  Construct  an  analogous  argu¬ 
ment  for  a  hexagonal  close-packed  metal,  showing  that  the  conductivity  tensor  is  diagonal  in  a 
rectangular coordinate system with z taken along the c-axis, with $\sigma_{xx} = \sigma_{yy}$, so that the current
induced  by  a  field  parallel  or  perpendicular  to  the  c-axis  is  parallel  to  the  field. 

42 Here $\mathbf{k}_n(t)$ is the solution to the semiclassical equations of motion (12.6) in a uniform magnetic
field that passes through the point $\mathbf{k}$ at time zero ($\mathbf{k}_n(0) = \mathbf{k}$). (We have used the fact that the distribution
function is independent of time when the fields are time-independent, and written the integral in (13.21)
in the form it has at $t = 0$.)

2. Deduce from (13.25) that at $T = 0$ (and hence to an excellent approximation at any $T \ll T_F$)
the conductivity of a band with cubic symmetry is given by

$$\sigma = \frac{e^2}{12\pi^3\hbar}\,\tau(\varepsilon_F)\,\bar{v}\,S, \tag{13.71}$$

where $S$ is the area of Fermi surface in the band, and $\bar{v}$ is the electronic speed averaged over the
Fermi surface:

$$\bar{v} = \frac{1}{S}\int dS\,|\mathbf{v}(\mathbf{k})|. \tag{13.72}$$


(Note  that  this  contains,  as  a  special  case,  the  fact  that  filled  or  empty  bands  (neither  of  which 
have  any  Fermi  surface)  carry  no  current.  It  also  provides  an  alternative  way  of  viewing  the  fact 
that  almost  empty  (few  electrons)  and  almost  filled  (few  holes)  bands  have  low  conductivity, 
since  they  will  have  very  small  amounts  of  Fermi  surface.) 

Verify  that  (13.71)  reduces  to  the  Drude  result  in  the  free  electron  limit. 

3.  Show  that  the  equations  describing  the  electric  and  thermal  currents,  (13.45)  and  (13.50) 
to  (13.53),  continue  to  hold  in  the  presence  of  a  uniform  magnetic  field,  provided  that  Eq.  (13.48) 
for $\sigma(\varepsilon)$ is generalized to include the effects of the magnetic field by replacing the second $\mathbf{v}(\mathbf{k})$
by $\bar{\mathbf{v}}(\mathbf{k})$, defined in Eq. (13.70).

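4. The response of the conduction electrons to an electric field

$$\mathbf{E}(\mathbf{r}, t) = \mathrm{Re}\,[\mathbf{E}(\mathbf{q}, \omega)\,e^{i(\mathbf{q}\cdot\mathbf{r} - \omega t)}] \tag{13.73}$$

which depends on position as well as time, requires some special consideration. Such a field will
in general induce a spatially varying charge density

$$\rho(\mathbf{r}, t) = -e\,\delta n(\mathbf{r}, t), \qquad \delta n(\mathbf{r}, t) = \mathrm{Re}\,[\delta n(\mathbf{q}, \omega)\,e^{i(\mathbf{q}\cdot\mathbf{r} - \omega t)}]. \tag{13.74}$$

Since electrons are conserved in collisions, the local equilibrium distribution appearing in the
relaxation-time approximation (13.3) must correspond to a density equal to the actual
instantaneous local density $n(\mathbf{r}, t)$. Thus even at uniform temperature one must allow for a local chemical
potential of the form

$$\mu(\mathbf{r}, t) = \mu + \delta\mu(\mathbf{r}, t), \qquad \delta\mu(\mathbf{r}, t) = \mathrm{Re}\,[\delta\mu(\mathbf{q}, \omega)\,e^{i(\mathbf{q}\cdot\mathbf{r} - \omega t)}], \tag{13.75}$$

where $\delta\mu(\mathbf{q}, \omega)$ is chosen to satisfy (to linear order in $\mathbf{E}$) the condition

$$\delta n(\mathbf{q}, \omega) = \frac{\partial n_{\mathrm{eq}}(\mu)}{\partial\mu}\,\delta\mu(\mathbf{q}, \omega). \tag{13.76}$$

(a) Show, as a result, that at uniform temperature Eq. (13.22) must be replaced by43

$$g(\mathbf{r}, \mathbf{k}, t) = f(\varepsilon(\mathbf{k})) + \mathrm{Re}\,[\delta g(\mathbf{q}, \mathbf{k}, \omega)\,e^{i(\mathbf{q}\cdot\mathbf{r} - \omega t)}],$$

$$\delta g(\mathbf{q}, \mathbf{k}, \omega) = \left(-\frac{\partial f}{\partial\varepsilon}\right)\frac{\delta\mu(\mathbf{q}, \omega)/\tau - e\,\mathbf{v}(\mathbf{k})\cdot\mathbf{E}(\mathbf{q}, \omega)}{(1/\tau) - i[\omega - \mathbf{q}\cdot\mathbf{v}(\mathbf{k})]}. \tag{13.77}$$

(b) By constructing the induced current and charge densities from the distribution function
(13.77), show that the choice (13.75) of $\delta\mu(\mathbf{q}, \omega)$ is precisely what is required to insure that the
equation of continuity (local charge conservation)

$$\nabla\cdot\mathbf{j} + \frac{\partial\rho}{\partial t} = 0, \qquad \mathbf{q}\cdot\mathbf{j}(\mathbf{q}, \omega) = \omega\,\rho(\mathbf{q}, \omega) \tag{13.78}$$

is satisfied.

43 See footnote 7.

(c) Show that if no charge density is induced, then the current is

$$\mathbf{j}(\mathbf{r}, t) = \mathrm{Re}\,[\boldsymbol{\sigma}(\mathbf{q}, \omega)\cdot\mathbf{E}(\mathbf{q}, \omega)\,e^{i(\mathbf{q}\cdot\mathbf{r} - \omega t)}],$$

$$\boldsymbol{\sigma}(\mathbf{q}, \omega) = e^2\int\frac{d\mathbf{k}}{4\pi^3}\left(-\frac{\partial f}{\partial\varepsilon}\right)\frac{\mathbf{v}\mathbf{v}}{(1/\tau) - i[\omega - \mathbf{q}\cdot\mathbf{v}(\mathbf{k})]}. \tag{13.79}$$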

Show  that  a  sufficient  condition  for  (13.79)  to  be  valid  is  that  the  electric  field  E  be  perpendicular 
to  a  plane  of  mirror  symmetry  in  which  the  wave  vector  q  lies. 


5. Consider a metal in which thermal and electric currents flow simultaneously. The rate at
which heat is generated in a unit volume is related to the local energy and number densities by
(cf. (13.39)):

$$\frac{dq}{dt} = \frac{du}{dt} - \mu\,\frac{dn}{dt}, \tag{13.80}$$

where $\mu$ is the local chemical potential. Using the equation of continuity,

$$\frac{dn}{dt} = -\nabla\cdot\mathbf{j}^n, \tag{13.81}$$

and the fact that the rate of change of the local energy density is determined by the rate at which
electrons carry energy into the volume plus the rate at which the electric field does work,

$$\frac{du}{dt} = -\nabla\cdot\mathbf{j}^{\varepsilon} + \mathbf{E}\cdot\mathbf{j}, \tag{13.82}$$

show that (13.80) can be written in the form

$$\frac{dq}{dt} = -\nabla\cdot\mathbf{j}^q + \boldsymbol{\mathcal{E}}\cdot\mathbf{j}, \tag{13.83}$$

where $\mathbf{j}^q$ is the thermal current (given by (13.38) and (13.40)), and $\boldsymbol{\mathcal{E}} = \mathbf{E} + (1/e)\nabla\mu$. Assuming
cubic symmetry, so that the tensors $L^{ij}$ are diagonal, show that under conditions of uniform
current flow ($\nabla\cdot\mathbf{j} = 0$) and uniform temperature gradient ($\nabla^2 T = 0$)

$$\frac{dq}{dt} = \rho j^2 + \frac{dK}{dT}(\nabla T)^2 - T\,\frac{dQ}{dT}\,(\nabla T)\cdot\mathbf{j}, \tag{13.84}$$

where $\rho$ is the resistivity, $K$ is the thermal conductivity, and $Q$ is the thermopower. By measuring
the change in bulk heating as the current direction is reversed for fixed temperature gradient
(known as the Thomson effect) one can therefore determine the temperature derivative of the
thermopower, and thereby compute the value of $Q$ at high temperatures, given its low-temperature
value.

Compare the numerical coefficient of $\nabla T\cdot\mathbf{j}$ with that of the crude estimate in Problem 3,
Chapter 1.


6. The average velocity $\bar{\mathbf{v}}$ (Eq. (13.70)) appearing in the expression (13.69) for the conductivity
in a uniform magnetic field takes on a fairly simple form in the high-field limit.

(a) Show that for a closed orbit, the projection of $\bar{\mathbf{v}}$ in a plane perpendicular to $\mathbf{H}$ is

$$\bar{\mathbf{v}}_\perp = -\frac{\hbar c}{eH\tau}\,\hat{\mathbf{H}}\times[\mathbf{k} - \langle\mathbf{k}\rangle] + O\!\left(\frac{1}{H^2}\right), \tag{13.85}$$

where $\langle\mathbf{k}\rangle$ is the time average of the wave vector over the orbit:

$$\langle\mathbf{k}\rangle = \frac{1}{T}\oint\mathbf{k}\,dt. \tag{13.86}$$


(b) Show that for an open orbit the high-field limit of $\bar{\mathbf{v}}$ is just the average velocity of motion
along the orbit (and hence parallel to the direction of the orbit).

(c) Show44 that in the high-field limit, when $\mathbf{E}\cdot\mathbf{H} = 0$,


ji  = 


(13.87) 


where $\mathbf{w} = c(\mathbf{E}\times\mathbf{H})/H^2$ is the drift velocity defined in (12.46). Deduce the forms (12.51) or
(12.52) from (13.87), depending on whether the band is particle- or hole-like. (Note: Because $\mathbf{k}$
is not a periodic function in k-space, one cannot automatically integrate by parts in (13.87).)

(d) Deduce, from the result of (b), the limiting form (12.56) for the conductivity in the presence
of open orbits. (Hint: Observe that $\bar{\mathbf{v}}$ is independent of the component of $\mathbf{k}$ parallel to the k-space
direction of the open orbit.)

(e) Show from the general form of the semiclassical equation of motion in a magnetic field
(12.6) that the conductivity tensor (13.69) for a given band in a uniform magnetic field has the
functional dependence on $H$ and $\tau$ of the form:

$$\boldsymbol{\sigma} = \tau\,\mathbf{F}(H\tau). \tag{13.88}$$

Deduce from (13.88) that when the current is carried by electrons in a single band (or if the
relaxation time is the same for all bands), then

$$\frac{\rho_{xx}(H) - \rho_{xx}(0)}{\rho_{xx}(0)} \tag{13.89}$$

depends on $H$ and $\tau$ only through the product $H\tau$ (Kohler's rule), for any diagonal component of
the resistivity perpendicular to $\mathbf{H}$.

(f) Deduce from the properties of the semiclassical equations of motion in a magnetic field
that

$$\sigma_{\mu\nu}(\mathbf{H}) = \sigma_{\nu\mu}(-\mathbf{H}). \tag{13.90}$$

This is known as an Onsager relation.45 (Hint: Make the change of variables $\mathbf{k}(t) = \mathbf{k}'$, and
appeal to Liouville's theorem in replacing the k-space integrals in (13.69) by integrals over $\mathbf{k}'$.)


44 Argue that the term in $\langle\mathbf{k}\rangle$ in (13.85) makes no contribution because it depends only on $\varepsilon$ and $k_z$.

45 Such relationships between transport coefficients were first formulated in very great generality by
L. Onsager. The first equality in Eq. (13.51) is another example of an Onsager relation.

There is a class of measurable quantities valued primarily because they contain
detailed information about the geometric structure of the Fermi surface. Such quantities
depend only on universal constants (e, h, c, or m), experimentally controlled
variables (such as temperature, frequency, magnetic field strength, crystal orientation),
and information about the electronic band structure that is entirely determined by
the shape of the Fermi surface.

We  have  already  encountered  one  such  quantity,  the  high-field  Hall  constant, 
which  (in  uncompensated  metals  with  no  open  orbits  for  the  given  field  direction) 
is  entirely  determined  by  the  k-space  volume  enclosed  by  the  hole-like  and  particle-like 
branches  of  the  Fermi  surface. 

Quantities yielding such Fermi surface information have a place of special importance
in metals physics. Their measurement almost always requires single crystals
of  very  pure  substances  at  very  low  temperatures  (to  eliminate  dependence  on  the 
relaxation  time)  and  is  frequently  performed  in  very  strong  magnetic  fields  (to  force 
the  electrons  to  sample  the  Fermi  surface  geometry  in  the  course  of  their  semiclassical 
motion  in  k-space). 

The  importance  of  determining  the  Fermi  surface  of  metals  is  clear:  The  shape 
of  the  Fermi  surface  is  intimately  involved  in  the  transport  coefficients  of  a  metal 
(as discussed in Chapters 12 and 13) as well as in the equilibrium and optical properties
(as  will  be  illustrated  in  Chapter  15).  An  experimentally  measured  Fermi  surface 
provides  a  target  at  which  a  first-principles  band  structure  calculation  can  aim.  It 
can  also  be  used  to  provide  data  for  fitting  parameters  in  a  phenomenological  crystal 
potential,  which  can  then  be  used  to  calculate  other  phenomena.  If  nothing  else, 
Fermi  surface  measurements  are  of  interest  as  a  further  test  of  the  validity  of  the 
one-electron  semiclassical  theory,  since  there  are  now  many  independent  ways  of 
extracting  Fermi  surface  information. 

Of  the  techniques  used  to  deduce  the  geometry  of  the  Fermi  surface,  one  has  proved 
far  and  away  the  most  powerful :  the  de  Haas-van  Alphen  effect  (and  a  group  of  closely 
related  effects  based  on  the  same  underlying  physical  mechanism).  This  phenomenon 
is  almost  entirely  responsible  for  the  vast  and  growing  body  of  precise  knowledge 
of  the  Fermi  surfaces  of  a  great  number  of  metals.  No  other  technique  approaches 
it  in  power  or  simplicity.  For  this  reason  the  bulk  of  this  chapter  is  devoted  to  an 
exposition  of  the  de  Haas-van  Alphen  effect.  We  shall  conclude  with  brief  discussions 
of  a  selection  of  other  effects  which  have  been  used  to  provide  supplementary 
geometrical  information. 


THE  de  HAAS-van  ALPHEN  EFFECT 

In  Figure  14.1  are  shown  the  results  of  a  famous  experiment  by  de  Haas  and  van 
Alphen  in  1930.  They  measured  the  magnetization  M  of  a  sample  of  bismuth  as  a 
function  of  magnetic  field  in  high  fields  at  14.2  K,  and  found  oscillations  in  M/H. 

On the face of it this curious phenomenon, observed only at low temperatures and
high fields, would not strike one as the extraordinary key to the electron structure of
metals it has turned out to be. The full extent of its usefulness was only pointed out
in 1952, by Onsager. Since the original experiment, and especially since around 1960,
careful observations have been made in many metals of this same oscillatory field
dependence in the magnetic susceptibility,1 $\chi = dM/dH$.

Figure 14.1

The data of de Haas and van Alphen. Magnetization per gram
divided by field ($-M/H$), plotted vs. field for two orientations of a crystal of
bismuth, at 14.2 K. (W. J. de Haas and P. M. van Alphen, Leiden
Comm. 208d, 212a (1930), and 220d (1932).)

The oscillations display a remarkable regularity, if the susceptibility is plotted not
against field, but against inverse field. It then becomes clear that $\chi$ has a periodic
dependence on $1/H$, though frequently two or more periods are superposed. Some
typical data are shown in Figure 14.2.

Similar  oscillatory  behavior  has  been  observed  not  only  in  the  susceptibility,  but 
also in the conductivity (Shubnikov-de Haas effect), the magnetostriction (dependence
of sample size on magnetic field strength), and, when measured with sufficient
care,  in  almost  all  other  quantities.  Minute  oscillations  of  this  kind  have  even  been 
observed  in  the  high-field  Hall  “constant,”  a  clear  indication  that  the  effect  must  lie 
in  a  failure  of  the  semiclassical  model.  A  variety  of  such  effects  are  displayed  in 
Figure  14.3. 

The  refinement  of  the  de  Haas-van  Alphen  effect  into  a  powerful  probe  of  the  Fermi 
surface  has  been  due  largely  to  D.  Shoenberg,  whose  history  of  the  phenomenon2 
provides  delightful  and  instructive  reading.  Two  major  techniques  have  been  widely 
exploited to measure the oscillations. One, based on the fact that in a field a magnetized
sample experiences a torque proportional to its magnetic moment,3 simply
measures  the  oscillations  in  angular  position  of  a  sample  of  the  metal,  attached  to  a 


1  When  the  magnetization  varies  linearly  with  the  field  one  need  not  distinguish  between  M/H  and 
dM/dH .  Here,  however,  (and  in  the  treatment  of  critical  phenomena  in  Chapter  33)  nonlinear  effects  are 
crucial.  It  is  now  generally  agreed  that  in  such  cases  the  susceptibility  should  be  defined  as  dM/dH. 

2 Proc. 9th Internat. Conf. on Low Temperature Physics, Daunt, Edwards, Milford, and Yaqub, eds.,
Plenum Press, New York, 1965, p. 665.

3 The torque only exists when the magnetization is not parallel to the field. Because the effect is
nonlinear, this is generally the case except when the field is in certain symmetry directions.

Figure 14.2

De  Haas-van  Alphen  oscillations  in  (a)  rhenium  and  (b)  silver.  (Courtesy  of  A.  S.  Joseph.) 

filar  suspension,  as  the  magnetic  field  strength,  and  hence  the  magnetization  M(H) 
varies.  The  second  technique,  especially  valuable  if  high  fields  are  required,  measures 
the  voltage  induced  in  a  pickup  coil  surrounding  the  sample  when  a  burst4  of  field 
is  applied.  Since  this  will  be  proportional  to  dM/dt  =  (dM/dH)(dH/dt),  one  can 
measure  the  oscillations  in  the  susceptibility  as  a  function  of  field. 

Even  before  the  key  to  the  theory  of  the  de  Haas-van  Alphen  effect  for  Bloch 
electrons was pointed out by Onsager, Landau5 was able to account for the oscillations
in free electron theory, as a direct consequence of the quantization of closed


4  This  “burst”  of  field  is,  of  course,  slowly  varying  on  the  scale  of  metallic  relaxation  times,  so  that 
the  magnetization  stays  in  equilibrium  with  the  instantaneous  value  of  the  field. 

5  L.  D.  Landau,  Z.  Phys.  64,  629  (1930).  Note  the  date.  Landau  predicted  the  oscillations  without 
knowing of de Haas and van Alphen's experiment, but thought that a magnetic field uniform enough to
observe  them  could  not  be  achieved  (see  Problem  3). 


Figure 14.3

The ubiquity of the oscillations, of which the de Haas-van Alphen
effect is the most celebrated example. (a) Sound attenuation in tungsten.
(C. K. Jones and J. A. Rayne.) (b) dT/dH (arbitrary units) vs. field in antimony. (B. D.
McCombe and G. Seidel.) (c) Magnetoresistance of gallium vs. field
at 1.3 K. (d) Oscillations accompanying the Peltier effect in zinc. (e) Thermoelectric voltage of bismuth
at 1.6 K. (f) Thermal conductivity of bismuth at 1.6 K. (Sources: (a), (b), (c): Proc. 9th
Internat. Conf. on Low Temperature Physics, J. G. Daunt et al., eds., Plenum Press, New York,
1965. (d): H. J. Trodahl and F. J. Blatt, Phys. Rev. 180, 709 (1969). (e), (f): M. C. Steele
and J. Babiskin, Phys. Rev. 98, 359 (1955).)


electronic  orbits  in  a  magnetic  field,  and  thus  as  a  direct  observational  manifestation 
of  a  purely  quantum  phenomenon.  The  phenomenon  became  of  even  greater  interest 
and importance when Onsager6 pointed out that the change in $1/H$ through a single
period of oscillation, $\Delta(1/H)$, was determined by the remarkably simple relation:

$$\Delta\!\left(\frac{1}{H}\right) = \frac{2\pi e}{\hbar c}\,\frac{1}{A_e}, \tag{14.1}$$

where $A_e$ is any extremal cross-sectional area of the Fermi surface in a plane normal
to the magnetic field.
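As a rough numerical illustration of (14.1), one can evaluate the period for a free electron sphere, whose only extremal cross section normal to any field direction is the central one, $A_e = \pi k_F^2$. The sketch below works in Gaussian units; the function name is hypothetical and the value $k_F = 1.36 \times 10^8$ cm$^{-1}$ is an assumed input of the size typical of a monovalent metal.

```python
import math

# Gaussian (cgs) constants, rounded.
E_ESU = 4.80320e-10    # magnitude of the electron charge, esu
HBAR = 1.05457e-27     # erg s
C_LIGHT = 2.99792e10   # cm/s

def dhva_period(extremal_area_cm2):
    """Period Delta(1/H) in 1/gauss from (14.1), for an area given in cm^-2."""
    return 2.0 * math.pi * E_ESU / (HBAR * C_LIGHT * extremal_area_cm2)

# Assumed free electron sphere: the central cross section is the extremal one.
k_f = 1.36e8                       # cm^-1 (illustrative input)
area = math.pi * k_f ** 2          # cm^-2
period = dhva_period(area)
print(f"Delta(1/H) = {period:.3e} 1/G, frequency 1/Delta = {1.0 / period:.3e} G")
```

The period comes out near $1.6 \times 10^{-9}$ G$^{-1}$, so that in a field of $10^4$ G the susceptibility passes through one oscillation when the field changes by a fraction of a gauss; this is why field uniformity matters so much in practice.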


6 L. Onsager, Phil. Mag. 43, 1006 (1952).

Some extremal areas are illustrated in Figure 14.4. If the z-axis is taken along the
magnetic field, then the area of a Fermi surface cross section at height $k_z$ is $A(k_z)$,
and the extremal areas $A_e$ are the values of $A(k_z)$ at the $k_z$ where $dA/dk_z = 0$. (Thus
maximum and minimum cross sections are among the extremal ones.)


Figure  14.4 

Illustration of various extremal orbits. For H
along the $k_1$-axis, (1) and (2) are maximum
extremal orbits and (3) is a minimum extremal
orbit. When the field is along the $k_2$-axis, only one
extremal orbit, (4), is present.


Since  altering  the  magnetic  field  direction  brings  different  extremal  areas  into  play, 
all  extremal  areas  of  the  Fermi  surface  can  be  mapped  out.  This  frequently  provides 
enough  information  to  reconstruct  the  actual  shape  of  the  Fermi  surface.  In  practice 
this  may  be  a  complex  task,  for  if  more  than  one  extremal  orbit  is  present  in  certain 
directions, or if more than one band is partially filled, several periods will be
superimposed. Rather than directly disentangling the geometrical information from the
data, it is often easier to guess at what the surface is (using, for example, an
approximate calculation of the band structure), later refining the guess by testing it against
the  data. 

The  argument  justifying  (14.1)  is  simple  but  strikingly  bold.  The  explanation  cannot 
be  classical,  for  a  theorem  of  Bohr  and  van  Leeuwen  (see  Chapter  31)  asserts  that 
no  properties  of  a  classical  system  in  thermal  equilibrium  can  depend  in  any  way  on 
the  magnetic  field.  This  powerful  result  applies  to  semiclassical  systems  (in  the  sense 
of Chapters 12 and 13) as well, so the de Haas-van Alphen effect is a definite failure
of the semiclassical model. The failure arises whenever the semiclassical theory
predicts closed orbits for the electronic motion projected on a plane perpendicular to
the field. When this happens (as it generally does), the energies of motion perpendicular
to H are quantized. To find these energy levels one must in principle return to the
Schrödinger equation for an electron in the periodic crystalline potential in the
presence  of  the  magnetic  field.  The  full  solution  of  this  problem  is  a  formidable  task, 
which  has  been  accomplished  only  in  the  simple  case  of  free  electrons  (i.e.,  zero 


270  Chapter  14  Measuring  the  Fermi  Surface 


periodic potential) in a magnetic field. We describe the results in the free electron case
below,  referring  the  reader  to  one  of  the  standard  texts  for  their  derivation.7  We  shall 
not  use  the  free  electron  results  except  to  illustrate  and  test  the  validity  of  Onsager’s 
far  more  general,  but  somewhat  less  rigorous,  theory  of  the  magnetic  levels  in  a 
periodic  potential. 


FREE  ELECTRONS  IN  A  UNIFORM  MAGNETIC  FIELD 


The orbital8 energy levels of an electron in a cubical box with sides of length $L$ parallel
to the x-, y-, and z-axes are determined in the presence of a uniform magnetic field
H along the z-direction by two quantum numbers, $\nu$ and $k_z$:

$$\varepsilon_\nu(k_z) = \frac{\hbar^2}{2m}k_z^2 + \left(\nu + \tfrac{1}{2}\right)\hbar\omega_c, \qquad \omega_c = \frac{eH}{mc}. \tag{14.2}$$

The quantum number $\nu$ runs through all nonnegative integers, and $k_z$ takes on the
same values as in the absence of a magnetic field (Eq. (2.16)):

$$k_z = \frac{2\pi n_z}{L}, \tag{14.3}$$

for any integral $n_z$. Each level is highly degenerate. The number of levels with energy
(14.2) for a given $\nu$ and $k_z$ is (including the factor of 2 for spin degeneracy):

$$\frac{2e}{hc}\,HL^2. \tag{14.4}$$

Since

$$\frac{hc}{2e} = 2.068 \times 10^{-7}\ \text{G-cm}^2, \tag{14.5}$$

in a field of a kilogauss (a typical field for a de Haas-van Alphen experiment) and a
sample 1 cm on a side, this degeneracy will be about $10^{10}$. The degeneracy reflects
the fact that a classical electron with a given energy and $k_z$ spirals about a line parallel
to the z-axis, which can have arbitrary x- and y-coordinates.9
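The degeneracy estimate follows directly from (14.4) and (14.5): it is the flux $HL^2$ through the sample face divided by $hc/2e$. A minimal sketch of the arithmetic, with a hypothetical function name and the sample dimensions quoted in the text:

```python
FLUX_UNIT_G_CM2 = 2.068e-7  # hc/2e in G-cm^2, from (14.5)

def landau_degeneracy(field_gauss, side_cm):
    """Number of levels per (nu, k_z), Eq. (14.4), spin factor of 2 included."""
    return field_gauss * side_cm ** 2 / FLUX_UNIT_G_CM2

# Values from the text: a kilogauss field and a sample 1 cm on a side.
degeneracy = landau_degeneracy(1.0e3, 1.0)
print(f"degeneracy = {degeneracy:.2e}")
```

The result is a few times $10^9$, consistent with the order-of-magnitude figure quoted above, and it scales with the cross-sectional area $L^2$ rather than with the volume of the specimen.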

Equation (14.2) is quite plausible: Since there is no component of Lorentz force
along H, the energy of motion in the z-direction is unaffected by the field, and
continues to be given by $\hbar^2 k_z^2/2m$. However, the energy of motion perpendicular to the
field, which would be $\hbar^2(k_x^2 + k_y^2)/2m$ if no field were present, is quantized in steps
of $\hbar\omega_c$, Planck's constant times the frequency of the classical motion (page 14). This


7 L. D. Landau and E. M. Lifshitz, Quantum Mechanics (2nd ed.), Addison-Wesley, Reading, Mass.,
1965, pp. 424-426, or R. E. Peierls, Quantum Theory of Solids, Oxford, New York, 1955, pp. 146-147.
Peierls  gives  a  better  discussion  of  the  rather  subtle  spatial  boundary  condition.  The  energy  levels  are 
found  by  reducing  the  problem,  by  a  simple  transformation,  to  that  of  a  one-dimensional  harmonic 
oscillator. 

8  Equation  (14.2)  does  not  include  the  interaction  energy  between  the  field  and  the  electron  spin.  We 
consider  the  consequences  of  this  additional  term  below,  but  for  the  moment  we  ignore  it. 

9 This is why the degeneracy (14.4) is proportional to the cross-sectional area of the specimen.

phenomenon is called orbit quantization. The set of all levels with a given $\nu$ (and
arbitrary $k_z$) is referred to collectively as the $\nu$th Landau level.10

From this information a theory of the de Haas-van Alphen effect can be constructed
for the free electron model. Rather than reproduce that analysis11 we turn
to  a  slightly  modified  version  of  Onsager’s  simple,  but  subtle,  argument,  which 
generalizes  the  free  electron  results  to  Bloch  electrons  and  bears  directly  on  the 
problem  of  Fermi  surface  determination. 


LEVELS  OF  BLOCH  ELECTRONS  IN  A  UNIFORM  MAGNETIC 
FIELD 


Onsager’s generalization of Landau’s free electron results is only valid for magnetic
levels with fairly high quantum numbers. However, we shall find that the de Haas-van
Alphen effect is due to levels at the Fermi energy, which almost always do have very
high quantum numbers. In free electron theory, for example, unless almost all the
electronic energy is in motion parallel to the field, a level of energy $\mathcal{E}_F$ must have a
quantum number $\nu$ whose order of magnitude is
$\mathcal{E}_F/\hbar\omega_c = \mathcal{E}_F/[(e\hbar/mc)H]$. Now

$$\frac{e\hbar}{mc} = 1.16 \times 10^{-8}\ \text{eV/G}. \tag{14.6}$$

Since $\mathcal{E}_F$ is typically several electron volts, even in fields as high as $10^4$ G, the quantum
number $\nu$ will be of order $10^4$.
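The size of the quantum number can be estimated directly from (14.6). The sketch below assumes a Fermi energy of 5 eV as a stand-in for "several electron volts"; that value and the function name are illustrative choices, not taken from the text.

```python
# e*hbar/(m*c) in eV per gauss, as quoted in Eq. (14.6).
E_HBAR_OVER_MC = 1.16e-8


def landau_index_estimate(fermi_energy_ev: float, field_gauss: float) -> float:
    """Order of magnitude of the quantum number: E_F / (hbar * omega_c)."""
    return fermi_energy_ev / (E_HBAR_OVER_MC * field_gauss)


# Illustrative (assumed) Fermi energy of 5 eV in a field of 10^4 G.
print(f"{landau_index_estimate(5.0, 1.0e4):.1e}")
```

Even at the highest field mentioned, the estimate stays in the range of $10^4$, which is what justifies the use of the correspondence principle for levels at the Fermi energy.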

Energies  of  levels  with  very  high  quantum  numbers  can  be  accurately  calculated 
with  Bohr’s  correspondence  principle,  which  asserts  that  the  difference  in  energy  of 
two  adjacent  levels  is  Planck’s  constant  times  the  frequency  of  classical  motion  at 
the energy of the levels. Since $k_z$ is a constant of the semiclassical motion, we apply
this condition to levels with a specified $k_z$, and quantum numbers $\nu$ and $\nu + 1$.

Let $\mathcal{E}_\nu(k_z)$ be the energy of the $\nu$th allowed level12 at the given $k_z$. The correspondence
principle then gives

$$\mathcal{E}_{\nu+1}(k_z) - \mathcal{E}_\nu(k_z) = \frac{h}{T(\mathcal{E}_\nu(k_z), k_z)}, \tag{14.7}$$

where $T(\mathcal{E}, k_z)$ is the period of semiclassical motion on the orbit specified by $\mathcal{E}$ and
$k_z$ (Eq. (12.42)):

$$T(\mathcal{E}, k_z) = \frac{\hbar^2 c}{eH}\,\frac{\partial A(\mathcal{E}, k_z)}{\partial \mathcal{E}}, \tag{14.8}$$

and $A(\mathcal{E}, k_z)$ is the k-space area enclosed by the orbit. Combining (14.8) and (14.7),
we may write (suppressing explicit reference to the variable $k_z$)

$$(\mathcal{E}_{\nu+1} - \mathcal{E}_\nu)\,\frac{\partial}{\partial \mathcal{E}} A(\mathcal{E}_\nu) = \frac{2\pi eH}{\hbar c}. \tag{14.9}$$

10 It must be added that the above results are only valid when the radius of classical circular motion
of an electron with energy $\mathcal{E}$ and momentum $\hbar k_z$ is not comparable with the cross-sectional dimensions
of the box. For an electron with energy $\mathcal{E}_F$ and $k_z = 0$ the condition is most stringent.
At $10^3$ gauss, $\hbar c/eH \approx 10^{-10}$ cm$^2$. Since $k_F$ is typically about $10^8$ cm$^{-1}$, the orbit radius
$(\hbar c/eH)k_F$ is then about $10^{-2}$ cm, so the results are applicable to
samples with dimensions on the order of centimeters, but fail when the sample is still as large as 0.1 mm.

11 It can be found in the book by Peierls cited in footnote 7.

12 Throughout the discussion that follows we consider a single band, and drop explicit reference to
the band index. This is done primarily to avoid confusion between the band index $n$ and the magnetic
quantum number $\nu$. Throughout this chapter $\mathcal{E}_\nu(k_z)$ is the $\nu$th allowed energy of an electron in the given
band, with wave vector $k_z$. Should it be necessary to deal with more than one band, we would use the
notation $\mathcal{E}_{n,\nu}(k_z)$.


Because we are interested in $\mathcal{E}_\nu$ of the order of $\mathcal{E}_F$, we can greatly simplify (14.9).
On the basis of the free electron results we expect that the energy difference between
neighboring Landau levels will be of order $\hbar\omega_c$, which is smaller than the energies
of the levels themselves by a factor of at least $10^4$. It is therefore an excellent approximation
to take:

$$\frac{\partial}{\partial \mathcal{E}} A(\mathcal{E}_\nu) = \frac{A(\mathcal{E}_{\nu+1}) - A(\mathcal{E}_\nu)}{\mathcal{E}_{\nu+1} - \mathcal{E}_\nu}. \tag{14.10}$$


Placing this in (14.9) we find

$$A(\mathcal{E}_{\nu+1}) - A(\mathcal{E}_\nu) = \frac{2\pi eH}{\hbar c}, \tag{14.11}$$


which states that classical orbits at adjacent allowed energies (and the same $k_z$)
enclose areas that differ by the fixed amount $\Delta A$, where

$$\Delta A = \frac{2\pi eH}{\hbar c}. \tag{14.12}$$


Another way of stating this conclusion is that, at large $\nu$, the area enclosed by the
semiclassical orbit at an allowed energy and $k_z$ must depend on $\nu$ according to:

$$A(\mathcal{E}_\nu(k_z), k_z) = (\nu + \lambda)\,\Delta A, \tag{14.13}$$

where $\lambda$ is independent13 of $\nu$. This is Onsager’s famous result (which he derived by
an alternate route, using the Bohr-Sommerfeld quantization condition).


ORIGIN  OF  THE  OSCILLATORY  PHENOMENA 

Underlying the de Haas-van Alphen and related oscillations is a sharp oscillatory
structure in the electronic density of levels imposed by the quantization condition
(14.13). The level density will have a sharp peak14 whenever $\mathcal{E}$ is equal to the energy
of an extremal orbit15 satisfying the quantization condition. The reason for this is
shown in Figure 14.5. Figure 14.5a depicts the set of all orbits satisfying (14.13) for
a given $\nu$. These form a tubular structure (of cross-sectional area $(\nu + \lambda)\Delta A$) in
k-space. The contribution to $g(\mathcal{E})\,d\mathcal{E}$ from the Landau levels associated with orbits
on the $\nu$th such tube will be the number of such levels with energies between $\mathcal{E}$ and
$\mathcal{E} + d\mathcal{E}$. This, in turn, is proportional to the area16 of the portion of tube contained
between the constant-energy surfaces of energies $\mathcal{E}$ and $\mathcal{E} + d\mathcal{E}$. Figure 14.5b shows
this portion of tube when the orbits of energy $\mathcal{E}$ on the tube are not extremal, and
Figure 14.5c shows the portion of tube when there is an extremal orbit of energy $\mathcal{E}$ on
the tube. Evidently the area of the portion of tube is enormously enhanced in the
latter case, as a result of the very slow energy variation of levels along the tube near
the given orbit.

13 We shall follow the usual practice of assuming that $\lambda$ is also independent of $k_z$ and $H$. This is
verified in Problem 1a for free electrons, and holds for any ellipsoidal band. Although it has not been
proved in general, the reader is invited to show, as an exercise, that the conclusions reached below under
the assumption of a constant $\lambda$ are altered only if $\lambda$ is an exceedingly rapidly varying function of either
$k_z$ or $H$. This is most unlikely.

Most electronic properties of metals depend on the density of levels at the Fermi
energy, $g(\mathcal{E}_F)$. It follows directly from the above argument17 that $g(\mathcal{E}_F)$ will be singular
whenever the value of the magnetic field causes an extremal orbit on the Fermi
surface to satisfy the quantization condition (14.13), i.e., whenever

$$(\nu + \lambda)\,\Delta A = A_e(\mathcal{E}_F). \tag{14.14}$$

Figure 14.5

(a) A Landau tube. Its cross sections by planes perpendicular to H have the same area,
$(\nu + \lambda)\Delta A$ for the $\nu$th tube, and are bounded by curves of constant energy $\mathcal{E}_\nu(k_z)$ at height
$k_z$. (b) The portion of the tube containing orbits in the energy range from $\mathcal{E}$ to $\mathcal{E} + d\mathcal{E}$ when
none of the orbits in that range occupy extremal positions on their constant-energy surfaces.
(c) Same construction as in (b), except that $\mathcal{E}$ is now the energy of an extremal orbit. Note
the great enhancement in the range of $k_z$ for which the tube is contained between the
constant-energy surfaces at $\mathcal{E}$ and $\mathcal{E} + d\mathcal{E}$.

14 In fact, a more detailed analysis shows that the level density becomes singular as $(\mathcal{E} - \mathcal{E}_0)^{-1/2}$,
when $\mathcal{E}$ is near the energy $\mathcal{E}_0$ of an extremal orbit satisfying the quantization condition.

15 An extremal orbit of energy $\mathcal{E}$ is one that encloses an extremal cross-sectional area of the surface
$\mathcal{E}(\mathbf{k}) = \mathcal{E}$.

16 The density of levels contained in the tube is uniform along the field direction, allowed values of
$k_z$ being given by (14.3).

17 Strictly speaking, the chemical potential (which is equal to $\mathcal{E}_F$ at zero temperature) also depends
on magnetic field strength, complicating the argument, but this is a very small effect and can normally be
ignored.


Using the value (14.12) for $\Delta A$, it follows that $g(\mathcal{E}_F)$ will be singular at regularly
spaced intervals in $1/H$ given by

$$\Delta\left(\frac{1}{H}\right) = \frac{2\pi e}{\hbar c}\,\frac{1}{A_e(\mathcal{E}_F)}. \tag{14.15}$$

Thus oscillatory behavior as a function of $1/H$ with period (14.15) should appear
in any quantity that depends on the level density at $\mathcal{E}_F$, which, at zero temperature,
includes almost all characteristic metallic properties.
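To get a feeling for the size of this period, one can evaluate it for a free electron Fermi sphere, whose extremal cross section is $\pi k_F^2$. The sketch below is illustrative only: it takes $\hbar c/e$ from the value of $hc/2e$ in (14.5), and the Fermi wave vector $1.36 \times 10^8$ cm$^{-1}$ is an assumed, roughly copper-like value, not a number given in this section.

```python
import math

# hbar*c/e in G cm^2, obtained from hc/2e = 2.068e-7 G cm^2 (Eq. (14.5)).
HBAR_C_OVER_E = 2.068e-7 / math.pi


def dhva_period(extremal_area: float) -> float:
    """Period in 1/H (units of 1/G) for an extremal k-space area in cm^-2."""
    return 2.0 * math.pi / (HBAR_C_OVER_E * extremal_area)


def extremal_area_from_period(period: float) -> float:
    """Invert the same relation: extremal area (cm^-2) from a measured period."""
    return 2.0 * math.pi / (HBAR_C_OVER_E * period)


# Assumed free electron Fermi sphere with k_F = 1.36e8 cm^-1.
k_f = 1.36e8
area = math.pi * k_f ** 2
print(f"period in 1/H: {dhva_period(area):.3e} per gauss")
```

The second function is the direction in which the relation is used experimentally: a measured period in $1/H$ yields an extremal cross-sectional area of the Fermi surface.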

At nonzero temperatures typical metallic properties are determined by averages
over a range of energies within $k_BT$ of $\mathcal{E}_F$. If this range is so broad that for any value
of $H$ extremal orbits satisfying (14.13) contribute appreciably to the average, then the
oscillatory structure in $1/H$ will be washed out. This will happen when $k_BT$ is greater
than the typical energy separation between adjoining tubes of Landau levels. We
estimate this energy separation by its free electron value, $\hbar\omega_c$ (Eq. (14.2)). Since

$$\frac{e\hbar}{mck_B} = 1.34 \times 10^{-4}\ \text{K/G}, \tag{14.16}$$

one must use fields of the order of $10^4$ G and temperatures as low as a few degrees
Kelvin to avoid the thermal obliteration of the oscillations.
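The thermal criterion can be put in numbers with (14.16). The following is a rough sketch, assuming the free electron spacing $\hbar\omega_c$ and a sharp cutoff at $k_BT = \hbar\omega_c$; the real damping of the oscillations is gradual, so the boolean test is only an order-of-magnitude guide, and both function names are illustrative.

```python
# e*hbar/(m*c*k_B) in kelvin per gauss, as quoted in Eq. (14.16).
KELVIN_PER_GAUSS = 1.34e-4


def landau_spacing_kelvin(field_gauss: float) -> float:
    """Free electron level spacing hbar*omega_c expressed as a temperature."""
    return KELVIN_PER_GAUSS * field_gauss


def oscillations_resolved(temperature_k: float, field_gauss: float) -> bool:
    """Crude criterion: k_B T must not exceed the spacing between Landau tubes."""
    return temperature_k < landau_spacing_kelvin(field_gauss)


print(landau_spacing_kelvin(1.0e4))          # spacing at 10^4 G, in kelvin
print(oscillations_resolved(300.0, 1.0e4))   # room temperature
print(oscillations_resolved(1.0, 1.0e4))     # liquid helium range
```

At $10^4$ G the spacing corresponds to a temperature of order 1 K, which is why room-temperature measurements show no oscillations.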

Electron scattering can cause similar problems. A detailed treatment of how this
affects the oscillations is difficult, but for a rough estimate we need only note that if
the electronic relaxation time is $\tau$, then the electron’s energy can be defined only to within
$\Delta\mathcal{E} \sim \hbar/\tau$. If $\Delta\mathcal{E}$ is larger than the spacing between peaks in $g(\mathcal{E})$, the oscillatory
structure will be appreciably diminished. In the free electron case this spacing is
$\hbar\omega_c$, which leads to the condition that $\omega_c\tau$ be comparable to or larger than unity for
oscillations to be observed. This is the same high-field condition that arises in the
semiclassical theory of electronic transport (Chapters 12 and 13).
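The condition on $\omega_c\tau$ translates into a minimum relaxation time at a given field. A minimal sketch, assuming the free electron mass: the cyclotron frequency is obtained from (14.6) together with the standard value of $\hbar$ in eV-sec, and the function names are illustrative.

```python
# e*hbar/(m*c) in eV per gauss (Eq. (14.6)) and hbar in eV sec.
E_HBAR_OVER_MC = 1.16e-8
HBAR_EV_SEC = 6.582e-16


def cyclotron_frequency(field_gauss: float) -> float:
    """Free electron omega_c = eH/mc in rad/sec."""
    return E_HBAR_OVER_MC * field_gauss / HBAR_EV_SEC


def minimum_relaxation_time(field_gauss: float) -> float:
    """Relaxation time (sec) at which omega_c * tau reaches unity."""
    return 1.0 / cyclotron_frequency(field_gauss)


print(f"omega_c at 10^4 G: {cyclotron_frequency(1.0e4):.3e} rad/sec")
print(f"tau needed:        {minimum_relaxation_time(1.0e4):.2e} sec")
```

Relaxation times of this size are much longer than typical room-temperature values in metals, which is one reason pure samples at low temperatures are needed.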


THE  EFFECT  OF  ELECTRON  SPIN  ON  THE  OSCILLATORY 
PHENOMENA 


Ignoring the effects of spin-orbit coupling,18 the major complication introduced by
electron spin is that the energy of each level will be increased or decreased by an
amount

$$\frac{ge\hbar H}{4mc} \tag{14.17}$$

according to whether the spin is along or opposite to the field. The number $g$ (not to
be confused with the level density $g(\mathcal{E})$) is the “electron g-factor,” which is very nearly
equal to 2. If we denote the level density calculated by ignoring this additional energy
by $g_0(\mathcal{E})$, then the result of these shifts is that the true level density $g(\mathcal{E})$ is given by

$$g(\mathcal{E}) = \frac{1}{2}\,g_0\!\left(\mathcal{E} + \frac{ge\hbar H}{4mc}\right) + \frac{1}{2}\,g_0\!\left(\mathcal{E} - \frac{ge\hbar H}{4mc}\right). \tag{14.18}$$

18 Which are small in the lighter elements. See page 169.


Note that the shift in the peaks is comparable to the separation between peaks (as
estimated by the free electron value $e\hbar H/mc$). Cases have actually been observed in
which for appropriate field directions this shift causes the oscillations in the two
terms of (14.18) to fall 180° out of phase, leading to no net oscillation.
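The cancellation can be illustrated with a toy model. The sketch below makes two assumptions that go beyond the text: the oscillatory part of $g_0$ is represented by a single cosine whose period is the peak separation, and that separation is written as $e\hbar H/m^*c$, with a cyclotron mass $m^*$ that may differ from the free electron mass (the parameter `mass_ratio` $= m^*/m$ is hypothetical). Adding the two shifted terms of (14.18) then multiplies the oscillation by $\cos(\pi g m^*/2m)$.

```python
import math


def summed_oscillation(energy: float, spacing: float, shift: float) -> float:
    """Oscillatory part of (14.18) when g_0 oscillates as cos(2*pi*E/spacing)."""
    up = math.cos(2.0 * math.pi * (energy + shift) / spacing)
    down = math.cos(2.0 * math.pi * (energy - shift) / spacing)
    return 0.5 * up + 0.5 * down


def spin_reduction_factor(g_factor: float, mass_ratio: float = 1.0) -> float:
    """Amplitude factor cos(2*pi*shift/spacing) for the single-cosine model.

    shift = g*e*hbar*H/(4*m*c); spacing = e*hbar*H/(m_star*c), so that
    shift/spacing = g*mass_ratio/4, with mass_ratio = m_star/m (assumed).
    """
    return math.cos(math.pi * g_factor * mass_ratio / 2.0)


# Free electron mass and g = 2: the two terms are shifted by a full period.
print(spin_reduction_factor(2.0, 1.0))
# If g * m_star / m = 1 the two terms are 180 degrees out of phase.
print(spin_reduction_factor(2.0, 0.5))
```

In this model the two spin contributions stay in step for $g = 2$ and the free electron mass, and cancel only when the ratio of shift to spacing is a quarter of a period, which is consistent with the statement that cancellation is seen for particular field directions rather than generally.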


OTHER  FERMI  SURFACE  PROBES 

A  variety  of  other  experiments  are  used  to  probe  the  Fermi  surface.  In  general,  the 
information  available  from  other  techniques  is  geometrically  less  straightforward 
than  the  extremal  areas  supplied  by  the  de  Haas-van  Alphen  effect  and  related 
oscillations. Furthermore, this information is often more difficult to extract unambiguously
from the data. We therefore confine ourselves to a brief survey of
selected  methods. 


The  Magnetoacoustic  Effect 

Fairly  direct  information  about  Fermi  surface  geometry  can  sometimes  be  extracted 
by measuring the attenuation of sound waves in a metal as they propagate perpendicular
to a uniform magnetic field,19 particularly if the wave is carried by displacements
of  the  ions  that  are  both  perpendicular  to  its  direction  of  propagation,  and  to  the 
magnetic  field  (Figure  14.6).  Since  the  ions  are  electrically  charged,  such  a  wave  is 
accompanied  by  an  electric  field  of  the  same  frequency,  wave  vector,  and  polarization. 
The  electrons  in  the  metal  can  interact  with  the  sound  wave  through  this  electric 
field,  thereby  supporting  or  hindering  its  propagation. 

If conditions permit the electrons to complete many orbits in the magnetic field
between collisions,20 then the sound attenuation can depend on wavelength in a
manner reflecting the Fermi surface geometry. This is because21 the electrons follow
real space orbits whose projections in planes perpendicular to the field are simply
cross sections of constant energy surfaces, scaled by the factor $\hbar c/eH$ (and rotated
through 90°). When the wavelength of the sound is comparable to the dimensions
of an electron’s orbit22 the extent to which the electric field of the wave perturbs the
electron depends on how the wavelength $l$ matches the maximum linear dimension
$l_c$ of the orbit along the direction of wave propagation (referred to in this context as
the orbit’s “diameter”). For example, electrons on orbits with diameters equal to half
a wavelength (Figure 14.7a) can be accelerated (or decelerated) by the wave throughout
their entire orbit, while electrons with orbit diameters equal to a whole wavelength
(Figure 14.7b) must always be accelerated on parts of their orbit and decelerated on
other parts.

Figure 14.6

The instantaneous displacement from equilibrium of the ions in a sound wave suitable
for the magnetoacoustic effect. Only one row of ions is shown. (Legend: × = equilibrium
position; • = instantaneous position.)

19 A detailed theory of this phenomenon in the case of free electrons has been given by M. H. Cohen
et al., Phys. Rev. 117, 937 (1960).

20 This requires $\omega_c\tau \gg 1$; i.e., the specimen must be a single crystal of high purity at low temperatures
in a strong field.

Figure 14.7

(a) An electron orbit with a diameter $l_c$ equal to half a wavelength, positioned so as to
be accelerated by the electric field accompanying the sound wave at all points of its orbit.
(b) An electron orbit with a diameter equal to a whole wavelength. No matter where the
orbit is positioned along the direction q, the kind of coherent acceleration (or deceleration)
over the entire orbit possible in case (a) cannot occur.

More  generally,  an  electron  will  be  weakly  coupled  to  the  wave  when  its  orbit 
diameter  is  a  whole  number  of  wavelengths,  but  can  be  strongly  coupled  when  the 
orbit  diameter  differs  from  a  whole  number  of  wavelengths  by  half  a  wavelength: 

$$l_c = nl \quad \text{(weakly coupled)},$$

$$l_c = (n + \tfrac{1}{2})\,l \quad \text{(strongly coupled)}. \tag{14.19}$$

The only electrons that can affect the sound attenuation are those near the Fermi
surface, since the exclusion principle forbids electrons with lower energies from
exchanging small amounts of energy with the wave. The Fermi surface has a continuous
range of diameters, but the electrons on orbits with diameters near the
extremal diameters play a dominant role, since there are many more of them.23

21 See pages 229 and 230.

22 A typical orbit diameter is of order $v_F/\omega_c$. Since the angular frequency of the sound is of order
$v_s/l$, when $l \approx l_c$ we have $\omega \approx \omega_c(v_s/v_F)$. Typical sound velocities are about 1 percent of the Fermi velocity,
so electrons can complete many orbits during a single period of the waves of interest. In particular, during
a single revolution of an electron, the electric field perturbing it can be regarded as static.

As a result the sound attenuation can display a periodic variation with inverse
wavelength, in which the period (cf. Eq. (14.19)) is equal to the inverse of the extremal
diameters of the Fermi surface along the direction of sound propagation:

$$\Delta\left(\frac{1}{l}\right) = \frac{1}{l_c}. \tag{14.20}$$


By  varying  the  direction  of  propagation  (to  bring  different  extremal  diameters  into 
play)  and  by  varying  the  direction  of  the  magnetic  field  (to  bring  different  Fermi 
surface  cross  sections  into  play),  one  can  sometimes  deduce  the  shape  of  the  Fermi 
surface  from  this  structure  in  the  sound  attenuation. 
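The scaling between k-space and real-space orbits, and the coupling rule (14.19), can be sketched numerically. The example assumes a free electron Fermi sphere with $k_F = 10^8$ cm$^{-1}$ and a field of $10^4$ G (both illustrative values), and the tolerance used to label a coupling as weak or strong is an arbitrary choice made for this sketch.

```python
import math

# hbar*c/e in G cm^2, from hc/2e = 2.068e-7 G cm^2 (Eq. (14.5)).
HBAR_C_OVER_E = 2.068e-7 / math.pi


def orbit_diameter(k_diameter: float, field_gauss: float) -> float:
    """Real-space orbit diameter (cm): k-space diameter (cm^-1) times hbar*c/(e*H).

    Because of the 90 degree rotation, the relevant k-space diameter is the one
    perpendicular to both the field and the direction of sound propagation.
    """
    return HBAR_C_OVER_E * k_diameter / field_gauss


def coupling(l_c: float, wavelength: float, tol: float = 0.05) -> str:
    """Classify the coupling according to Eq. (14.19); tol is an assumed tolerance."""
    ratio = l_c / wavelength
    frac = ratio - math.floor(ratio)
    if min(frac, 1.0 - frac) < tol:
        return "weak"
    if abs(frac - 0.5) < tol:
        return "strong"
    return "intermediate"


l_c = orbit_diameter(2.0e8, 1.0e4)   # free electron sphere: diameter 2 k_F
print(f"orbit diameter: {l_c:.3e} cm")
print(f"period in inverse wavelength: {1.0 / l_c:.1f} per cm")
print(coupling(l_c, l_c), coupling(l_c, 2.0 * l_c))
```

With these assumed numbers the orbit diameter is of order $10^{-3}$ cm, so the sound wavelengths of interest are of the same order.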

Ultrasonic  Attenuation 

Information  about  the  Fermi  surface  can  also  be  extracted  from  measurements  of 
sound  attenuation  when  no  magnetic  field  is  present.  One  no  longer  examines  a 
resonant  effect,  but  simply  calculates  the  rate  of  attenuation  assuming  that  it  is 
entirely  due  to  energy  being  lost  to  the  electrons.  It  can  be  shown  that  if  this  is  the 
case,24  then  the  attenuation  will  be  entirely  determined  by  Fermi  surface  geometry. 
However,  the  geometrical  information  extracted  in  this  way  is,  under  the  best  of 
circumstances,  nowhere  near  as  simple  as  either  the  extremal  areas  furnished  by  the 
de  Haas-van  Alphen  effect  or  the  extremal  diameters  one  can  deduce  from  the 
magnetoacoustic  effect. 

Anomalous  Skin  Effect 

One  of  the  earliest  Fermi  surface  determinations  (in  copper)  was  made  by  Pippard25 
from  measurements  of  the  reflection  and  absorption  of  microwave  electromagnetic 
radiation (in the absence of a static magnetic field). If the frequency $\omega$ is not too high,
such a field will penetrate into the metal a distance $\delta_0$ (the “classical skin depth”)
given by26

$$\delta_0 = \frac{c}{\sqrt{2\pi\sigma\omega}}. \tag{14.21}$$

The derivation of (14.21) assumes that the field in the metal varies little over a mean
free path: $\delta_0 \gg \ell$. When $\delta_0$ is comparable to $\ell$ a much more complicated theory is
required, and when $\delta_0 \ll \ell$ (the “extreme anomalous regime”) the simple picture of
an exponentially decaying field over a distance $\delta_0$ breaks down completely. However,
in the extreme anomalous case it can be shown that the field penetration and the
microwave reflectivity are now determined entirely by certain features of the Fermi
surface geometry that depend only on the orientation of the Fermi surface with
respect to the actual surface of the sample.

23 This is quite analogous to the role played by cross sections of extremal area in the theory of the
de Haas-van Alphen effect.

24 In general, an unwarranted assumption. There are other mechanisms for sound attenuation. See,
for example, Chapter 25.

25 A. B. Pippard, Phil. Trans. Roy. Soc. A250, 325 (1957).

26 See, for example, J. D. Jackson, Classical Electrodynamics, Wiley, New York, 1962, p. 225.
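The classical skin depth (14.21) and the comparison with the mean free path can be evaluated as follows. This is a sketch in Gaussian units: the conductivity $5 \times 10^{17}$ sec$^{-1}$ and the mean free paths are assumed, illustrative values, and the factor of 10 used to separate the regimes is an arbitrary threshold chosen for the example.

```python
import math

C_CM_PER_SEC = 2.998e10  # speed of light


def classical_skin_depth(sigma: float, omega: float) -> float:
    """delta_0 = c / sqrt(2*pi*sigma*omega), Eq. (14.21).

    sigma: conductivity in Gaussian units (1/sec); omega: angular frequency (rad/sec).
    """
    return C_CM_PER_SEC / math.sqrt(2.0 * math.pi * sigma * omega)


def skin_effect_regime(delta_0: float, mean_free_path: float, factor: float = 10.0) -> str:
    """Label the regime; the factor separating the regimes is an assumed threshold."""
    if delta_0 > factor * mean_free_path:
        return "classical"
    if mean_free_path > factor * delta_0:
        return "extreme anomalous"
    return "intermediate"


delta = classical_skin_depth(5.0e17, 1.0e10)   # assumed sigma, microwave omega
print(f"classical skin depth: {delta:.3e} cm")
print(skin_effect_regime(delta, 1.0e-6))       # assumed short mean free path
print(skin_effect_regime(delta, 1.0e-2))       # assumed long mean free path (pure, cold sample)
```

In the second case the value returned by `classical_skin_depth` is only a label for the regime; as the text stresses, the field no longer decays exponentially over that distance.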


Cyclotron  Resonance 

This technique also exploits the attenuation of a microwave field as it penetrates a
metal. Strictly speaking, the method does not measure Fermi surface geometry, but
the “cyclotron mass” (12.44), determined by $\partial A/\partial\mathcal{E}$. This is done by observing the
frequency at which an electric field resonates with the electronic motion in a uniform
magnetic field. High $\omega_c\tau$ is required for the electrons to undergo periodic motion,
and the resonance condition $\omega = \omega_c$ is satisfied at microwave frequencies.

Since the field does not penetrate far into the metal, electrons can absorb energy
only when they are within a skin depth of the surface.27 At microwave frequencies
and large $\omega_c$ one is in the extreme anomalous regime, where the skin depth is quite
small compared to the mean free path. Because the dimensions of the electron’s real
space orbit at the Fermi surface are comparable to the mean free path, the skin depth
will also be small compared with the size of the orbit.

Figure 14.8

Parallel-field Azbel’-Kaner geometry.

These considerations led Azbel' and Kaner28 to suggest placing the magnetic field
parallel to the surface, leading to the geometry shown in Figure 14.8. If the electron
experiences an electric field of the same phase each time it enters the skin depth, then
it can resonantly absorb energy from the field. This will be the case if the applied
field has completed an integral number of periods, $T_E$, each time the electron returns
to the surface:

$$T = nT_E, \tag{14.22}$$

where $T$ is the period of cyclotron motion and $n$ is an integer. Since frequencies
are inversely proportional to periods we can write (14.22) as

$$\omega = n\omega_c. \tag{14.23}$$

Usually one works at fixed frequency $\omega$, and varies the strength of the magnetic field
$H$, writing the resonant condition as

$$\frac{1}{H} = \frac{2\pi e}{\hbar^2 c\,\omega}\,\frac{1}{\partial A/\partial\mathcal{E}}\,n. \tag{14.24}$$

Thus if the absorption is plotted vs. $1/H$, resonant peaks due to a given cyclotron
period will be uniformly spaced.
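The uniform spacing in $1/H$ is easy to see numerically. In the sketch below the resonance condition $\omega = n\omega_c$ is evaluated with $\omega_c = eH/m^*c$; the microwave frequency of 24 GHz and the parameter `mass_ratio` ($m^*/m$, the cyclotron mass in units of the free electron mass) are assumptions made for illustration.

```python
import math

# Free electron e/(m*c) in rad/(sec G), from Eq. (14.6) and hbar = 6.582e-16 eV sec.
E_OVER_MC = 1.16e-8 / 6.582e-16


def resonance_fields(omega: float, n_max: int, mass_ratio: float = 1.0) -> list:
    """Fields H_n (gauss) at which omega = n * omega_c, for n = 1 .. n_max."""
    return [mass_ratio * omega / (n * E_OVER_MC) for n in range(1, n_max + 1)]


omega = 2.0 * math.pi * 24.0e9        # assumed microwave frequency, rad/sec
fields = resonance_fields(omega, 5)
inverse = [1.0 / h for h in fields]
spacings = [b - a for a, b in zip(inverse, inverse[1:])]

print([round(h) for h in fields])     # resonant fields fall off as 1/n
print(spacings)                       # equal steps in 1/H
```

A measured spacing in $1/H$ at known $\omega$ therefore gives the cyclotron mass directly, since the spacing equals $e/(m^*c\,\omega)$ in this notation.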


27  In  semiconductors  the  electron  density  is  very  much  lower,  a  microwave  field  can  penetrate  much 
further,  and  the  technique  of  cyclotron  resonance  is  much  more  straightforward.  (See  Chapter  28.) 

28 M. I. Azbel' and E. A. Kaner, Sov. Phys. JETP 3, 772 (1956).

Analysis of the data is complicated by the question of which orbits are providing
the major contributions to the resonance. In the case of an ellipsoidal Fermi surface
it can be shown that the cyclotron frequency depends only on the direction of the
magnetic field, independent of the height, $k_z$, of the orbit. The method is therefore
quite unambiguous in this case. However, when a continuum of periods is present
for a given field direction, as happens whenever $T(\mathcal{E}_F, k_z)$ depends on $k_z$, some care
must be exercised in interpreting the data. As usual, only orbits at the Fermi surface
need be considered, for the exclusion principle prohibits electrons in lower-lying
orbits from absorbing energy. A quantitative calculation indicates that the orbits at
which the cyclotron period $T(\mathcal{E}_F, k_z)$ has its extremal value with respect to $k_z$ are
very likely to determine the resonant frequencies. However, the detailed frequency
dependence of the energy loss can have quite a complicated structure, and one must
be wary of the possibility that one may not always be measuring extremal values of
$T(\mathcal{E}_F, k_z)$, but some rather complicated average of $T$ over the Fermi surface. The
situation is nowhere near as clearcut as it is in the de Haas-van Alphen effect.

Some typical cyclotron resonance data are shown in Figure 14.9. Note that several
extremal periods are involved. The uniform spacing in $1/H$ of all the peaks produced
by a single period is of great help in sorting out the rather complex structure.

Figure 14.9

Typical cyclotron resonance peaks in aluminum at two different field orientations.
Peaks in the field derivative of the absorbed power due to four distinct extremal
cyclotron masses can be identified. (Peaks due to the same extremal mass are spaced
uniformly in $1/H$, as can be verified by careful examination of the figure.) (T. W.
Moore and F. W. Spong, Phys. Rev. 125, 846 (1962).) [Horizontal axis: H (kG).]

Size Effects

Another  class  of  Fermi  surface  probes  works  with  very  thin  specimens  with  parallel 
plane  surfaces,  looking  for  resonant  effects  produced  by  those  electronic  orbits  that 
just  fit  between  the  two  surfaces.  The  most  straightforward  of  these  is  the  parallel-field 
Gantmakher  effect,29  in  which  a  thin  plate  of  metal  is  placed  in  a  magnetic  field 
parallel  to  its  surface,  and  exposed  to  microwave  radiation  polarized  perpendicular 
to  the  field  (Figure  14.10). 


Figure 14.10

The parallel-field Gantmakher effect. When the thickness of the
slab coincides with an extremal orbit diameter (or is an integral
multiple of an extremal orbit diameter), there will be resonant
transmission through the slab. The field can penetrate the slab only
to within the skin depth (shaded region on top), and only electrons
within a skin depth can reradiate energy out of the metal (shaded
region on the bottom). [Figure labels: incident radiation; direction of E.]

Suppose  that  the  slab  is  thick  compared  with  the  skin  depth,  but  not  compared 
with  the  mean  free  path  (which  requires  us  to  be  in  the  extreme  anomalous  regime). 
Then  an  electric  field  can  influence  the  electrons  only  when  they  are  within  a  skin 
depth  of  the  surface,  and,  conversely,  electrons  can  radiate  energy  back  out  of  the 
metal  only  when  they  are  within  a  skin  depth  of  the  surface. 

Now consider those electrons whose orbits in the magnetic field carry them from
within a skin depth of the top of the slab to within a skin depth of the bottom. It can
be shown that electrons in such orbits can reproduce, on the far side of the slab, the
current induced by the driving electric field on the near side, thereby causing electromagnetic
energy to be radiated from the far side of the slab. As a result, there is a
resonant increase in the transmission of electromagnetic energy through the slab
whenever the thickness and magnetic field are such that orbits can be so matched
with the surfaces. Here, again, only electrons near the Fermi surface are effective,
since only these are permitted by the exclusion principle to exchange energy with the
field. Here, too, only orbits with extremal linear dimensions will contribute to the
resonance.

29 V. F. Gantmakher, Sov. Phys. JETP 15, 982 (1962). The parallel- and tilted-field Gantmakher
effects are also important sources of information about electronic relaxation times.

Measurements of the Gantmakher effect are often made in the megahertz region
to avoid the complex situation arising when the size resonances are superimposed
on the cyclotron resonance frequencies, as might happen in the microwave regime.
It is necessary, however, that the frequency be high enough to be in the anomalous
regime.

The Fermi surface probes described above, together with a variety of related probes,
have now been applied to a large number of metals. The information that has been
extracted in this way is surveyed in Chapter 15.


PROBLEMS 


1. (a) Show that the Onsager quantization condition (14.13) (with $\lambda = \tfrac{1}{2}$) applied to the orbits
of a free electron leads directly to the free electron levels (14.2).

(b) Show that the degeneracy (14.4) of the free electron levels (14.2) is just the number of
zero-field free electron levels with the given $k_z$, and with $k_x$ and $k_y$ within a planar region of area
$\Delta A$ (Eq. (14.12)).

2. Using the fundamental relation (14.1), deduce the ratio of the areas of the two extremal
orbits responsible for the oscillations in Figure 14.2b.

3. If there is any nonuniformity of the magnetic field over the sample of metal used in a de
Haas-van Alphen experiment, then the structure in $g(\mathcal{E})$ will reflect this variation. Different regions
will have maxima in $g(\mathcal{E})$ at different field strengths, and the susceptibility, which sums contributions
from all regions, may lose its oscillatory structure. To avoid this, any spatial variation $\delta H$ in
the field must lead to a variation $\delta\mathcal{E}_\nu$ that is small compared with $\mathcal{E}_{\nu+1} - \mathcal{E}_\nu$ for the extremal orbits.
Using the fact that $\partial A(\mathcal{E}, k_z)/\partial k_z$ vanishes for the extremal orbits, calculate $\partial\mathcal{E}_\nu(k_z)/\partial H$ from (14.13)
for an extremal orbit. Deduce from this that to preserve the oscillatory structure the field
inhomogeneity must satisfy

$$\frac{\delta H}{H} < \frac{\Delta A}{A}, \tag{14.25}$$

where $\Delta A$ is given in (14.12).

4. (a) Show that in the range of microwave frequencies ($\omega \sim 10^{10}$ sec$^{-1}$) Eq. (1.33) for the
propagation of an electromagnetic wave in a metal reduces to

$$-\nabla^2 \mathbf{E} = \left(\frac{4\pi i\sigma\omega}{c^2}\right)\mathbf{E}. \tag{14.26}$$

(b) Deduce from this the expression (14.21) for the classical skin depth.

(c) Why is this analysis incorrect when the field varies appreciably over a mean free path?
(Hint: It is necessary to reexamine Drude’s derivation of Ohm’s law.)

In this chapter we describe some of the better understood features of the band
structures of specific metals, as deduced from experiment through techniques such
as those described in Chapter 14. Our primary aim is simply to illustrate the great
range and variety of band structures possessed by the metallic elements. When,
however, a particular feature of its band structure is strikingly reflected in the physical
properties of a metal, we shall point this out. In particular, we shall note examples of
Fermi surfaces that afford clearcut illustrations of the influence of band structure on
transport properties, as discussed in Chapters 12 and 13, as well as noting some of
the more straightforward examples of how band structure can affect specific heats
and optical properties.

THE  MONOVALENT  METALS 

The  monovalent  metals  have  the  simplest  of  all  Fermi  surfaces.  They  fall  into  two 
classes,  the  alkali  metals  and  the  noble  metals,  whose  atomic  structures  and  crystal 
structures  are  shown  in  Table  15.1. 


Table 15.1

THE MONOVALENT METALS

ALKALI METALS                      NOBLE METALS
(BODY-CENTERED CUBIC)^a            (FACE-CENTERED CUBIC)

Li: $1s^2 2s^1$                    -
Na: $[\mathrm{Ne}]3s^1$            -
K:  $[\mathrm{Ar}]4s^1$            Cu: $[\mathrm{Ar}]3d^{10}4s^1$
Rb: $[\mathrm{Kr}]5s^1$            Ag: $[\mathrm{Kr}]4d^{10}5s^1$
Cs: $[\mathrm{Xe}]6s^1$            Au: $[\mathrm{Xe}]4f^{14}5d^{10}6s^1$

^a The Fermi surface of lithium is not well known because it has a
so-called  martensitic  transformation  to  a  mixture  of  crystalline  phases 
at  77  K.  Thus  the  bcc  phase  only  exists  at  temperatures  too  high  to 
observe  the  de  Haas-van  Alphen  effect,  and  the  low-temperature 
phase  lacks  the  crystallinity  necessary  for  a  de  Haas-van  Alphen 
study.  Sodium  undergoes  a  similar  martensitic  transformation  at 
23  K,  but  with  care  the  transformation  can  be  partially  inhibited, 
and  good  de  Haas-van  Alphen  data  on  the  bcc  phase  have  been 
obtained.  (We  have  also  omitted  the  first  and  the  last  occupants  of 
column  IA  of  the  periodic  table  from  the  list  of  alkali  metals :  Solid 
hydrogen  is  an  insulator  (and  therefore  not  a  monatomic  Bravais 
lattice),  though  a  metallic  phase  at  very  high  pressures  has  been  con¬ 
jectured;  and  francium  is  radioactive,  with  a  very  short  half-life.) 

The  Fermi  surfaces  of  these  metals  are  known  with  great  precision  (with  the 
exception  of  lithium)  and  enclose  a  volume  of  k-space  that  accommodates  just  one 
electron  per  atom.  All  the  bands  are  completely  filled  or  empty  except  for  a  single 
half-filled conduction band. Of the two groups the noble metals are the more complicated.
Their Fermi surfaces have a more complex topology and the influence on
their  properties  of  the  filled  d-band  can  be  pronounced. 
The Alkali Metals

The alkali metals have singly charged ions (whose core electrons form the tightly
bound rare gas configuration and therefore give rise to very low-lying, very narrow,
filled, tight-binding bands) outside of which a single conduction electron moves. If
we treated the conduction electrons in the metal as completely free, then the Fermi
surface would be a sphere of radius $k_F$, given by (see Eq. (2.21))

$$\frac{k_F^3}{3\pi^2} = n = \frac{2}{a^3}, \tag{15.1}$$

where $a$ is the side of the conventional cubic cell (the bcc Bravais lattice has two
atoms per conventional cell). In units of $2\pi/a$ (half the length of the side of the
conventional cubic cell of the fcc reciprocal lattice) we can write:

$$k_F = \left(\frac{3}{4\pi}\right)^{1/3}\left(\frac{2\pi}{a}\right) = 0.620\left(\frac{2\pi}{a}\right). \tag{15.2}$$

The shortest distance from the center of the zone to a zone face (Figure 15.1) is

$$\Gamma N = \frac{2\pi}{a}\sqrt{\left(\tfrac{1}{2}\right)^2 + \left(\tfrac{1}{2}\right)^2 + 0^2} = 0.707\left(\frac{2\pi}{a}\right). \tag{15.3}$$

Therefore the free electron sphere is entirely contained within the first zone,
approaching it most closely in the direction $\Gamma N$, where it reaches a fraction $k_F/\Gamma N = 0.877$
of the way to the zone face.
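
The numbers in (15.2) and (15.3) are easy to check numerically. The following is a minimal sketch, assuming only the free electron relation $k_F^3/3\pi^2 = n$ with two atoms (and hence two conduction electrons) per conventional bcc cell; the variable names are illustrative, and all wave vectors are expressed in units of $2\pi/a$.

```python
import math

# All wave vectors are in units of 2*pi/a (a = side of the conventional cubic cell).
ATOMS_PER_BCC_CELL = 2  # one conduction electron per atom in the alkali metals

# k_F^3 / (3 pi^2) = n = 2 / a^3  =>  k_F = (3 pi^2 * 2)^(1/3) / a,
# then divide by 2*pi to express k_F in units of 2*pi/a.
k_f = (3 * math.pi**2 * ATOMS_PER_BCC_CELL) ** (1 / 3) / (2 * math.pi)

# N is the center of the nearest zone face, at (2*pi/a) * (1/2, 1/2, 0).
gamma_n = math.sqrt(0.5**2 + 0.5**2 + 0.0**2)

# Fraction of the way from the zone center to the zone face reached by the sphere.
fraction = k_f / gamma_n

print(f"k_F           = {k_f:.3f} (2 pi / a)")
print(f"Gamma-N       = {gamma_n:.3f} (2 pi / a)")
print(f"k_F / Gamma-N = {fraction:.3f}")
```

Because the fraction is less than one, the free electron sphere does not reach the zone face in any direction.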

De Haas-van Alphen measurements of the Fermi surface confirm this free electron
picture  to  a  remarkable  degree  of  precision,  especially  in  Na  and  K,  where  deviations 
in  kF  from  the  free  electron  value  are  at  most  a  few  parts  in  a  thousand.1  The  deviations 
of  these  Fermi  surfaces  from  perfect  spheres  are  shown  in  Figure  15.1,  which  reveals 
both  how  small  these  deviations  are,  and  how  very  precisely  they  are  known. 

Thus  the  alkalis  furnish  a  spectacular  example  of  the  accuracy  of  the  Sommerfeld 
free  electron  model.  It  would  be  wrong  to  conclude  from  this,  however,  that  the 
effective  crystalline  potential  is  minute  in  the  alkali  metals.  What  it  does  suggest  is 
that the weak pseudopotential method (Chapter 11) is well suited to describing the
conduction  electrons  in  the  alkalis.  Furthermore,  even  the  pseudopotential  need  not 
be  minute,  for  except  near  Bragg  planes  the  deviation  from  free  electron  behavior 
occurs  only  to  second  order  in  the  perturbing  potential  (Problem  5;  see  also  Chapter 
9).  As  a  result,  it  can  be  shown  that  band  gaps  as  large  as  an  electron  volt  at  the  Bragg 
planes  are  still  consistent  with  the  nearly  spherical  Fermi  surfaces  (see  Figure  15.3). 


1  The  difficulty  in  observing  such  minute  changes  in  the  de  Haas-van  Alphen  period  as  the  orientation 
of  the  crystal  is  changed  is  neatly  circumvented  by  working  at  constant  magnetic  field,  and  observing 
the  change  in  susceptibility  as  the  orientation  of  the  crystal  varies.  Typical  data  are  illustrated  in  Figure  15.2. 
The peak-to-peak distance is now associated with a change in extremal area of $\Delta A$, which is typically
$10^{-4}A$. Therefore, quite precise information can be extracted.
Figure 15.1

The measured Fermi surfaces of the alkali metals. Contours of constant distance from the
origin are shown for that portion of the surface lying in the first octant. The numbers
indicate percent deviation of $k/k_0$ from unity at maximum and minimum deviation,
where $k_0$ is the radius of the free electron sphere. Contours for Na and K are at intervals
of 0.02 percent, for Rb at intervals of 0.2 percent, with an extra dashed one at -0.3 percent,
and for Cs at intervals of 0.5 percent, with an extra dashed one at -1.25 percent. (From
D. Shoenberg, The Physics of Metals, vol. 1, J. M. Ziman, ed., Cambridge, 1969.)

Figure 15.2

De Haas-van Alphen oscillations produced by rotating a potassium crystal in a fixed magnetic
field. (From D. Shoenberg, Low Temperature Physics LT9, Plenum Press, New York, 1965.)


The  alkali  metals  are  unique  in  possessing  nearly  spherical  Fermi  surfaces  lying 
entirely inside a single Brillouin zone. Because of this property the detailed semiclassical
analysis of Chapter 12 reduces to the simple Sommerfeld free electron theory
of  Chapter  2  when  applied  to  the  transport  properties  of  the  alkalis.  Since  the  analysis 
of  free  electrons  is  simpler  than  for  general  Bloch  electrons,  the  alkalis  provide  a 
most  valuable  testing  ground  for  studying  various  aspects  of  electronic  behavior  in 
metals,  free  of  the  formidable  analytical  complications  imposed  by  band  structure. 


Figure  15.3 

Illustrating that a fairly substantial
energy gap at the Bragg plane (N)
is possible, even though the bands
are indistinguishable from free electron
bands at $k_F = 0.877\,\Gamma N$.
By and large the observed transport properties of the alkali metals2 agree reasonably
well with the observed sphericity of their Fermi surfaces, i.e., with the
predictions of free electron theory. However, it is difficult to prepare samples sufficiently
free of crystalline defects to test this with any stringency. Thus, for example,
although it is clear that measurements of the magnetoresistance show it to be far less
field-dependent in the alkalis than in many other metals, the field-independent
behavior at large $\omega_c\tau$ required for spherical Fermi surfaces has yet to be observed,
and measured Hall constants are still reported which deviate by a few percent from
the value $-1/nec$ required by free electron theory (or by any simple closed Fermi
surface containing one electronic level per atom). These discrepancies have led to
some speculation that the electronic structure of the alkalis may be more complex
than described here, but the evidence for this is far from compelling, and as of this
writing the widely prevailing opinion is that the alkali metals do indeed have very
nearly spherical Fermi surfaces.

The  Noble  Metals 

A comparison of potassium ($[\mathrm{Ar}]4s^1$) and copper ($[\mathrm{Ar}]3d^{10}4s^1$) reveals the characteristic
important differences between the alkali and noble metals. In the metallic
state of both elements, the closed-shell atomic levels of the argon configuration
($1s^2 2s^2 2p^6 3s^2 3p^6$) give rise to very tightly bound bands, lying well below the energies
of any of the remaining electronic levels in the metal. The electrons in these low-lying
levels can be considered as part of the (for most purposes) inert ion cores, and the
remaining bands can be constructed by considering either a bcc Bravais lattice of
$\mathrm{K}^{+}$ ion cores to which is added one electron per primitive cell, or an fcc Bravais
lattice of $\mathrm{Cu}^{11+}$ ions to which are added eleven electrons ($3d^{10}4s^1$) per primitive cell.

In  the  case  of  potassium  (and  the  other  alkalis)  the  extra  electron  is  accommodated 
by  filling  half  of  a  band  that  is  quite  free  electron-like,  resulting  in  the  very  nearly 
spherical  Fermi  surfaces  described  above. 

In the case of copper (and the other noble metals3) at least six bands are required
(and six turn out to be enough) to accommodate the eleven additional electrons.
Their structure is shown in Figure 15.4. For almost all wave vectors k the six bands
can be seen to separate into five lying in a relatively narrow range of energies from
about 2 to 5 eV below $\mathcal{E}_F$, and a sixth, with an energy anywhere from about 7 eV
above to 9 eV below $\mathcal{E}_F$.

It  is  conventional  to  refer  to  the  set  of  five  narrow  bands  as  the  d-bands,  and  the 
remaining  set  of  levels  as  the  s-band.  However,  these  designations  must  be  used 
cautiously,  since  at  some  values  of  k  all  six  levels  are  close  together,  and  the  distinction 
between d-band and s-band levels is not meaningful. The nomenclature reflects the
fact  that  at  wave  vectors  where  the  levels  do  clearly  group  into  sets  of  five  and  one, 
the  five  are  derived  from  the  five  orbital  atomic  d-levels,  in  the  sense  of  tight  binding 
(Chapter  10),  and  the  remaining  level  accommodates  what  would  be  the  4s  electron 
in  the  atom. 


2  Excepting  lithium,  whose  Fermi  surface,  for  the  reasons  mentioned  in  Table  15.1,  is  poorly  known. 

3 The 4f band in gold lies low enough for its electrons to be considered part of the ion core, along
with all those from the Xe configuration.
Figure 15.4

(a) Calculated energy bands in copper. (After G. A. Burdick, Phys. Rev. 129, 138
(1963).) The $\mathcal{E}$ vs. k curves are shown along several lines in the interior and on the
surface of the first zone. (The point $\Gamma$ is at the center of the zone.) The d-bands
occupy the darkest region of the figure, whose width is about 3.5 eV.

(b) The lowest-lying free electron energies along the same lines as in (a). (The
energy scales in (a) and (b) are not the same.)


Note that the k dependence of the s-band levels, except where they approach the
d-bands, bears a remarkable resemblance to the lowest free electron band for an fcc
crystal (plotted in Figure 15.4b for comparison), especially if one allows for the
expected modifications near the zone faces characteristic of a nearly free electron
calculation (Chapter 9). Note also that the Fermi level lies far enough above the
d-band for the s-band to intersect $\mathcal{E}_F$ at points where the resemblance to the free
electron band is still quite recognizable.4 Thus the calculated band structure indicates
that for purposes of Fermi surface determination one might still hope for some
success with a nearly free electron calculation. However, one must always keep in
mind that not too far below the Fermi energy lurks a very complex set of d-bands,
which can be expected to influence the metallic properties far more strongly than
do any of the filled bands in the alkali metals.5

The Fermi surface for a single half-filled free electron band in an fcc Bravais
lattice is a sphere entirely contained within the first Brillouin zone, approaching the
surface of the zone most closely in the $\langle 111 \rangle$ directions, where it reaches 0.903 of the
distance from the origin to the center of the hexagonal face. The de Haas-van Alphen


4  However,  the  Fermi  level  is  close  enough  to  the  d-band  to  make  the  s-band  nomenclature  somewhat 
dubious  for  conduction  band  levels  on  the  Fermi  surface.  A  more  precise  specification  of  how  s-like  or 
d-like a level is must be based on a detailed examination of its wave function. In this sense most, but
by  no  means  all,  levels  at  the  Fermi  surface  are  s-like. 

5  The  atomic  ionization  potentials  provide  a  convenient  reminder  of  the  different  roles  played  by 
filled  bands  in  the  alkali  and  noble  metals.  To  remove  the  first  (4s)  and  then  the  second  (3p)  electron  from 
atomic  potassium  requires  4.34  and  31.81  eV,  respectively.  The  corresponding  figures  for  copper  are 
7.72 eV (4s) and 20.29 eV (3d).
effect in all three noble metals reveals that their Fermi surfaces are closely related to
the free electron sphere; however, in the $\langle 111 \rangle$ directions contact is actually made
with the zone faces, and the measured Fermi surfaces have the shape shown in Figure
15.5. Eight “necks” reach out to touch the eight hexagonal faces of the zone, but
otherwise the surface is not grossly distorted from spherical. The existence of these
necks is most strikingly evident in the de Haas-van Alphen oscillations for magnetic
fields in the $\langle 111 \rangle$ directions, which contain two periods, determined by the extremal
“belly” (maximum) and “neck” (minimum) orbits (Figure 15.6). The ratio of the two
periods directly determines the ratio of the maximal to minimal $\langle 111 \rangle$ cross sections.
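
The figure of 0.903 quoted for the free electron sphere in an fcc lattice can be checked in the same spirit. The sketch below assumes four atoms per conventional fcc cell and one conduction electron per atom, and uses the standard labels L (center of a hexagonal face) and X (center of a square face) for the fcc zone; these labels and the variable names are assumptions of the sketch rather than notation taken from the passage above. Wave vectors are again in units of $2\pi/a$.

```python
import math

# Wave vectors in units of 2*pi/a (a = side of the conventional cubic cell).
ATOMS_PER_FCC_CELL = 4  # one conduction electron per atom in the noble metals

# Free electron Fermi wave vector from k_F^3 / (3 pi^2) = n = 4 / a^3.
k_f = (3 * math.pi**2 * ATOMS_PER_FCC_CELL) ** (1 / 3) / (2 * math.pi)

# Zone-center-to-face distances for the fcc first Brillouin zone.
gamma_l = math.sqrt(3 * 0.5**2)  # hexagonal face center at (1/2, 1/2, 1/2), along <111>
gamma_x = 1.0                    # square face center at (1, 0, 0), along <100>

fraction_111 = k_f / gamma_l
fraction_100 = k_f / gamma_x

print(f"k_F = {k_f:.3f} (2 pi / a)")
print(f"fraction of the way to the hexagonal face: {fraction_111:.3f}")
print(f"fraction of the way to the square face:    {fraction_100:.3f}")
```

The sphere comes much closer to the hexagonal faces than to the square faces, which makes it plausible that a modest distortion produces contact along $\langle 111 \rangle$ first.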

Figure  15.5 

(a) In the three noble metals the free electron sphere
bulges out in the $\langle 111 \rangle$ directions to make contact with
the hexagonal zone faces. (b) Detailed cross sections
of the surface for the separate metals. (D. Shoenberg
and D. J. Roaf, Phil. Trans. Roy. Soc. 255, 85 (1962).) The
cross sections may be identified by a comparison with
(a).




6 M. R. Halse, Phil. Trans. Roy. Soc. A265, 507 (1969). The entry for silver can be read directly from
the  experimental  curve  in  Figure  15.6. 
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29 

Although  a  distorted  sphere,  bulging  out  to  make  contact  with  the  hexagonal 
zone  faces,  is  still  a  fairly  simple  structure,  when  viewed  in  the  repeated-zone  scheme 
the  noble  metal  Fermi  surface  reveals  a  variety  of  exceedingly  complex  orbits.  Some 
of  the  simplest  are  shown  in  Figure  15.7.  The  open  orbits  are  responsible  for  the  very 
dramatic behavior of the magnetoresistance of the noble metals (Figure 15.8), whose


Figure  15.6 

De  Haas-van  Alphen  oscillations  in  silver.  (Courtesy  of  A.  S. 
Joseph.)  The  magnetic  field  is  along  a  <111>  direction.  The  two 
distinct  periods  are  due  to  the  neck  and  belly  orbits  indicated  in 
the  inset,  the  high-frequency  oscillations  coming  from  the  larger 
belly  orbit.  By  merely  counting  the  number  of  high-frequency 
periods  in  a  single  low-frequency  period  (i.e.,  between  the  two 
arrows) one deduces directly that $A_{111}(\text{belly})/A_{111}(\text{neck}) = 51$.
(Note that it is not necessary to know either the vertical or horizontal
scales of the graph to determine this fundamental piece of
geometrical  information!) 
Figure 15.7

Indicating only a few of the surprisingly many
types of orbits an electron can pursue in k-space
when a uniform magnetic field is applied to a
noble metal. (Recall that the orbits are given by
slicing the Fermi surface with planes perpendicular
to the field.) The figure displays (a) a
closed particle orbit; (b) a closed hole orbit;
(c) an open orbit, which continues in the same
general direction indefinitely in the repeated-zone
scheme.



Figure  15.8 

The spectacular direction dependence of the high-field magnetoresistance in copper
that is characteristic of a Fermi surface supporting open orbits. The [001] and [010]
directions of the copper crystal are as indicated in the figure, and the current flows
in the [100] direction perpendicular to the graph. The magnetic field is in the plane
of the graph. Its magnitude is fixed at 18 kilogauss, and its direction varied
continuously from [001] to [010]. The graph is a polar plot of

$$\frac{\rho(H) - \rho(0)}{\rho(0)}$$

vs. orientation of the field. The sample is very pure and the temperature very low
(4.2 K, the temperature of liquid helium) to insure the highest possible value for
$\omega_c\tau$. (J. R. Klauder and J. E. Kunzler, The Fermi Surface, Harrison and Webb, eds.,
Wiley, New York, 1960.)
failure to saturate in certain directions is very neatly explained by the semiclassical
theory  (see  pages  234-239). 

Although  the  topology  of  the  noble  metal  Fermi  surfaces  can  lead  to  very  complex 
transport  properties,  the  Fermi  surfaces  have  but  a  single  branch,  and  therefore  like 
the  alkali  metals  the  noble  metals  can  be  treated  as  one-band  metals  in  analyses  of 
their  transport  properties.  All  other  known  Fermi  surfaces  of  metallic  elements  have 
more  than  one  branch. 

Because, however, of the very shallow d-bands, a one-band model is quite likely
to be inadequate in explaining effects requiring more than a semiclassical analysis.
The d-bands are revealed especially unambiguously in the optical properties of the
noble metals.


Optical  Properties  of  the  Monovalent  Metals 

The  color  of  a  metal  is  determined  by  the  frequency  dependence  of  its  reflectivity: 
Some  frequencies  are  reflected  more  strongly  than  others.  The  very  different  colors 
of  copper,  gold,  and  aluminum  indicate  that  this  frequency  dependence  can  vary 
strikingly  from  one  metal  to  another. 

In  turn,  the  reflectivity  of  a  metal  is  determined  by  its  frequency-dependent 
conductivity,  through  one  of  the  standard  calculations  of  electromagnetic  theory 
(Appendix K). Substituting the free electron form (1.29) into Eq. (K.6) yields a reflectivity
in which properties of the specific metal appear only through the plasma
frequency  and  the  relaxation  time.  This  free  electron  reflectivity  lacks  the  structure 
necessary  to  account  for  the  characteristic  thresholds  that  appear  in  the  reflectivities 
of  real  metals,  as  well  as  the  striking  variations  from  one  metal  to  another. 

Abrupt  changes  in  the  reflectivity  are  caused  by  the  onset  of  new  mechanisms  for 
the  absorption  of  energy.  The  free  electron  model  gives  a  relatively  structureless 
reflectivity  because  collisions  provide  the  only  mechanism  for  energy  absorption. 
Incident  radiation  simply  accelerates  free  electrons,  and  if  there  were  no  collisions 
the  electrons  would  radiate  back  all  the  energy  so  acquired  in  the  form  of  transmitted 
and  reflected  radiation.  Since  there  is  no  transmission  below  the  plasma  frequency 
(see  page  18  and  also  Problem  2),  all  radiation  would  be  perfectly  reflected  in  the 
absence  of  collisions.  Above  the  plasma  frequency,  transmission  is  possible,  and  the 
reflectivity  declines.  The  only  effect  of  collisions  on  this  is  to  round  the  sharp  transition 
from  perfect  to  partial  reflection.  Because  of  collisions  some  of  the  energy  acquired 
by  the  electrons  from  the  incident  radiation  is  degraded  into  thermal  energy  (of  the 
ions  or  impurities,  for  example),  thereby  diminishing  the  amount  of  reflected  energy 
both  above  and  below  the  plasma  frequency.  Because  collisions  have  this  effect  at 
all  frequencies,  they  introduce  no  striking  frequency-dependent  structure  into  the 
reflectivity. 
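
The qualitative behavior described above can be illustrated with a short numerical sketch. It assumes the Drude form $\sigma(\omega) = \sigma_0/(1 - i\omega\tau)$, the dielectric function $\epsilon(\omega) = 1 + 4\pi i\sigma/\omega = 1 - \omega_p^2/(\omega^2 + i\omega/\tau)$, and the standard normal-incidence reflectivity $r = |(\sqrt{\epsilon} - 1)/(\sqrt{\epsilon} + 1)|^2$; the function name and the sample values of $\omega_p\tau$ are illustrative choices, not values taken from the text.

```python
import cmath

def drude_reflectivity(omega, omega_p, tau):
    """Normal-incidence reflectivity of a free electron (Drude) metal.

    omega and omega_p must share the same units; tau is in the inverse
    of those units. Gaussian-unit formulas are assumed.
    """
    eps = 1.0 - omega_p**2 / (omega**2 + 1j * omega / tau)
    n = cmath.sqrt(eps)  # principal square root: non-negative real part
    return abs((n - 1.0) / (n + 1.0)) ** 2

# Frequencies in units of the plasma frequency (omega_p = 1).
for omega_p_tau in (100.0, 5.0):
    print(f"omega_p * tau = {omega_p_tau}")
    for omega in (0.25, 0.5, 0.9, 1.1, 2.0):
        r = drude_reflectivity(omega, 1.0, omega_p_tau)
        print(f"  omega/omega_p = {omega:4.2f}   r = {r:.3f}")
```

With a long relaxation time the reflectivity stays close to unity below the plasma frequency and falls off above it; a shorter relaxation time lowers and rounds the curve but adds no sharp structure.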

For Bloch electrons the situation is quite different. A strongly frequency-dependent
mechanism for absorbing incident energy is possible, which is most simply understood
by regarding the incident radiation as a beam of photons of energy $\hbar\omega$ and momentum
$\hbar\mathbf{q}$. A photon may lose energy by exciting an electron from a level with energy $\mathcal{E}$ to
one with energy $\mathcal{E}' = \mathcal{E} + \hbar\omega$. In the free electron case momentum conservation
imposes the additional constraint $\mathbf{p}' = \mathbf{p} + \hbar\mathbf{q}$, which proves impossible to satisfy
(Problem 3), thereby prohibiting this type of energy loss. In the presence of a periodic
potential, however, the translational symmetry of free space is broken, and momentum
conservation does not hold. Nevertheless, a weaker conservation law is still in force
because of the remaining translational symmetry of the periodic potential. This
restricts the change in electron wave vector in a manner reminiscent of momentum
conservation:

k'  =  k  +  q  +  K,  (15.4) 

where  K  is  a  vector  of  the  reciprocal  lattice. 

Equation  (15.4)  is  a  special  case  of  “crystal  momentum  conservation,”  which  is 
discussed  in  detail  in  Appendix  M.  Here,  we  only  note  that  (15.4)  is  a  highly  plausible 
modification  of  the  momentum  conservation  law  satisfied  in  free  space,  since  the 
electronic  levels  in  a  periodic  potential,  although  not  the  single  plane-wave  levels  of 
free  space,  can  still  be  represented  as  superpositions  of  plane  waves  all  of  whose  wave 
vectors  differ  only  by  vectors  of  the  reciprocal  lattice  (see,  for  example,  Eq.  (8.42)). 

Since a photon of visible light has a wavelength of order 5000 Å, the photon wave
vector q is typically of order $10^5$ cm$^{-1}$. Typical Brillouin zone dimensions, on the
other hand, are of order $k_F \approx 10^8$ cm$^{-1}$. Thus the term q in (15.4) can shift the wave
vector k by only a fraction of a percent of the dimensions of the Brillouin zone.
Because  two  levels  in  the  same  band  whose  wave  vectors  differ  by  a  reciprocal  lattice 
vector  are,  in  fact,  identical,  the  shift  by  K  can  be  ignored  altogether,  and  we  reach 
the  important  conclusion  that  the  wave  vector  of  a  Bloch  electron  is  essentially 
unchanged  when  it  absorbs  a  photon. 
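
The orders of magnitude in this argument can be checked with a few lines of arithmetic. This is a minimal sketch using the two figures quoted above (a wavelength of 5000 Å and a zone dimension of $10^8$ cm$^{-1}$); the variable names are illustrative.

```python
import math

ANGSTROM_IN_CM = 1e-8

wavelength_cm = 5000 * ANGSTROM_IN_CM  # visible light, as quoted in the text
zone_dimension = 1e8                   # typical zone dimension in cm^-1, as quoted

# Photon wave vector q = 2*pi / wavelength.
q = 2 * math.pi / wavelength_cm

# Size of the shift in k relative to the zone dimensions.
relative_shift = q / zone_dimension

print(f"q = {q:.2e} cm^-1")
print(f"q / (zone dimension) = {relative_shift:.1e}  ({100 * relative_shift:.2f} percent)")
```

The shift is roughly a tenth of a percent of the zone dimensions, which is why the transition can be treated as vertical in a plot of energy against k.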

For the electron's energy to change by $\hbar\omega$, typically a few electron volts, the
electron must move from one band to another without appreciable change in wave
vector. Such processes are known as interband transitions.7 They can occur as soon
as $\hbar\omega$ exceeds $\mathcal{E}_{n'}(\mathbf{k}) - \mathcal{E}_n(\mathbf{k})$ for some k, and for two bands $n$ and $n'$, where $\mathcal{E}_n(\mathbf{k})$ is
below the Fermi level (so that such an electron is available for excitation) and $\mathcal{E}_{n'}(\mathbf{k})$
is above the Fermi level (so that the final electronic level is not made unavailable by
the Pauli principle). This critical energy or frequency is called the interband threshold.8
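
The definition of the interband threshold translates directly into a search over a sampled band structure. The sketch below is illustrative only: the function name is hypothetical, and the three "bands" are made-up numbers (in eV, with the Fermi level at zero) chosen to show the logic, not data for any real metal.

```python
def interband_threshold(bands, fermi_energy):
    """Smallest E_n'(k) - E_n(k) with E_n(k) below and E_n'(k) above the Fermi level.

    `bands` is a list of bands; each band is a list of energies sampled on
    the same grid of k-points. Only direct (same-k) transitions are considered.
    Returns None if no such transition exists on the grid.
    """
    threshold = None
    for levels_at_k in zip(*bands):
        occupied = [e for e in levels_at_k if e < fermi_energy]
        empty = [e for e in levels_at_k if e > fermi_energy]
        if not occupied or not empty:
            continue
        # Cheapest direct transition at this k: highest occupied to lowest empty level.
        gap = min(empty) - max(occupied)
        if threshold is None or gap < threshold:
            threshold = gap
    return threshold

# Hypothetical energies (eV) at four k-points; the Fermi level is at 0.
toy_bands = [
    [-3.0, -2.8, -2.5, -2.2],  # filled band lying below the Fermi level
    [-1.0, -0.4, 0.3, 0.9],    # conduction band, crossing the Fermi level
    [4.5, 4.2, 4.0, 3.8],      # empty band above the conduction band
]
toy_threshold = interband_threshold(toy_bands, fermi_energy=0.0)
print(f"toy interband threshold: {toy_threshold:.2f} eV")
```

In this toy example the threshold comes from exciting an electron out of the filled band into an empty conduction band level, rather than from exciting a conduction electron upward.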

The  interband  threshold  may  be  due  either  to  the  excitation  of  electrons  from  the 
conduction  band  (highest  band  containing  some  electrons)  into  higher  unoccupied 
levels,  or  to  the  excitation  of  electrons  from  filled  bands  into  unoccupied  levels  in 
the  conduction  band  (lowest  band  containing  some  unoccupied  levels). 

In  the  alkali  metals  the  filled  bands  lie  far  below  the  conduction  band,  and  the 
excitation  of  conduction  band  electrons  to  higher  levels  gives  the  interband  threshold. 
Since  the  Fermi  surface  in  the  alkali  metals  is  so  close  to  a  free  electron  sphere,  the 
bands  above  the  conduction  band  are  also  quite  close  to  free  electron  bands,  especially 


7  More  precisely,  they  are  known  as  direct  interband  transitions.  In  general  the  analysis  of  optical 
data  is  complicated  by  the  further  possibility  of  indirect  interband  transitions,  in  which  the  electronic  wave 
vector  k  is  not  conserved,  the  missing  crystal  momentum  being  carried  away  by  a  quantized  lattice  vibration 
or  phonon.  Since  phonon  energies  are  very  much  less  than  optical  photon  energies  in  the  monovalent  metals 
(Chapters  23  and  24),  our  general  conclusions  are  not  very  sensitive  to  the  possibility  of  indirect  transitions, 
and  we  shall  ignore  them.  They  cannot  be  ignored,  however,  in  a  more  precise  quantitative  theory. 

8 Interband transitions are explicitly prohibited in the semiclassical model of Chapters 12 and 13
by condition (12.10). When the frequency becomes comparable to the interband threshold, the semiclassical
AC  conductivity  (13.34)  must  be  used  cautiously,  if  at  all,  since  corrections  coming  from  the  more  general 
form  (13.37)  may  be  quite  important. 


for values of k within the Fermi “sphere,” which does not reach all the way to the
zone faces. A free electron estimate of the threshold energy $\hbar\omega$ follows from observing
that the occupied conduction band levels with energies closest to the next highest
free electron levels at the same k occur at points on the Fermi sphere nearest to a
Bragg plane; i.e., at points (Figure 15.1) where the Fermi sphere meets the lines $\Gamma N$.
As a result, the interband threshold is

$$\hbar\omega = \frac{\hbar^2}{2m}(2k_0 - k_F)^2 - \frac{\hbar^2}{2m}k_F^2. \tag{15.5}$$

Here $k_0$ is the length of the line $\Gamma N$ from the center of the zone to the midpoint of one
of the zone faces (Figure 15.9), and satisfies (see page 285) $k_F = 0.877k_0$. If $k_0$ is
expressed in terms of $k_F$, Eq. (15.5) gives

$$\hbar\omega = 0.64\,\mathcal{E}_F. \tag{15.6}$$
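
The step from (15.5) to (15.6) is a one-line calculation once everything is divided by the free electron Fermi energy $\hbar^2 k_F^2/2m$. A minimal sketch, with an illustrative function name:

```python
K_F_OVER_K0 = 0.877  # ratio k_F / k_0 for the bcc free electron sphere

def threshold_over_fermi_energy(kf_over_k0):
    """Interband threshold of Eq. (15.5), in units of the Fermi energy.

    Dividing (15.5) by hbar^2 k_F^2 / 2m gives (2 k_0 / k_F - 1)^2 - 1.
    """
    k0_over_kf = 1.0 / kf_over_k0
    return (2.0 * k0_over_kf - 1.0) ** 2 - 1.0

ratio = threshold_over_fermi_energy(K_F_OVER_K0)
print(f"threshold / Fermi energy = {ratio:.2f}")
```

The same function shows how sensitive the estimate is to the geometry: a sphere that came closer to the zone face (a larger value of $k_F/k_0$) would give a lower threshold.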


Figure  15.9 

Free electron determination of the
threshold energy for interband absorption
in the alkali metals. Numerically,
$\hbar\omega = 0.64\,\mathcal{E}_F$.


Figure 15.10 shows Re $\sigma(\omega)$ as deduced from the measured reflectivities of sodium,
potassium, and rubidium. At lower frequencies the sharp decrease with increasing
frequency characteristic of the free electron model (see Problem 2) is observed. In
the neighborhood of $0.64\,\mathcal{E}_F$, however, there is a noticeable rise in Re $\sigma(\omega)$, a striking
confirmation of the nearly free electron estimate of the interband threshold.

The situation is quite different in the noble metals, due to the d-bands. Figure 15.11
shows the computed band structure of Cu, including the lowest-lying completely
empty bands. Note that these are also recognizable distortions of the free electron
bands displayed below them. The threshold for exciting an electron up from the
conduction band occurs at point b (which is where the Fermi surface “neck” meets
the hexagonal zone face (Figure 15.5a)) with an energy proportional to the length
of the upper vertical arrow, about 4 eV.

However, d-band electrons can be excited into unoccupied conduction band levels
with considerably less energy than this. Such a transition occurs at the same point
b, with an energy difference proportional to the length of the lower vertical arrow,
about 2 eV. Another, somewhat lower, transition occurs at point a.


Figure 15.10

Re $\sigma(\omega)$ (plotted in units of $10^{14}$ sec$^{-1}$), deduced from reflectivity
measurements in three alkali metals. The interband
threshold is evident, and occurs
quite close to $0.64\,\mathcal{E}_F$, where $\mathcal{E}_F$ is
the free electron Fermi energy
given in Table 2.1, page 38.
(Courtesy of N. Smith.)


The  measured  absorption  in  copper  (see  Figure  15.12)  does  increase  sharply  at 
about  2  eV.  Thus  its  reddish  color  is  a  direct  manifestation  of  the  rather  low  threshold 
for the excitation of d-band electrons into the conduction band, 2 eV lying somewhere
in  the  orange  part  of  the  visible  spectrum.9 
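
To connect the 2 eV threshold with a color, one can convert photon energy to wavelength. The sketch below uses the standard value $hc \approx 1239.84$ eV nm; the function name is illustrative, and the 2 and 4 eV inputs are the threshold energies quoted in the text for copper.

```python
HC_EV_NM = 1239.84  # Planck constant times speed of light, in eV * nm

def photon_wavelength_nm(energy_ev):
    """Vacuum wavelength (nm) of a photon with the given energy (eV)."""
    return HC_EV_NM / energy_ev

for energy_ev in (2.0, 4.0):
    wavelength = photon_wavelength_nm(energy_ev)
    print(f"{energy_ev:.1f} eV  ->  {wavelength:.0f} nm")
```

A 2 eV photon has a wavelength of about 620 nm, at the orange-red end of the visible spectrum, while 4 eV lies in the near ultraviolet.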

The  inference  of  band  structure  from  optical  properties  remains  simple  in  some 
of  the  polyvalent  metals,10  but  it  can  be  far  from  straightforward  in  others.  Frequently, 
for  example,  there  are  points  on  the  Fermi  surface  where  the  conduction  band  is 
degenerate  with  the  next  higher  band,  leading  to  interband  transitions  at  arbitrarily 
low  energies  and  the  absence  of  any  sharp  interband  threshold. 


9  A  threshold  at  about  the  same  energy  also  produces  the  yellowish  color  of  gold.  Silver,  however, 
is  more  complicated:  the  threshold  for  d-band  excitation  and  a  plasmon-like  threshold  apparently  merge 
at  about  4  eV  (Figure  15.12),  resulting  in  a  more  uniform  reflectivity  throughout  the  visible  range  (about 
2  to  4  eV). 

10  See,  for  example,  the  discussion  of  aluminum  below. 
Figure 15.11

Burdick's calculated bands for
copper, illustrating that the absorption
threshold for transitions
up from the conduction
band is about 4 eV, while the
threshold for transitions from
the d-band to the conduction
band is only about 2 eV. (The
energy scale is in tenths of a
rydberg (0.1 Ry = 1.36 eV).)
Note the resemblance of the
bands other than the d-bands to
the free electron bands plotted
below.



Figure  15.12 

The imaginary part of the dielectric constant, $\epsilon_2(\omega) = \mathrm{Im}\,\epsilon(\omega)$ vs. $\hbar\omega$, as deduced from reflectivity
measurements. (H. Ehrenreich and H. R. Phillip, Phys. Rev. 128, 1622 (1962).) Note the characteristic
free electron behavior ($1/\omega^3$) below about 2 eV in copper and below about 4 eV in silver.
The onset of interband absorption is quite apparent.
THE DIVALENT METALS

The  divalent  metals  lie  in  the  columns  of  the  periodic  table  immediately  to  the  right 
of  the  alkali  and  noble  metals.  Their  electronic  structure  and  crystal  structures  are 
given  in  Table  15.2. 

Table 15.2

DIVALENT METALS

IIA METALS                          IIB METALS

Be:  $1s^22s^2$            hcp
Mg:  $[\mathrm{Ne}]3s^2$   hcp
Ca:  $[\mathrm{Ar}]4s^2$   fcc      Zn:  $[\mathrm{Ar}]3d^{10}4s^2$          hcp
Sr:  $[\mathrm{Kr}]5s^2$   fcc      Cd:  $[\mathrm{Kr}]4d^{10}5s^2$          hcp
Ba:  $[\mathrm{Xe}]6s^2$   bcc      Hg:  $[\mathrm{Xe}]4f^{14}5d^{10}6s^2$   *

*  Rhombohedral monatomic Bravais lattice.


In  contrast  to  the  IA  (alkali)  and  IB  (noble)  metals,  the  properties  of  the  IIA  and 
IIB  metals  are  considerably  less  drastically  affected  by  the  presence  or  absence  of 
the  filled  d-band.  Band  structure  calculations  indicate  that  in  zinc  and  cadmium  the 
d-band  lies  completely  below  the  bottom  of  the  conduction  band,  while  in  mercury 
it  overlaps  the  conduction  band  only  in  a  narrow  region  quite  near  the  bottom.  As 
a  result,  the  d-bands  are  relatively  inert,  and  the  variation  in  metallic  properties  with 
crystal  structure  is  far  more  striking  than  the  variation  from  column  IIA  to  IIB. 

The  Cubic  Divalent  Metals 

With two electrons per primitive cell, calcium, strontium, and barium could, in principle,
be insulators. In the free electron model, the Fermi sphere has the same volume
as  the  first  zone  and  therefore  intersects  the  zone  faces.  The  free  electron  Fermi 
surface  is  thus  a  fairly  complex  structure  in  the  first  zone,  and  pockets  of  electrons 
in  the  second.  From  the  point  of  view  of  nearly  free  electron  theory,  the  question  is 
whether  the  effective  lattice  potential  (i.e.,  the  pseudopotential)  is  strong  enough  to 
shrink  the  second-zone  pockets  down  to  zero  volume,  thereby  filling  up  all  the 
unoccupied  levels  in  the  first  zone.  Evidently  this  is  not  the  case,  since  the  group  II 
elements are all metals. However, the detailed structures of the Fermi surfaces in
the group IIA metals (the “alkali earths”) are not known with confidence, since these
metals are difficult to obtain in pure form and the standard probes are correspondingly
ineffective.
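
The geometric statement above can be checked numerically. The sketch below works in units of 2π/a for the cubic lattices of Table 15.2 (fcc for Ca and Sr, bcc for Ba). It verifies that the divalent free electron sphere has the same volume as the first zone, and that its radius exceeds the distance from the zone center to the nearest zone face. All function and variable names are illustrative assumptions.

```python
import math

# Atoms per conventional cubic cell of side a (monatomic Bravais lattices).
ATOMS_PER_CUBE = {"fcc": 4, "bcc": 2}
# Distance from the zone center to the nearest zone face, in units of 2*pi/a.
NEAREST_FACE = {"fcc": math.sqrt(3) / 2, "bcc": math.sqrt(2) / 2}

def fermi_radius(valence: int, atoms_per_cube: int) -> float:
    """Free electron k_F in units of 2*pi/a, from k_F^3 = 3*pi^2*n."""
    n = valence * atoms_per_cube  # electron density in units of 1/a^3
    return (3 * math.pi**2 * n) ** (1 / 3) / (2 * math.pi)

def sphere_volume(k: float) -> float:
    """Volume of a sphere of radius k, in units of (2*pi/a)^3."""
    return 4 * math.pi * k**3 / 3

def zone_volume(atoms_per_cube: int) -> float:
    """First-zone volume (2*pi)^3 / v_cell, with v_cell = a^3 / atoms_per_cube."""
    return float(atoms_per_cube)

for lattice, n_atoms in ATOMS_PER_CUBE.items():
    k_f = fermi_radius(2, n_atoms)
    print(lattice, round(k_f, 4), round(sphere_volume(k_f), 4),
          zone_volume(n_atoms), k_f > NEAREST_FACE[lattice])
```

In both lattices the sphere pokes through the nearest faces while leaving the zone corners unoccupied, which is the origin of the first-zone holes and second-zone electron pockets.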

Mercury 

Mercury, having a rhombohedral Bravais lattice, requires one to embark on unpleasantly
unfamiliar geometric constructions in k-space. However, de Haas-van Alphen
measurements have been made11 indicating pockets of electrons in the second
zone and a complex extended figure in the first.


11  G.  B.  Brandt  and  J.  A.  Rayne,  Phys.  Rev.  148,  644  (1966). 
The Hexagonal Divalent Metals

Good de Haas-van Alphen data are available for beryllium, magnesium, zinc, and
cadmium. The data suggest Fermi surfaces that are more or less recognizable distortions
of the (extremely complex) structure found by simply drawing a free electron
sphere containing four levels per primitive hexagonal cell (remember that the hcp
structure has two atoms per primitive cell) and seeing how it is sliced up by the
Bragg planes. This is illustrated in Figure 9.11 for the “ideal” ratio12 c/a = 1.633.

A complication characteristic of all hcp metals arises from the vanishing of the
structure  factor  on  the  hexagonal  faces  of  the  first  zone,  in  the  absence  of  spin-orbit 
coupling  (page  169).  It  follows  that  a  weak  periodic  potential  (or  pseudopotential) 
will  not  produce  a  first-order  splitting  in  the  free  electron  bands  at  these  faces.  This 
fact  transcends  the  nearly  free  electron  approximation:  Quite  generally,  if  spin-orbit 
coupling  is  neglected,  there  must  be  at  least  a  twofold  degeneracy  on  these  faces.  As 
a  result,  to  the  extent  that  spin-orbit  coupling  is  small  (as  it  is  in  the  lighter  elements) 
it  is  better  to  omit  these  Bragg  planes  in  constructing  the  distorted  free  electron 
Fermi  surface,  leading  to  the  rather  simpler  structures  shown  in  Figure  9.12.  Which 
picture  is  the  more  accurate  depends  on  the  size  of  the  gaps  induced  by  the  spin-orbit 
coupling.  It  may  happen  that  the  gaps  have  such  a  size  that  the  representation  of 
Figure 9.11 is valid for the analysis of low-field galvanomagnetic data, while at high
fields  the  probability  of  magnetic  breakthrough  at  the  gaps  is  large  enough  that  the 
representation  of  Figure  9.12  is  more  appropriate. 

This  complication  makes  it  rather  difficult  to  disentangle  de  Haas-van  Alphen 
data  in  hexagonal  metals.  Beryllium  (with  very  weak  spin-orbit  coupling)  has  perhaps 
the  simplest  Fermi  surface  (Figure  15.13).  The  “coronet”  encloses  holes  and  the  (two) 
“cigars”  enclose  electrons,  so  that  beryllium  furnishes  a  simple,  if  topologically 
grotesque,  example  of  a  compensated  metal. 


THE  TRIVALENT  METALS 

Family  resemblances  diminish  still  further  among  the  trivalent  metals,  and  we  con¬ 
sider  only  the  simplest,  aluminum.13 


Aluminum 

The  Fermi  surface  of  aluminum  is  very  close  to  the  free  electron  surface  for  a  face- 
centered  cubic  monatomic  Bravais  lattice  with  three  conduction  electrons  per  atom. 


12 Be and Mg have c/a ratios close to the ideal value, but Zn and Cd have a c/a ratio about 15
percent larger.

13 Boron is a semiconductor. The crystal structure of gallium (complex orthorhombic) leads to a free
electron Fermi surface extending into the ninth zone. Indium has a centered tetragonal lattice that can
be regarded as fcc, slightly stretched along one cube axis, and many of its electronic properties are
recognizable distortions of those of aluminum. Thallium is the heaviest hcp metal, and therefore the one with
strongest spin-orbit coupling. Its Fermi surface appears to resemble the free electron surface of Figure 9.11,
in which the splittings on the hexagonal faces are retained (in contrast to beryllium, the lightest hcp metal).
Figure 15.13

The  measured  Fermi  surface  of  beryllium.  (T.  L.  Loucks  and  P.  H.  Cutler,  Phys.  Rev.  A  133,  819 
(1964).)  The  free  electron  “monster”  (upper  left)  shrinks  to  a  “coronet”  (upper  right)  and  all  the 
other  free  electron  pieces  (lower  left)  vanish  except  for  two  “cigars”  (lower  right).  The  coronet 
encloses  unoccupied  levels,  and  the  cigars  contain  electrons. 


pictured  in  Figure  15.14.  One  can  verify  (Problem  4)  that  the  free  electron  Fermi 
surface  is  entirely  contained  in  the  second,  third,  and  fourth  zones  (Figure  15.14c). 
When  displayed  in  a  reduced-zone  scheme  the  second-zone  surface  (Figure  15.14d) 
is  a  closed  structure  containing  unoccupied  levels,  while  the  third-zone  surface 
(Figure  15.14e)  is  a  complex  structure  of  narrow  tubes.  The  amount  of  surface  in  the 
fourth  zone  is  very  small,  enclosing  tiny  pockets  of  occupied  levels. 

The  effect  of  a  weak  periodic  potential  is  to  eliminate  the  fourth-zone  pockets  of 
electrons,  and  reduce  the  third-zone  surface  to  a  set  of  disconnected  “rings”  (Figure 
15.15). This is consistent with the de Haas-van Alphen data, which reveal no fourth-zone
electron pockets and give the dimensions of the second- and third-zone surfaces
quite  precisely. 

Aluminum provides a striking illustration of the semiclassical theory of Hall
coefficients. The high-field Hall coefficient should be $R_H = -1/(n_e - n_h)ec$, where $n_e$
and $n_h$ are the number of levels per unit volume enclosed by the particle-like and
hole-like branches of the Fermi surface. Since the first zone of aluminum is completely
filled and accommodates two electrons per atom, one of the three valence electrons
per atom remains to occupy second- and third-zone levels. Thus

    $n_e^{\mathrm{II}} + n_e^{\mathrm{III}} = \frac{n}{3},$    (15.7)


Figure  15.14 

(a) First Brillouin zone for an fcc crystal. (b) Second Brillouin zone for an fcc
crystal. (c) The free electron sphere for a trivalent monatomic fcc Bravais lattice.
It completely encloses the first zone, passing through and beyond the second
zone into the third and (at the corners) ever so slightly into the fourth. (d) Portion
of the free electron sphere in the second zone when translated back into the first
zone. The convex surface encloses holes. (e) Portion of the free electron sphere
in the third zone when translated back into the first zone. The surface encloses
particles. (The fourth-zone surface translates into microscopic pockets of electrons
at all corner points.) (From R. Lück, doctoral dissertation, Technische
Hochschule, Stuttgart, 1965.)


Figure  15.15 

The  third-zone  surface  of  aluminum,  in  a  reduced-zone  scheme. 
(From N. W. Ashcroft, Phil. Mag. 8, 2055 (1963).)
where n is the free electron carrier density appropriate to valence 3. On the other
hand, since the total number of levels in any zone is enough to hold two electrons
per atom, we also have

    $n_e^{\mathrm{II}} + n_h^{\mathrm{II}} = 2\left(\frac{n}{3}\right).$    (15.8)

Subtracting (15.8) from (15.7) gives

    $n_e^{\mathrm{III}} - n_h^{\mathrm{II}} = -\frac{n}{3}.$    (15.9)

Thus  the  high-field  Hall  coefficient  should  have  a  positive  sign  and  yield  an  effective 
density  of  carriers  a  third  of  the  free  electron  value.  This  is  precisely  what  is  observed 
(see  Figure  1.4).  From  the  point  of  view  of  the  high-field  Hall  effect  aluminum  has 
one  hole  per  atom  (the  net  result  of  slightly  more  than  one  hole  per  atom  in  the 
second  zone,  and  a  small  fraction  of  an  electron  per  atom  in  the  third)  rather  than 
three  electrons. 
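
The counting argument can be followed with exact fractions. In the sketch below every density is expressed in units of n, the free electron density for valence 3, so that one electron per atom corresponds to n/3. The variable names are our own and are chosen only to mirror the steps of the argument.

```python
from fractions import Fraction

# All densities are in units of n, the free electron density for valence 3,
# so one electron per atom corresponds to n/3.
per_atom = Fraction(1, 3)

# (15.7): one electron per atom is left over for the second and third zones.
ne2_plus_ne3 = 1 * per_atom
# (15.8): the second zone has room for two electrons per atom (filled plus empty levels).
ne2_plus_nh2 = 2 * per_atom
# (15.9): subtracting (15.8) from (15.7) gives n_e(III) - n_h(II).
ne3_minus_nh2 = ne2_plus_ne3 - ne2_plus_nh2

# High-field Hall coefficient R_H = -1/((n_e - n_h) e c), in units of 1/(n e c).
hall_coefficient = -1 / ne3_minus_nh2

print("n_e - n_h =", ne3_minus_nh2, "(in units of n)")
print("R_H =", hall_coefficient, "(in units of 1/(n e c))")
```

The net carrier density is -n/3, so the Hall coefficient is positive and three times larger in magnitude than the free electron value -1/(nec).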

The reflectivity of aluminum (Figure 15.16a) has a very sharp minimum, which
is neatly accounted for as an interband transition in a nearly free electron model.14
Figure 15.16b shows the energy bands in a nearly free electron calculation plotted
along the line ΓX (passing through the center of the square face of the zone). The
bands are plotted as a function of k within the square face in Figure 15.16c. In a
nearly free electron model the bands in Figure 15.16c are easily shown (see Eq. (9.27))
to be displaced by a constant amount $2|U|$, independent of k. Because of the position
of the Fermi level it is clear from Figure 15.16c that there is a range of values of k within
the square face for which transitions are possible from occupied to unoccupied levels,
all of which differ in energy by $2|U|$. This results in a resonant absorption at $\hbar\omega = 2|U|$,
and a pronounced dip in the reflectivity.

The value of $|U|$ deduced from the position of the dip in Figure 15.16a is in good
agreement  with  the  value  deduced  from  de  Haas-van  Alphen  data.15 


THE TETRAVALENT METALS

The  only  tetravalent  metals  are  tin  and  lead,  and  we  again  consider  only  the  simplest, 
lead.16 


14 The interband thresholds in the alkali metals were explained in terms of an essentially free electron
model; i.e., it was unnecessary to take into account any of the distortions in the free electron bands produced
by the lattice potential. The example discussed here is at the next level of complexity: the pertinent transition
takes place between levels whose wave vectors lie in a Bragg plane, and the splitting between them is
therefore entirely due to the first-order perturbation of the periodic potential, in a nearly free electron
model.

15 In the nearly free electron model the cross-sectional areas at a Bragg plane (which are extremal,
and therefore directly accessible from the de Haas-van Alphen data) are entirely determined by the matrix
element of the periodic potential $|U|$ associated with that plane. See Eq. (9.39).

16  Carbon  is  an  insulator  or  a  semimetal  (see  below),  depending  on  crystal  structure.  Silicon  and 
germanium  are  semiconductors  (Chapter  28).  Tin  has  both  a  metallic  (white  tin)  and  a  semiconducting 
(grey  tin)  phase.  Grey  tin  has  the  diamond  structure,  but  white  tin  is  body-centered  tetragonal  with  a 
two-atom  basis.  The  Fermi  surface  has  been  measured  and  calculated  and  is  again  a  recognizable  distortion 
of  the  free  electron  surface. 
Figure 15.16

(a) Reflectivity of aluminum in the energy range $0 \le \hbar\omega \le 5$ eV.
(H. E. Bennet, M. Silver, and E. J. Ashley, J. Opt. Soc. Am. 53, 1089
(1963).) (b) Energy bands plotted along ΓX, and also in the square
zone face perpendicular to ΓX (dashed). (c) A second view of the energy
bands in the square zone face perpendicular to ΓX. For a weak pseudopotential
these bands are nearly parallel and separated by an amount
$2|U|$. When $\hbar\omega$ just exceeds this amount it becomes possible for electrons
within $\hbar\omega$ of the Fermi energy to be excited from the lower to the upper
band. This is the source of the structure in (a). (See N. W. Ashcroft
and K. Sturm, Phys. Rev. B 3, 1898 (1971).)


Lead 

Like aluminum, lead has an fcc Bravais lattice and its free electron Fermi surface is
rather  similar,  except  that  the  sphere  must  have  one  third  more  volume  and  hence 
10  percent  greater  radius,  to  accommodate  four  electrons  per  atom  (see  Figure  9.  ). 
The  fourth-zone  electron  pockets  are  therefore  rather  larger  than  in  aluminum,  but 
are  still  apparently  eliminated  by  the  crystal  potential.  The  hole  surface  in  the  second 
zone  is  smaller  than  in  aluminum,  and  the  extended  tubular  particle  surface  in  the 
third zone is less slender.17 Since lead has an even valence, the second- and third-zone
surfaces must contain the same number of levels; i.e., $n_h^{\mathrm{II}} = n_e^{\mathrm{III}}$. Its galvanomagnetic
properties  are  rather  complex,  however,  since  the  orbits  of  the  third  zone  Fermi 
surface  are  not  all  of  a  single  carrier  type. 
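
The comparison between the aluminum and lead free electron spheres can be made quantitative. The sketch below measures wave vectors in units of 2π/a, so that the first zone is the same for both metals and only the valence differs. The function name `fcc_fermi_radius` and the other identifiers are illustrative assumptions.

```python
import math

def fcc_fermi_radius(valence: int) -> float:
    """Free electron k_F in units of 2*pi/a for a monatomic fcc lattice.

    There are 4 atoms per cubic cell, so n = 4*valence/a^3 and
    k_F = (3*pi^2*n)^(1/3) = (12*pi^2*valence)^(1/3) / a.
    """
    return (12 * math.pi**2 * valence) ** (1 / 3) / (2 * math.pi)

k_al = fcc_fermi_radius(3)    # aluminum, three electrons per atom
k_pb = fcc_fermi_radius(4)    # lead, four electrons per atom
gamma_w = math.sqrt(5) / 2    # zone center to corner W, the farthest point of the first zone

print(f"k_F(Al) = {k_al:.4f}, k_F(Pb) = {k_pb:.4f}, Gamma-W = {gamma_w:.4f}")
print(f"radius ratio Pb/Al = {k_pb / k_al:.4f}")
```

The ratio of radii is (4/3)^(1/3), about 1.10, which is the 10 percent quoted above. Both spheres extend beyond the corner W, so both completely enclose the first zone; the aluminum sphere only barely does so, while the lead sphere reaches well past the corners, consistent with its larger fourth-zone pockets.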


THE  SEMIMETALS 

The graphite form of carbon and the conducting pentavalent elements are semimetals.18
Semimetals are metals in which the carrier concentration is several orders
of magnitude lower than the $10^{22}/\mathrm{cm}^3$ typical of ordinary metals.

Graphite 

Graphite  has  a  simple  hexagonal  Bravais  lattice  with  four  carbon  atoms  per  primitive 
cell.  Lattice  planes  perpendicular  to  the  c-axis  have  the  honeycomb  arrangement 
(Figure  15.17).  The  structure  is  peculiar  in  that  the  separation  between  lattice  planes 
along  the  c-axis  is  almost  2.4  times  the  nearest-neighbor  distance  within  planes. 
There  is  hardly  any  band  overlap,  the  Fermi  surface  consisting  primarily  of  tiny 
pockets of electrons and holes, with carrier densities of about $n_e = n_h = 3 \times 10^{18}/\mathrm{cm}^3$.


Figure  15.17 

Crystal  structure  of  graphite  (not  to  scale).  The  distance 
between  the  top  and  bottom  planes  is  almost  4.8  times 
the  nearest-neighbor  distance  within  planes. 


The  Pentavalent  Semimetals 

The noninsulating pentavalent elements, As ($[\mathrm{Ar}]3d^{10}4s^24p^3$), Sb ($[\mathrm{Kr}]4d^{10}5s^25p^3$),
and Bi ($[\mathrm{Xe}]4f^{14}5d^{10}6s^26p^3$), are also semimetals. All three have the same crystal
structure:  a  rhombohedral  Bravais  lattice  with  a  two-atom  basis,  as  described  in 


17 Since lead is a very heavy metal it is important to take into account spin-orbit coupling in
computing the Fermi surface. See E. Fawcett, Phys. Rev. Lett. 6, 534 (1961).

18 Semimetals should not be confused with semiconductors. A pure semimetal at T = 0 is a conductor:
there  are  partially  filled  electron  and  hole  bands.  A  semiconductor,  however,  conducts  only  because  carriers 
are  either  thermally  excited  or  introduced  by  impurities.  A  pure  semiconductor  at  T  =  0  is  an  insulator 
(see  Chapter  28). 
Table 7.5. Having an even number of conduction electrons per primitive rhombohedral
cell, they come very close to being insulators, but there is a slight band overlap,
leading to a very small number of carriers. The Fermi surface of bismuth consists of
several quite eccentric, ellipsoidal-shaped pockets of electrons and holes; the total
density of electrons (and the total density of holes, since these are compensated
semimetals) is about $3 \times 10^{17}/\mathrm{cm}^3$, down from typical metallic densities by about a
factor of $10^5$. Similar pockets have been observed in antimony, though apparently
not so perfectly ellipsoidal, and with larger electron (and hole) densities, about
$5 \times 10^{19}/\mathrm{cm}^3$. In arsenic the common electron or hole density is $2 \times 10^{20}/\mathrm{cm}^3$.
The pockets are still less ellipsoidal, the hole pockets apparently being connected by
thin “tubes” leading to an extended surface.19
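
To put the quoted carrier densities side by side, the short sketch below compares each one with the $10^{22}/\mathrm{cm}^3$ benchmark given earlier for ordinary metals. The benchmark is only a representative figure (ordinary metals range up to roughly $10^{23}/\mathrm{cm}^3$, which is why the reduction for bismuth is quoted as about $10^5$), and the dictionary layout is our own.

```python
# Carrier densities quoted in this chapter, in carriers per cm^3.
TYPICAL_METAL = 1e22  # benchmark for ordinary metals; real values range up to ~1e23

densities = {"Bi": 3e17, "graphite": 3e18, "Sb": 5e19, "As": 2e20}

# Factor by which each semimetal falls below the benchmark.
suppression = {name: TYPICAL_METAL / n for name, n in densities.items()}

for name, factor in suppression.items():
    print(f"{name}: n = {densities[name]:.0e} /cm^3, below 1e22 by a factor of {factor:.0f}")
```

The ordering Bi < graphite < Sb < As in carrier density is worth keeping in mind when comparing their heat capacities and resistivities with free electron estimates.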

These  low  carrier  densities  explain  why  the  pentavalent  metals  provide  such  glaring 
exceptions  to  the  data  tabulated  in  Chapters  1  and  2  in  rough  support  of  free  electron 
theory.  Small  pockets  of  carriers  imply  little  Fermi  surface  area  and  hence  a  small 
density  of  levels  at  the  Fermi  energy.  This  is  why20  the  linear  term  in  the  heat  capacity 
of  bismuth  is  only  about  5  percent  of  the  naive  free  electron  value  for  a  pentavalent 
element,  and  in  antimony  only  about  35  percent  (see  Table  2.3).  The  resistivity  of 
bismuth  is  typically  10  to  100  times  larger  than  that  of  most  metals,  and  in  antimony 
it  is  about  3  to  30  times  larger  (see  Table  1.2). 

It  is  interesting  to  note  that  the  crystal  structure  of  bismuth  (and  the  other  two 
semimetals)  is  only  a  slight  distortion  of  a  simple  cubic  monatomic  Bravais  lattice, 
for it can be constructed as follows: Take a sodium chloride structure (Figure 4.24),
stretch  it  out  slightly  along  the  (111)  direction  (so  that  the  cube  axes  make  equal 
angles  of  somewhat  less  than  90°  with  each  other),  and  displace  each  chlorine  site 
very  slightly  by  the  same  amount  in  the  (111)  direction.  The  bismuth  structure  has 
one  bismuth  atom  at  each  of  the  resulting  sodium  and  chlorine  sites. 

As  a  result,  the  pentavalent  semimetals  provide  a  striking  illustration  of  the  crucial 
importance  of  crystal  structure  in  determining  metallic  properties.  Were  they  exactly 
simple  cubic  Bravais  lattices,  then,  having  an  odd  valence,  they  would  be  very  good 
metals  indeed.  Thus  the  band  gaps  introduced  by  a  very  slight  deviation  from  simple 
cubic change the effective number of carriers by a factor as large as $10^5$!


THE  TRANSITION  METALS 

The  three  rows  of  the  periodic  table  extending  from  the  alkali  earths  (calcium, 
strontium,  and  barium)  to  the  noble  metals  (copper,  silver,  and  gold)  each  contain 
nine  transition  elements,  in  which  the  d-shell  that  is  empty  in  the  alkali  earths,  and 
completely filled in the noble metals, is gradually filled in. The stable room temperature
forms of the transition elements are either monatomic fcc or bcc Bravais lattices,
or hcp structures. All are metals, but unlike the metals we have described up to now


19 See M. G. Priestley et al., Phys. Rev. 154, 671 (1967).

20  In  understanding  these  deviations  from  free  electron  theory  it  is  also  important  to  note  that  the 
effective  masses  in  the  pentavalent  semimetals  are  generally  substantially  smaller  than  the  free  electron 
mass;  as  a  result  the  disparity  in  the  conductivity  is  not  so  great  as  the  disparity  in  the  number  of  carriers 
would  suggest  (since  their  velocities  are  higher  for  a  given  k  than  the  velocity  of  a  free  electron). 
(the noble metals and the so-called simple metals) their properties are to a considerable
degree dominated by the d-electrons.

Calculated  transition  metal  band  structures  indicate  that  the  d-band  not  only  lies 
high  up  in  the  conduction  band  (as  in  the  noble  metals),  but  in  general  (unlike  the 
noble  metals)  extends  through  the  Fermi  energy.  When  levels  on  the  Fermi  surface 
are  d-derived  levels,  the  tight-binding  approximation  is  probably  a  more  sound 
conceptual  starting  point  for  estimating  the  Fermi  surface  than  the  nearly  free  electron 
(or  OPW)  constructions,  and  there  is  no  longer  any  reason  to  expect  the  transition 
metal  Fermi  surfaces  to  resemble  slightly  distorted  free  electron  spheres.  A  typical 
example, a suggested Fermi surface for bcc tungsten ($[\mathrm{Xe}]4f^{14}5d^46s^2$), is shown in
Figure  15.18. 


Figure 15.18

Proposed Fermi surface for bcc tungsten. The six octahedron-shaped pockets at the zone
corners contain holes. They are all equivalent; i.e., any one can be taken into any other
by a translation through a reciprocal lattice vector, so that all physically distinct levels
in the group are contained in any one of them. The twelve smaller pockets in the centers
of the zone faces (only five are visible) are also hole pockets. They are equivalent in pairs
(from opposite faces). The structure in the center is an electron pocket. Tungsten has an
even number of electrons, and is therefore a compensated metal. It therefore follows that
the volume of a large hole pocket plus six times the volume of a small hole pocket is equal
to the volume of the electron pocket at the center of the zone. Consistent with a Fermi
surface composed entirely of closed pockets, the magnetoresistance has been observed to
increase quadratically with H for all field directions, as predicted for a compensated metal
without open orbits. Note that the surface, unlike those considered earlier, is not a
distortion of the free electron surface. This is a consequence of the Fermi level lying
within the d-band, and is characteristic of transition metals. (After A. V. Gold, as quoted
in D. Shoenberg, The Physics of Metals: 1. Electrons, J. M. Ziman, ed., Cambridge, 1969,
p. 112.)
The d-bands are narrower than typical free electron conduction bands, and contain
enough levels to accommodate ten electrons. Since the d-bands contain more
levels  in  a  narrower  energy  range,  the  density  of  levels  is  likely  to  be  substantially 
higher  than  the  free  electron  density  of  levels  throughout  the  energy  region  where  the 
d-band  lies  (see  Figure  15.19).  This  effect  can  be  observed  in  the  electronic  contribution 
to  the  low  temperature  specific  heat.  This  was  shown  in  Chapter  2  to  be  proportional 
to the density of levels at the Fermi energy (Eq. (2.80)).21 An inspection of Table 2.3
reveals  that  the  transition  metal  electronic  specific  heats  are  indeed  significantly 
higher  than  those  of  the  simple  metals.22,23 

Figure  15.19 

Some qualitative features of the d-band and s-band contributions to the density of
levels of a transition metal. The d-band is narrower and contains more levels than the
s-band. Consequently when the Fermi level (separating the shaded and unshaded
regions) lies within the d-band, the density of levels $g(\mathcal{E}_F)$ is very much larger than
the free-electron-like contribution of the s-band alone. (An actual density of levels
would have sharp kinks in it; see the description of van Hove singularities on page
145.) (From J. M. Ziman, Electrons and Phonons, Oxford, New York, 1960.)

Studies  of  the  transition  metals  are  complicated  by  the  fact  that  the  partially  filled 
d-bands  can  give  rise  to  striking  magnetic  properties.  Consequently,  a  far  more  subtle 
treatment  of  electron  spin  interactions  is  required  than  is  found  in  any  of  the  methods 
we  have  described.  These  points  will  be  discussed  further  in  Chapter  32. 

The  de  Haas-van  Alphen  effect  is  also  more  difficult  to  measure  in  transition 
metals, since narrow bands lead to large values of $\partial A/\partial\mathcal{E}$ and therefore to low cyclotron
frequencies (Eq. (12.42)). The high $\omega_c\tau$ regime is therefore more difficult to reach.
Despite these complexities, de Haas-van Alphen data and the more conventional
types  of  band  structure  calculations  (for  what  they  are  worth)  are  now  available  for 


21  The  derivation  of  Eq.  (2.80)  did  not  use  specific  properties  of  the  free  electron  level  density,  and 
the  result  is  equally  valid  for  Bloch  electrons. 

22 Note also that the specific heats of the semimetals are significantly lower, as their very low density
of conduction electrons would lead one to expect.

23  A  detailed  comparison  of  Eq.  (2.80)  with  experiment  is  complicated  by  the  fact  that  there  are 
corrections  due  to  electron-electron  interactions  (typically  only  a  few  percent)  as  well  as  corrections  due 
to  electron-phonon  interactions  (to  be  discussed  in  Chapter  26),  which  can  be  as  large  as  100  percent. 
more than half of the transition metals, and data have been taken even in ferromagnetic
samples.


THE  RARE  EARTH  METALS 

Between lanthanum and hafnium lie the rare earth metals. Their atomic configurations
are characterized by partially filled 4f shells, which, like the partially filled
d-shells of the transition metals, can lead to a variety of magnetic effects. The typical
rare earth atomic configuration is $[\mathrm{Xe}]4f^{n}5d^{(1\ \mathrm{or}\ 0)}6s^2$. The solids can have many types
of crystal structure, but by far the most common room temperature form is hexagonal
close-packed.

There  is  currently  very  little  Fermi  surface  data  on  the  rare  earth  metals,  for  being 
chemically  very  similar  to  one  another,  they  are  difficult  to  obtain  in  sufficiently  pure 
form.  There  have  been  a  few  band  structure  calculations,  but  in  the  absence  of  Fermi 
surface  data  their  reliability  is  not  easily  assessed. 

The customary approach is to treat the conduction band as containing a number
of electrons per atom equal to the nominal chemical valence (three, in almost all
cases). Except for the influence of the 5d atomic levels (which can be considerable)
the conduction band is free electron-like; i.e., the 4f levels are not mixed in, in any
essential way. At first glance this appears surprising, for one might expect the 4f
atomic levels to broaden out into a partially filled24 4f band. Such a band would,
like any partially filled band, contain the Fermi level, and therefore, conversely, at
least some levels on the Fermi surface would have a strong 4f character. This is in
obvious analogy to what indeed happens to the partially filled 3d-, 4d-, or 5d-levels
in the transition metals.

In spite of the analogy this does not occur in the rare earth metals, and levels on
the Fermi surface have very little 4f character. The crucial difference is that the atomic
4f orbitals in the rare earth elements are much more localized than the highest
occupied atomic d-levels in the transition metal elements. As a result, it seems quite
likely that the independent electron approximation breaks down altogether for the
4f electrons, since they meet the requisite conditions (pages 186-187) of yielding
narrow partially filled bands in a tight-binding analysis. Electron-electron interactions
among the 4f electrons at each atomic site are in fact strong enough to yield
local magnetic moments (Chapter 32).

It is sometimes asserted that the 4f band in the rare earths splits into two narrow
parts: one fully occupied well below the Fermi level, and the other completely empty,
well above (Figure 15.20). This picture has dubious validity, but is probably the best
that can be had if one insists on applying the independent electron model to the 4f
electrons. The gap between the two portions of the 4f band is then an attempt to
represent the very stable spin configuration attained by the 4f electrons in the occupied
portion of the band, in which any additional electrons are unable to participate.

However one chooses to describe the 4f electrons, it does appear to be the case
that they can be considered as part of the ion cores in viewing the rare earth metal
band structures, in spite of the fact that the atomic 4f shells are only partially filled.


24 In most of the rare earths there are fewer than 14 electrons outside of the [Xe] configuration.
Figure 15.20

Two hypothetical level density curves for a rare earth metal. (a) The incorrect form,
which naively superposes on a fairly broad s-p-d band a sharp f-band peak, at the Fermi
energy. (b) The partially correct form, which has a fairly broad s-p-d form in the
neighborhood of the Fermi energy, and two f-band peaks, one well below and one well above
the Fermi energy. The most realistic point of view probably abandons the independent
electron approximation (and hence the possibility of drawing simple one-electron
densities of levels) for the 4f electrons.


ALLOYS 

We  conclude  this  chapter  with  the  important  reminder  that  the  metallic  state  is  not 
exhausted  by  a  survey  of  the  periodic  table.  The  construction  of  alloys  from  the  70 
or  so  elemental  metals  is  an  immense  subject  in  its  own  right.  Although  there  is  no 
guarantee  that  any  two  metals  will  dissolve  in  each  other  (indium,  for  example,  will 
not  dissolve  in  gallium),  most  pairs  do  form  so-called  binary  alloys  over  wide  ranges 
of concentration. Ternary (three-component) alloys, quaternary (four-component) alloys,
and  so  on,  are  also  made  and  studied.  Evidently  we  can  construct  in  this  way  an 
immense  number  of  different  metals.25 

Alloys  are  conveniently  grouped  into  two  broad  classes:  ordered  and  disordered. 
The  ordered  alloys,  sometimes  also  called  stoichiometric,  have  the  translational 
symmetry  of  a  Bravais  lattice.  Their  structure  is  given  by  placing  at  each  site  of  the 
Bravais lattice a multiatomic basis. As an example, the alloy known as β-brass has
an  ordered26  phase  in  which  the  two  components  (copper  and  zinc)  have  equal 


25  Some  alloys,  like  indium  antimonide,  are  not  metals,  but  semiconductors. 

26  There  is  also  a  disordered  phase,  and  a  sharp  transition  temperature,  above  which  the  ordering 
disappears. This order-disorder transition can be discussed in terms of the Ising model (in which copper
at a site corresponds to “spin up” and zinc at a site, to “spin down”). See Chapter 33.
concentrations and form a cesium chloride structure (Figure 4.25). This can be considered
as a simple cubic Bravais lattice with a two-point basis: Cu at (000) and Zn
at (a/2)(111). The first Brillouin zone of a simple cubic lattice is a cube, whose surface
is  intersected  by  a  free  electron  sphere  containing  three  electrons  per  unit  cell  (copper 
has  nominal  valence  one,  and  zinc,  two).27 

However,  use  of  the  free  electron  model  should  be  restricted  by  the  requirement 
that  all  constituents  be  simple  metals  (making  its  use  for  brass  dubious).  When  this 
is  not  the  case,  one  must  resort  to  the  methods  described  in  Chapter  11,  suitably 
generalized  to  cope  with  multiatomic  bases. 

In the disordered phase of brass, which is the stable one at sufficiently high temperatures,
even at compositions for which ordered geometries are possible (“stoichiometric”
compositions), the atoms lie at (or very close to) the sites of an abstract
Bravais lattice, and the lack of order lies in the random choice of which kind of atom occupies each site.
For example, proceeding along a (111) direction in ordered β-brass one would encounter
copper and zinc atoms in the sequence: Cu Zn Cu Zn Cu Zn Cu Zn . . . . In
the  disordered  phase,  a  typical  sequence  might  be  Cu  Zn  Zn  Zn  Cu  Zn  Cu  Cu  . . . . 

The  theoretical  study  of  disordered  alloys  is  an  altogether  more  difficult  matter. 
Because  of  the  random  assignment  of  atoms  to  sites  there  is  no  Bloch’s  theorem,  and 
without  the  quantum  number  k  one  is  at  a  loss  to  describe  any  electronic  properties 
at  all.  On  the  other  hand,  such  materials  are  clearly  metals,  frequently  well  described 
by simple Drude model calculations, and their heat capacities display the characteristic
electronic contribution we have learned to expect from metals.

A  striking  difference,  however,  between  disordered  alloys  and  pure  metals  is  that 
no  matter  how  pure  the  disordered  alloy  is  made,  the  substantial  decrease  in  resistance 
with  decreasing  temperature  characteristic  of  pure  metals  is  absent.  Thus  the  electrical 
resistance  of  the  purest  disordered  brass  at  liquid-helium  temperatures  has  only 
fallen to about half its room temperature value (as opposed to a drop by a factor of $10^{-4}$ in
carefully  prepared  ordered  metals).  This  phenomenon  makes  sense  if  we  view  one 
constituent  of  the  alloy  as  a  (highly  concentrated)  substitutional  impurity  in  the  lattice 
of  the  other,  for  the  impurity  scattering  is  then  a  major  (temperature-independent) 
source  of  resistance  at  all  temperatures.  In  very  pure  metals,  in  contrast,  impurity 
scattering  is  only  revealed  at  very  low  temperatures.  Alternatively,  one  can  simply 
note  that  in  a  disordered  alloy  the  periodicity  is  destroyed,  and  the  semiclassical 
analysis  leading  to  undegraded  currents  in  the  absence  of  scattering  mechanisms  is 
no  longer  valid. 

Thus  the  problem  of  the  electronic  structure  of  disordered  alloys  is  a  very  difficult 
one,  still  largely  unsolved,  and  of  considerable  current  interest. 


PROBLEMS 


1. Verify that in a crystal with an fcc monatomic Bravais lattice, the free electron Fermi sphere
for valence 1 reaches $(16/3\pi^2)^{1/6} = 0.903$ of the way from the origin to the zone face, in the [111]
direction.


27  See,  for  example,  J.  P.  Jan,  Can.  J.  Phys.  44,  1787  (1966). 
2. Using the general result given in Eq. (K.6), examine the reflectivity implied by the free electron
conductivity (1.29) when $\omega\tau \gg 1$. Show that the reflectivity is unity below the plasma frequency,
and that $r = [\omega_p^2/4\omega^2]^2$ when $\omega \gg \omega_p$.
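The two limits quoted in Problem 2 can be checked numerically before attempting the algebra. The sketch below is only illustrative: it assumes that Eq. (K.6), which is not reproduced here, amounts to the standard normal-incidence reflectivity $r = |(\sqrt{\epsilon} - 1)/(\sqrt{\epsilon} + 1)|^2$, and that for $\omega\tau \gg 1$ the free electron dielectric function is $\epsilon(\omega) = 1 - \omega_p^2/\omega^2$. The function name `reflectivity` is hypothetical.

```python
import cmath

def reflectivity(omega, omega_p=1.0):
    """Normal-incidence reflectivity for eps = 1 - (omega_p/omega)^2 (assumed form)."""
    eps = 1.0 - (omega_p / omega) ** 2
    n = cmath.sqrt(eps)              # purely imaginary when omega < omega_p
    return abs((n - 1) / (n + 1)) ** 2

# Below the plasma frequency the index is imaginary, so |n - 1| = |n + 1|.
print(reflectivity(0.5))

# Far above the plasma frequency, compare with (omega_p^2 / 4 omega^2)^2.
omega = 20.0
print(reflectivity(omega), (1.0 / (4.0 * omega**2)) ** 2)
```

The comparison at high frequency is approximate; the two numbers agree only up to corrections of relative order $\omega_p^2/\omega^2$.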

3.  Prove  that  energy  conservation  and  momentum  conservation  make  it  impossible  for  a  free 
electron to absorb a photon. (Note: If you use the nonrelativistic form $\mathcal{E} = p^2/2m$ for the electronic
energy, you will find that such absorption is possible, but only at an electronic energy so high
(on the scale of $mc^2$) that the nonrelativistic approximation is not valid. It is therefore necessary
to use the relativistic relation, $\mathcal{E} = (p^2c^2 + m^2c^4)^{1/2}$, to prove that the absorption is impossible
at  any  electronic  energy.) 

4. The first Brillouin zone for a crystal with an fcc Bravais lattice extends furthest from the
origin (Γ) at the point W where the square face and two hexagonal faces meet (Figure 15.4).
Show that the free electron Fermi sphere for valence 3 extends beyond that point (specifically,
$k_F/\Gamma W = (1296/125\pi^2)^{1/6} = 1.008$), so that the first Brillouin zone is entirely filled.
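The numerical values quoted in Problems 1 and 4 can be reproduced with a few lines of arithmetic. The sketch below is not a solution to the problems, only a check of the numbers. It assumes the usual coordinates of the fcc zone points in units of $2\pi/a$ (L at $(\tfrac12, \tfrac12, \tfrac12)$, W at $(1, \tfrac12, 0)$), four atoms per conventional cube of side $a$, and the free electron relation $k_F = (3\pi^2 n)^{1/3}$. The names `fermi_radius_ratio`, `L_POINT` and `W_POINT` are hypothetical.

```python
import math

def fermi_radius_ratio(valence, point):
    """k_F divided by the distance from Gamma to `point` for an fcc monatomic lattice.

    `point` is given in units of 2*pi/a, with a the side of the conventional cube.
    """
    electrons_per_cube = 4 * valence                       # 4 atoms per conventional cell
    kf_times_a = (3 * math.pi**2 * electrons_per_cube) ** (1 / 3)
    distance_times_a = 2 * math.pi * math.sqrt(sum(c * c for c in point))
    return kf_times_a / distance_times_a

L_POINT = (0.5, 0.5, 0.5)   # center of the hexagonal face, along [111]
W_POINT = (1.0, 0.5, 0.0)   # corner where a square face and two hexagonal faces meet

print(fermi_radius_ratio(1, L_POINT))   # Problem 1: compare with 0.903
print(fermi_radius_ratio(3, W_POINT))   # Problem 4: compare with 1.008
```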


5.  In  the  alkali  metals  the  free  electron  Fermi  sphere  is  entirely  contained  in  the  first  Brillouin 
zone,  and  the  weak  pseudopotential  produces  only  slight  deformations  of  this  sphere,  without 
altering  the  basic  topology  (in  contrast  to  the  noble  metals).  Various  features  of  these  deformations 
can  be  investigated  using  the  methods  of  Chapter  9. 

(a) In a weak periodic potential in the vicinity of a single Bragg plane, the two-plane-wave
approximation (pages 156-159) may be used. Let the wave vector k have polar angles $\theta$ and $\phi$,
with respect to the reciprocal lattice vector K associated with the Bragg plane. If $\mathcal{E} < (\hbar^2/2m)(K/2)^2$
and $U_K$ is sufficiently small, show that to order $U_K^2$ the surface of energy $\mathcal{E}$ is given by

$$k(\theta, \phi) = \sqrt{\frac{2m\mathcal{E}}{\hbar^2}}\,\bigl(1 + \delta(\theta)\bigr), \tag{15.10}$$

where

$$\delta(\theta) = \frac{m|U_K|^2/\mathcal{E}}{(\hbar K)^2 - 2\hbar K \cos\theta\,\sqrt{2m\mathcal{E}}}. \tag{15.11}$$

(b) Assuming that the result of the single Bragg plane approximation is valid through the
zone, show that the shift in Fermi energy due to the weak periodic potential is given by $\mathcal{E}_F - \mathcal{E}_F^0 = \gamma$,
where

$$\gamma = -\frac{1}{8}\,\frac{|U_K|^2}{\mathcal{E}_F}\left(\frac{2k_F}{K}\right)\ln\left|\frac{1 + 2k_F/K}{1 - 2k_F/K}\right|. \tag{15.12}$$

(Hint: Note that the Fermi energy satisfies $n = \int (d\mathbf{k}/4\pi^3)\,\theta(\mathcal{E}_F - \mathcal{E}_{\mathbf{k}})$, where $\theta$ is the step
function ($\theta(x) = 1$ for $x > 0$; $\theta(x) = 0$ for $x < 0$), and expand the $\theta$ function in $\gamma$.)

In the alkali metals the free electron Fermi sphere comes close to 12 Bragg planes, but since
it is never close to more than one at a time, the shift in Fermi energy is just that calculated
above, multiplied by 12.
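As a consistency check between the angular function of part (a) and the closed form of part (b), one can integrate $\delta(\theta)$ numerically. The sketch below works in units with $\hbar = m = 1$ and rests on an assumption that is not stated in the problem: holding the electron density fixed to first order in $\delta$ gives $\gamma = -\mathcal{E}_F \int_{-1}^{1} \delta\, d(\cos\theta)$. The function names and the sample values of $U_K$, $k_F$ and $K$ are hypothetical.

```python
import math

def gamma_closed_form(u, k_f, big_k):
    """Closed form of Eq. (15.12) in units with hbar = m = 1, so E_F = k_F^2 / 2."""
    e_f = 0.5 * k_f**2
    x = 2 * k_f / big_k
    return -0.125 * (u**2 / e_f) * x * math.log(abs((1 + x) / (1 - x)))

def gamma_numeric(u, k_f, big_k, steps=20000):
    """-E_F times the integral of delta over cos(theta), using the midpoint rule."""
    e_f = 0.5 * k_f**2
    h = 2.0 / steps
    total = 0.0
    for i in range(steps):
        c = -1.0 + (i + 0.5) * h                                    # cos(theta)
        total += (u**2 / e_f) / (big_k**2 - 2 * big_k * k_f * c)    # delta of Eq. (15.11)
    return -e_f * total * h

# Sample values with 2 k_F < K, so the Fermi sphere stays inside the Bragg plane.
print(gamma_closed_form(0.1, 1.0, 2.5), gamma_numeric(0.1, 1.0, 2.5))
```

Agreement of the two numbers supports the reconstruction of the formulas but does not replace the derivation asked for in the problem.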
The general semiclassical theory of conduction in Chapter 13 (as well as the arguments
in Chapters 1 and 2) described electronic collisions as random, uncorrelated events
that could be treated in a relaxation-time approximation. This approximation assumes
that the form of the nonequilibrium electronic distribution function has no
effect either on the rate at which a given electron experiences collisions or on the
distribution of electrons emerging from collisions.1

No  attempt  was  made  to  justify  these  assumptions.  They  are  only  used  because 
they  give  the  simplest  representation  of  the  fact  that  collisions  do  take  place  and  are 
ultimately  responsible  for  thermal  equilibrium.  Indeed,  in  detail  the  assumptions  are 
certainly  wrong.  The  rate  at  which  an  electron  collides  depends  critically  on  the 
distribution  of  the  other  electrons,  even  in  an  independent  electron  approximation, 
because  the  Pauli  exclusion  principle  allows  an  electron  to  be  scattered  only  into 
empty  electronic  levels.  Furthermore,  the  distribution  of  electrons  that  emerge  from 
collisions  does  depend  on  the  electronic  distribution  function,  not  only  because  the 
exclusion  principle  limits  the  available  final  levels,  but  also  because  the  net  output 
from collisions depends on the form of the input, which is determined by the distribution
function.

Thus considerable restraint must be exercised in drawing conclusions from calculations
based on the relaxation-time approximation. In general, such results can
be used with confidence only when details of the collision process are clearly of little
consequence. For example, the high-frequency ($\omega\tau \gg 1$) conductivity and the high-field
($\omega_c\tau \gg 1$) Hall coefficient are left unchanged by improvements in the relaxation-time
approximation, since they describe limiting cases in which the number of
collisions per cycle, or per period of revolution in the magnetic field, is vanishingly
small.

Results whose validity transcends the relaxation-time approximation usually involve
quantities that are independent of $\tau$. The high-field Hall coefficient, for example,
is $R_H = -1/nec$. However, one should not assume uncritically that all expressions in
which $\tau$ fails to appear are valid beyond the relaxation-time approximation. A
noteworthy counterexample is the Wiedemann-Franz law, which predicts the universal
value $(\pi^2/3)(k_B/e)^2 T$ for the ratio of thermal to electrical conductivities of a
metal, independent2 of the functional form of $\tau_n(\mathbf{k})$. Nevertheless, when the relaxation-time
approximation is not made, we shall see that the law holds only when the
energy of each electron is conserved in each collision.

When specific features of the collision process are of importance, the relaxation-time
approximation can still be highly informative, provided that the features are
gross properties rather than fine details. Thus in describing semiconductors one often
assigns different relaxation times to electrons and holes; i.e., one uses a $\tau$ that depends
on  the  band  index  but  not  on  the  wave  vector.  If  there  is  reason  to  believe  that 
scattering  processes  are  much  more  common  in  one  band  than  the  other,  then  such 
a  simplification  may  be  quite  valuable  in  working  out  the  general  implications  of 
such  a  disparity. 


1  See  pages  244  and  245. 

2  See  page  255. 
However, results that are sensitive to detailed features of the functional form of
$\tau_n(\mathbf{k})$ are to be viewed with suspicion. If, for example, one attempts to deduce $\tau(\mathbf{k})$ for
a given band from a set of data and a theory based on the relaxation-time approximation,
there is no reason to expect that different kinds of experimental determinations
of $\tau(\mathbf{k})$ will not yield entirely different functions. The relaxation-time approximation
overlooks  the  fact  that  the  nature  of  the  scattering  does  depend  on  the  nonequilibrium 
electronic  distribution  function,  which  in  general  will  differ  from  one  experimental 
situation  to  another. 

In this chapter we shall describe, and contrast with the relaxation-time approximation,
the more accurate description of collisions used (except in the cases noted
above)  for  anything  more  accurate  than  a  rough,  qualitative  description  of  conduction. 

SOURCES  OF  ELECTRONIC  SCATTERING 

How  electronic  collisions  are  to  be  described  depends  on  the  particular  collision 
mechanisms  of  importance.  We  have  already  noted3  the  gross  inadequacy  of  Drude’s 
idea  of  collisions  with  individual  ions.  According  to  the  Bloch  theory,  an  electron  in 
a perfectly periodic array of ions experiences no collisions at all. Within the independent
electron approximation collisions can only arise from deviations from perfect
periodicity.  These  fall  into  two  broad  categories: 

1.  Impurities  and  Crystal  Defects  Point  defects  (Chapter  30) — e.g.,  missing  ions  or 
an  occasional  ion  in  the  wrong  place — behave  in  much  the  same  way  as  impurities, 
providing  a  localized  scattering  center.  These  are  probably  the  simplest  kinds  of 
collision  mechanisms  to  describe.  In  addition,  however,  there  may  also  be  more 
extended  defects  in  which  the  lattice  periodicity  is  violated  along  a  line,  or  even 
over  an  entire  plane.4 

2. Intrinsic Deviations from Periodicity in a Perfect Crystal, due to Thermal Vibrations
of the Ions  Even in the absence of impurities or defects the ions do not
remain  rigidly  fixed  at  the  points  of  an  ideal  periodic  array,  for  they  possess  some 
kinetic  energy  which  increases  with  increasing  temperature  (Chapters  21-26). 
Below  the  melting  temperature  this  energy  is  rarely  enough  to  enable  the  ions  to 
stray  very  far  from  their  ideal  equilibrium  positions,  and  the  primary  effect  of 
the  thermal  energy  is  to  cause  the  ions  to  undergo  small  vibrations  about  these 
positions.  The  deviation  of  the  ionic  network  from  perfect  periodicity  due  to 
these  vibrations  is  the  most  important  source  of  temperature  dependence  in  the 
DC  resistivity  (Chapter  26),  and  is  usually  the  dominant  scattering  mechanism 
at  room  temperature.  As  the  temperature  declines,  the  amplitude  of  the  ionic 
vibrations  drops,  and  eventually  impurity  and  defect  scattering  dominates. 

Aside  from  the  scattering  mechanisms  due  to  deviations  from  perfect  periodicity, 
another source of scattering, neglected in the independent electron approximation,


3  See  page  141. 

4  In  this  category  we  might  include  surface  scattering,  which  becomes  important,  for  example,  in 
crystals  whose  dimensions  are  comparable  to  a  mean  free  path. 
arises from the interactions among the electrons. Electron-electron scattering5 plays
a  relatively  minor  role  in  the  theory  of  conduction  in  solids,  for  reasons  to  be  described 
in  Chapter  17.  At  high  temperatures  it  is  much  less  important  than  the  scattering 
by  thermal  vibrations  of  the  ions,  and  at  low  temperatures,  except  in  crystals  of 
exceptional  purity  and  perfection,  it  is  dominated  by  impurity  or  defect  scattering. 


SCATTERING  PROBABILITY  AND  RELAXATION  TIME 

Instead  of  making  the  relaxation-time  approximation,  more  realistic  descriptions  of 
collisions  assume  there  is  a  probability  per  unit  time  (to  be  determined  by  suitable 
microscopic  calculations)  that  an  electron  in  band  n  with  wave  vector  k  will,  as  a 
result of a collision, be scattered into band $n'$ with wave vector $\mathbf{k}'$. For simplicity we
limit our discussion to a single band,6 assuming that scattering occurs only within
this band ($n' = n$). We also assume that the electron’s spin is conserved in the
scattering.7 Finally, we assume that the collisions can be well localized in space and
time, so that the collisions occurring at r, t are determined by properties of the solid
in the immediate vicinity of r, t. Since all quantities affecting collisions at r, t will
then  be  evaluated  at  that  point,  to  keep  the  notation  simple  we  suppress  explicit 
reference  to  these  variables. 

The scattering probability is written in terms of a quantity $W_{\mathbf{k},\mathbf{k}'}$, defined as follows:
The probability in an infinitesimal time interval dt that an electron with wave vector
k is scattered into any one of the group of levels (with the same spin) contained in
the infinitesimal k-space volume element dk' about k', assuming that these levels
are all unoccupied (and therefore not forbidden by the exclusion principle) is

$$\frac{W_{\mathbf{k},\mathbf{k}'}\,dt\,d\mathbf{k}'}{(2\pi)^3}. \tag{16.1}$$


The particular form taken by $W_{\mathbf{k},\mathbf{k}'}$ depends on the particular scattering mechanism
being described. In general, W will have a quite complex structure, and may depend
on the distribution function g. We shall consider a particularly simple form W may
take below (Eq. (16.14)), but for the moment the points we wish to make depend only
on the existence of W and not on its detailed structure.

Given the quantity $W_{\mathbf{k},\mathbf{k}'}$, and given the electronic distribution function g, we can
explicitly construct the probability per unit time that an electron with wave vector
k will suffer some collision. This quantity is, by definition, just $1/\tau(\mathbf{k})$ (page 244), and
its structure reveals some of the limitations of the relaxation-time approximation.
Since $W_{\mathbf{k},\mathbf{k}'}\,d\mathbf{k}'/(2\pi)^3$ is the probability per unit time that an electron with wave vector
k will be scattered into the group of levels (with the same spin) contained in dk'
about k', given that these levels are all unoccupied, the actual rate of transition must


5  In  the  kinetic  theory  of  gases,  the  analogue  of  electron-electron  scattering  is  the  sole  source  of 
scattering,  aside  from  collisions  with  the  walls  of  the  container.  In  this  sense  the  electron  gas  in  a  metal 
is  quite  unlike  an  ordinary  gas. 

6  It  is  straightforward  to  generalize  the  discussion  to  cover  the  possibility  of  interband  scattering. 
All  of  the  points  we  wish  to  make,  however,  are  fully  illustrated  by  the  one-band  case. 

7 This is not the case when the scattering is due to magnetic impurities, and the lack of spin conservation
can lead to some striking effects (pages 687-688).
be reduced by the fraction of these levels that actually are unoccupied (since transitions
into occupied levels are forbidden by the exclusion principle). This fraction is just8
$1 - g(\mathbf{k}')$. The total probability per unit time for a collision is given by summing
over all final wave vectors $\mathbf{k}'$:

$$\frac{1}{\tau(\mathbf{k})} = \int \frac{d\mathbf{k}'}{(2\pi)^3}\, W_{\mathbf{k},\mathbf{k}'}\,[1 - g(\mathbf{k}')]. \tag{16.2}$$

It is evident from (16.2) that, in contrast to the relaxation-time approximation,
$\tau(\mathbf{k})$ is not a specified function of k, but depends on the particular form assumed by
the nonequilibrium distribution function g.
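The dependence of the collision rate on the distribution function can be made concrete with a discrete toy version of (16.2). The sketch below is illustrative only: the integral over $\mathbf{k}'$ is replaced by a sum over a handful of levels, `weight` stands in for $d\mathbf{k}'/(2\pi)^3$, and the uniform kernel `W` is an arbitrary choice rather than a model of any real scattering mechanism. All names are hypothetical.

```python
import numpy as np

def collision_rate(W, g, weight):
    """Discrete analogue of Eq. (16.2): 1/tau(k) = sum_k' weight * W[k, k'] * (1 - g[k'])."""
    return weight * (W @ (1.0 - g))

n_levels = 6
W = np.ones((n_levels, n_levels))     # toy scattering kernel, not a physical model
weight = 1.0 / n_levels               # stands in for dk'/(2 pi)^3

g_empty = np.zeros(n_levels)          # every final level available
g_half = np.full(n_levels, 0.5)       # half of each group of final levels blocked
g_full = np.ones(n_levels)            # every final level blocked by the exclusion principle

for g in (g_empty, g_half, g_full):
    print(collision_rate(W, g, weight)[0])
```

With the same kernel, the rate for a given level falls as the final levels fill up, which is the sense in which $\tau(\mathbf{k})$ is not a fixed function of k.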
It is convenient to represent the information in (16.2) in a slightly different way. We
define a quantity $(dg(\mathbf{k})/dt)_{\text{out}}$ so that the number of electrons per unit volume with
wave vectors in the infinitesimal volume element dk about k that suffer a collision
in the infinitesimal time interval dt is

$$-\left(\frac{dg(\mathbf{k})}{dt}\right)_{\text{out}} \frac{d\mathbf{k}}{(2\pi)^3}\,dt. \tag{16.3}$$


Since  dk  is  infinitesimal,  the  effect  of  any  collision  on  an  electron  in  the  volume 
element  is  to  remove  it  from  that  volume  element.  Thus  (16.3)  can  also  be  viewed  as 
the  number  of  electrons  scattered  out  of  the  volume  element  dk  about  k  in  the  time 
interval  dt. 

To evaluate $(dg(\mathbf{k})/dt)_{\text{out}}$, we simply note that because $dt/\tau(\mathbf{k})$ is the probability
that any electron in the neighborhood of k is scattered in the time interval dt, the
total number of electrons per unit volume in dk about k that suffer a collision is just
$dt/\tau(\mathbf{k})$ times the number of electrons per unit volume in dk about k, $g(\mathbf{k})\,d\mathbf{k}/(2\pi)^3$.
Comparing this with (16.3) we find that

$$\left(\frac{dg(\mathbf{k})}{dt}\right)_{\text{out}} = -\frac{g(\mathbf{k})}{\tau(\mathbf{k})} = -g(\mathbf{k}) \int \frac{d\mathbf{k}'}{(2\pi)^3}\, W_{\mathbf{k},\mathbf{k}'}\,[1 - g(\mathbf{k}')]. \tag{16.4}$$

This  is  not  the  only  way  in  which  the  distribution  function  is  affected  by  scattering: 
Electrons  not  only  scatter  out  of  the  level  k,  but  are  also  scattered  into  it  from  other 
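levels. We describe such processes in terms of a quantity $(dg(\mathbf{k})/dt)_{\text{in}}$, defined so that

$$\left(\frac{dg(\mathbf{k})}{dt}\right)_{\text{in}} \frac{d\mathbf{k}}{(2\pi)^3}\,dt \tag{16.5}$$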


8  The  contribution  to  the  electron  density  from  electrons  (with  specified  spin)  in  the  volume  element 
dk about k is (see page 244) $g(\mathbf{k})\,d\mathbf{k}/(2\pi)^3$. Since the maximum possible contribution to the density from
electrons (with the specified spin) in this volume element occurs when all levels are occupied, and is equal
to $d\mathbf{k}/(2\pi)^3$, $g(\mathbf{k})$ can also be interpreted as the fraction of levels in the volume element dk about k that
are occupied. Hence $1 - g(\mathbf{k})$ is the fraction that are unoccupied.
is the number of electrons per unit volume that have arrived in the volume element
dk about k, as the result of a collision in the infinitesimal time interval dt.

To evaluate $(dg(\mathbf{k})/dt)_{\text{in}}$, consider the contribution from those electrons that, just
prior to the collision, were in the volume element dk' about k'. The total number
of such electrons (with the specified spin) is $g(\mathbf{k}')\,d\mathbf{k}'/(2\pi)^3$. Of these, a fraction
$W_{\mathbf{k}',\mathbf{k}}\,dt\,d\mathbf{k}/(2\pi)^3$ would be scattered into dk about k, if all the levels in the volume
element dk were empty; since, however, only a fraction $1 - g(\mathbf{k})$ are empty, the first
fraction must be reduced by this factor. Therefore, the total number of electrons per
unit volume arriving in the volume element dk about k from the volume element
dk' about k' as the result of a collision in the time interval dt is

$$\left[g(\mathbf{k}')\,\frac{d\mathbf{k}'}{(2\pi)^3}\right]\left[W_{\mathbf{k}',\mathbf{k}}\,\frac{d\mathbf{k}}{(2\pi)^3}\,dt\right][1 - g(\mathbf{k})]. \tag{16.6}$$


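Summing this over all k', and comparing the result with (16.5), we find that

$$\left(\frac{dg(\mathbf{k})}{dt}\right)_{\text{in}} = [1 - g(\mathbf{k})] \int \frac{d\mathbf{k}'}{(2\pi)^3}\, W_{\mathbf{k}',\mathbf{k}}\, g(\mathbf{k}'), \tag{16.7}$$

which has the same structure as (16.4) except for the interchange of k and k'.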

It is instructive to compare these expressions with the corresponding quantities
in the relaxation-time approximation. The relaxation-time approximation for
$(dg(\mathbf{k})/dt)_{\text{out}}$ differs from (16.4) only in that the collision rate, $1/\tau(\mathbf{k})$, is a definite
specified function of k that does not (in contrast to (16.2)) depend on the distribution
function g. The disparity between the expression (16.7) for $(dg(\mathbf{k})/dt)_{\text{in}}$ and the corresponding
quantity in the relaxation-time approximation is more pronounced, for
the relaxation-time approximation asserts that the distribution of electrons emerging
from collisions in an interval dt is simply $dt/\tau(\mathbf{k})$ times the local equilibrium distribution
function $g^0(\mathbf{k})$ (see Eq. (13.3)). These results are summarized in Table 16.1.


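Table 16.1

A COMPARISON OF THE GENERAL TREATMENT OF COLLISIONS WITH
THE SIMPLIFICATIONS OF THE RELAXATION-TIME APPROXIMATION

$(dg(\mathbf{k})/dt)_{\text{out}}$
    RELAXATION-TIME APPROXIMATION:  $-\dfrac{g(\mathbf{k})}{\tau(\mathbf{k})}$
    GENERAL:  $-\displaystyle\int \frac{d\mathbf{k}'}{(2\pi)^3}\, W_{\mathbf{k},\mathbf{k}'}\,[1 - g(\mathbf{k}')]\, g(\mathbf{k})$

$(dg(\mathbf{k})/dt)_{\text{in}}$
    RELAXATION-TIME APPROXIMATION:  $\dfrac{g^0(\mathbf{k})}{\tau(\mathbf{k})}$
    GENERAL:  $\displaystyle\int \frac{d\mathbf{k}'}{(2\pi)^3}\, W_{\mathbf{k}',\mathbf{k}}\, g(\mathbf{k}')\,[1 - g(\mathbf{k})]$

Comments
    RELAXATION-TIME APPROXIMATION:  $\tau(\mathbf{k})$ is a specified function
    of k and does not depend on $g(\mathbf{k})$; $g^0(\mathbf{k})$ is the local
    equilibrium distribution function.
    GENERAL:  $W_{\mathbf{k},\mathbf{k}'}$ is a function of k and k', which
    in general may also depend on $g(\mathbf{k})$,
    or even on a second distribution function
    describing the local configuration
    of scatterers.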

It is convenient to define $(dg/dt)_{\text{coll}}$ as the total rate at which the distribution function
is changing due to collisions: $(dg(\mathbf{k})/dt)_{\text{coll}}\,dt\,d\mathbf{k}/(2\pi)^3$ is the change in the number
of electrons per unit volume with wave vectors in the volume element dk about k
in the time interval dt, due to all collisions. Since electrons can be scattered either
into or out of dk by collisions, $(dg/dt)_{\text{coll}}$ is simply the sum of $(dg/dt)_{\text{in}}$ and $(dg/dt)_{\text{out}}$:

$$\left(\frac{dg(\mathbf{k})}{dt}\right)_{\text{coll}} = -\int \frac{d\mathbf{k}'}{(2\pi)^3} \left\{ W_{\mathbf{k},\mathbf{k}'}\, g(\mathbf{k})[1 - g(\mathbf{k}')] - W_{\mathbf{k}',\mathbf{k}}\, g(\mathbf{k}')[1 - g(\mathbf{k})] \right\}. \tag{16.8}$$
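One property of the collision term (16.8) is easy to test on a discrete toy model: collisions move electrons between levels without creating or destroying them, so the term summed over all k should vanish for any kernel W. The sketch below is illustrative only; it replaces the integral by a sum over a finite set of levels, uses `weight` in place of $d\mathbf{k}'/(2\pi)^3$, and draws W and g at random. All names are hypothetical.

```python
import numpy as np

def collision_term(W, g, weight):
    """Discrete analogue of Eq. (16.8) for a kernel W[k, k'] and occupations g[k]."""
    out_rate = g * (W @ (1.0 - g))       # sum_k' W[k, k'] g[k] (1 - g[k'])
    in_rate = (1.0 - g) * (W.T @ g)      # sum_k' W[k', k] g[k'] (1 - g[k])
    return -weight * (out_rate - in_rate)

rng = np.random.default_rng(0)
n_levels = 40
W = rng.random((n_levels, n_levels))     # arbitrary, not necessarily symmetric
g = rng.random(n_levels)                 # occupation fractions between 0 and 1

dgdt = collision_term(W, g, weight=1.0 / n_levels)
print(abs(dgdt.sum()))                   # total number of electrons is unchanged
```

The out and in contributions cancel in the sum because each is the other with the labels k and k' exchanged.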


In the relaxation-time approximation (see Table 16.1) this simplifies to

$$\left(\frac{dg(\mathbf{k})}{dt}\right)_{\text{coll}} = -\frac{[g(\mathbf{k}) - g^0(\mathbf{k})]}{\tau(\mathbf{k})} \quad \text{(relaxation-time approximation)}. \tag{16.9}$$

DETERMINATION OF THE DISTRIBUTION FUNCTION:
THE BOLTZMANN EQUATION

When the relaxation-time approximation is abandoned, one cannot construct an
explicit representation of the nonequilibrium distribution function g in terms of
the solutions to the semiclassical equations of motion, as we did in Chapter 13 by
surveying all past times. One can, however, answer the more modest question of
how g is to be constructed at time t from its value an infinitesimal time dt earlier.

To do this we first ignore the possibility of collisions taking place between $t - dt$
and t, correcting for this omission later. If no collisions occurred, then the r and k
coordinates of every electron would evolve according to the semiclassical equations
of motion (12.6):

$$\dot{\mathbf{r}} = \mathbf{v}(\mathbf{k}), \qquad \hbar\dot{\mathbf{k}} = -e\left(\mathbf{E} + \frac{1}{c}\,\mathbf{v} \times \mathbf{H}\right) = \mathbf{F}(\mathbf{r}, \mathbf{k}). \tag{16.10}$$

Since dt is infinitesimal, we can find the explicit solution to these equations to linear
order in dt: an electron at r, k at time t must have been at $\mathbf{r} - \mathbf{v}(\mathbf{k})\,dt$, $\mathbf{k} - \mathbf{F}\,dt/\hbar$, at
time $t - dt$. In the absence of collisions this is the only point electrons at r, k can have
come from, and every electron at this point will reach the point r, k. Consequently,9

$$g(\mathbf{r}, \mathbf{k}, t) = g(\mathbf{r} - \mathbf{v}(\mathbf{k})\,dt,\ \mathbf{k} - \mathbf{F}\,dt/\hbar,\ t - dt). \tag{16.11}$$

To take collisions into account we must add two correction terms to (16.11). The
right-hand side is wrong because it assumes that all electrons get from $\mathbf{r} - \mathbf{v}\,dt$,
$\mathbf{k} - \mathbf{F}\,dt/\hbar$ to r, k in the time dt, ignoring the fact that some are deflected by collisions.
It is also wrong because it fails to count those electrons found at r, k at time t not
as a result of their unimpeded semiclassical motion since time $t - dt$, but as the result


9 In writing (16.11) we must appeal to Liouville’s theorem (Appendix H) that volumes in phase space
are preserved by the semiclassical equations of motion, for the argument only yields the result:

$$g(\mathbf{r}, \mathbf{k}, t)\,d\mathbf{r}(t)\,d\mathbf{k}(t) = g(\mathbf{r} - \mathbf{v}(\mathbf{k})\,dt,\ \mathbf{k} - \mathbf{F}\,dt/\hbar,\ t - dt)\,d\mathbf{r}(t - dt)\,d\mathbf{k}(t - dt),$$

which expresses the equality of the numbers of electrons in $d\mathbf{r}(t)\,d\mathbf{k}(t)$ and $d\mathbf{r}(t - dt)\,d\mathbf{k}(t - dt)$. Liouville’s
theorem is required to permit cancellation of the phase-space volume elements from both sides.
of a collision between $t - dt$ and t. Adding these corrections, we find to leading order
in dt:

$$\begin{aligned}
g(\mathbf{r}, \mathbf{k}, t) = {} & g(\mathbf{r} - \mathbf{v}(\mathbf{k})\,dt,\ \mathbf{k} - \mathbf{F}\,dt/\hbar,\ t - dt) && \text{(collisionless evolution)} \\
& + \left(\frac{dg(\mathbf{r}, \mathbf{k}, t)}{dt}\right)_{\text{out}} dt && \text{(correction: some fail to get there because of collisions)} \\
& + \left(\frac{dg(\mathbf{r}, \mathbf{k}, t)}{dt}\right)_{\text{in}} dt && \text{(correction: some get there only because of collisions)}
\end{aligned} \tag{16.12}$$


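If we expand the collisionless term on the right side to linear order in dt, then in the limit as $dt \to 0$, (16.12)
reduces to

$$\frac{\partial g}{\partial t} + \mathbf{v} \cdot \frac{\partial g}{\partial \mathbf{r}} + \mathbf{F} \cdot \frac{1}{\hbar}\frac{\partial g}{\partial \mathbf{k}} = \left(\frac{dg}{dt}\right)_{\text{coll}}. \tag{16.13}$$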


This  is  the  celebrated  Boltzmann  equation.  The  terms  on  the  left  side  are  often 
referred  to  as  the  drift  terms,  and  the  term  on  the  right  side  as  the  collision  term. 
When  the  form  (16.8)  is  used  for  the  collision  term,  the  Boltzmann  equation  in  general 
becomes  a  nonlinear  integrodifferential  equation.  This  equation  lies  at  the  heart  of 
the  theory  of  transport  in  solids.  Many  subtle  and  ingenious  techniques  have  been 
developed  for  extracting  information  about  the  distribution  function,  and  hence  the 
various  conductivities.10  We  shall  not  pursue  the  subject  further  here,  but  shall  refer 
to the Boltzmann equation only to the extent that it illuminates the limitations of
the  relaxation-time  approximation. 

If  we  replace  the  collision  term  in  the  Boltzmann  equation  by  the  relaxation-time 
approximation  (16.9),  the  equation  simplifies  to  a  linear  partial  differential  equation. 
It  can  be  shown  that  the  distribution  function  (13.17)  that  we  constructed  from  the 
relaxation-time  approximation  is  the  solution  to  this  equation  (as  must  be  the  case, 
since identical assumptions underlie both derivations). We emphasize this equivalence
because it is widespread practice to derive results such as those we found in
Chapter  13  not  directly  from  the  explicit  distribution  function  (13.17)  given  by  the 
relaxation-time  approximation,  but  by  the  apparently  quite  different  route  of  solving 
the  Boltzmann  equation  (16.13)  with  the  collision  term  given  by  the  relaxation-time 
approximation (16.9). The equivalence of the two approaches is illustrated in Problems
2  and  3,  where  some  typical  results  of  Chapter  13  are  rederived  from  the  Boltzmann 
equation  in  the  relaxation-time  approximation. 
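The linear equation obtained by combining the Boltzmann equation with the relaxation-time collision term can be integrated numerically in a deliberately simple setting. The sketch below is a toy under stated assumptions, not a calculation from the text: the system is spatially uniform and one-dimensional, the force F and the relaxation time $\tau$ are constants, $g^0$ is taken to be a Gaussian (a classical stand-in, not a Fermi function), and the equation $\partial g/\partial t + (F/\hbar)\,\partial g/\partial k = -(g - g^0)/\tau$ is stepped forward with an explicit upwind scheme that is valid only for a positive force. The function name and all parameter values are hypothetical.

```python
import numpy as np

def drift_wave_vector(force_over_hbar=0.5, tau=1.0, t_final=20.0):
    """Mean wave vector after relaxing toward steady state (requires force_over_hbar > 0)."""
    k = np.linspace(-10.0, 10.0, 401)
    dk = k[1] - k[0]
    g0 = np.exp(-0.5 * k**2)                 # toy equilibrium distribution (assumption)
    g0 /= g0.sum() * dk                      # normalize to unit density
    g = g0.copy()
    dt = 0.1 * dk / force_over_hbar          # small step, well inside the stability limit
    for _ in range(int(t_final / dt)):
        dg_dk = (g - np.roll(g, 1)) / dk     # upwind difference for a positive force
        g = g + dt * (-force_over_hbar * dg_dk - (g - g0) / tau)
    return (k * g).sum() / g.sum()

print(drift_wave_vector())                   # compare with force_over_hbar * tau
```

In this toy the mean wave vector settles at $F\tau/\hbar$, the same drift momentum $F\tau$ that the Drude picture gives, which is consistent with the equivalence of the two routes described above.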


IMPURITY  SCATTERING 

We  wish  to  compare  some  of  the  predictions  of  the  relaxation-time  approximation 
with those implied by the more accurate collision term (16.8). When we require a
specific form for the collision probability $W_{\mathbf{k},\mathbf{k}'}$, we shall specialize to the case of


10  A  very  thorough  survey  is  given  in  J.  M.  Ziman,  Electrons  and  Phonons,  Oxford,  1960.  There 
is also a remarkable series of review papers by I. M. Lifshitz and M. I. Kaganov, Sov. Phys. Usp.
2, 831 (1960), 5, 878 (1963), 8, 805 (1966). [Usp. Fiz. Nauk 69, 419 (1959), 78, 411 (1962), 87, 389 (1965).]
greatest analytic simplicity: elastic scattering by fixed substitutional impurities,
located at random lattice sites throughout the crystal. This is not an artificial case,
since  scattering  by  the  thermal  vibrations  of  the  ions  (Chapter  26)  and  electron- 
electron  scattering  (Chapter  17)  become  increasingly  weak  as  the  temperature  drops, 
whereas  neither  the  concentration  of  impurities  nor  the  electron-impurity  interaction 
shows  any  striking  temperature  dependence.  Thus  at  sufficiently  low  temperatures 
impurity  scattering  will  be  the  dominant  source  of  collisions  in  any  real  specimen. 
This  scattering  will  be  elastic  provided  that  the  energy  gap  between  the  impurity 
ground  state  and  lowest  excited  state  (typically  of  the  order  of  electron  volts)  is  large 
compared with $k_B T$. This will ensure (a) that there are very few excited impurity
ions  to  give  up  energy  to  electrons  in  collisions  and  (b)  that  there  are  very  few  empty 
electronic  levels  that  lie  low  enough  in  energy  to  receive  an  electron  after  it  has  lost 
enough  energy  to  excite  the  impurity  ion  out  of  its  ground  state. 
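To get a feeling for how well this condition is met, the following minimal sketch evaluates the Boltzmann factor exp(−Δ/k_B T) for an assumed excitation gap Δ = 1 eV. The gap value is only an illustration of "of the order of electron volts", and the function name is hypothetical.

```python
import math

K_B_EV_PER_K = 8.617333e-5  # Boltzmann constant in eV/K

def excited_fraction(gap_ev, temperature_k):
    """Boltzmann factor exp(-gap / k_B T): a rough estimate of the relative
    thermal population of an impurity level lying gap_ev above the ground state."""
    return math.exp(-gap_ev / (K_B_EV_PER_K * temperature_k))

# Assumed illustrative gap of 1 eV, evaluated at a few temperatures.
for t in (300.0, 77.0, 4.2):
    print(f"T = {t:6.1f} K  k_B T = {K_B_EV_PER_K * t:.2e} eV  "
          f"factor = {excited_fraction(1.0, t):.3e}")
```

Even at room temperature the factor is many orders of magnitude below unity, so under this assumption excited impurity ions are extremely rare.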

If  the  impurities  are  sufficiently  dilute11  and  the  potential  U(r)  describing  the 
interaction  between  an  electron  and  a  single  impurity  at  the  origin  is  sufficiently 
weak,  then  it  can  be  shown  that 

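$$ W_{\mathbf{k},\mathbf{k}'} = \frac{2\pi}{\hbar}\, n_i\, \delta\big(\mathcal{E}(\mathbf{k}) - \mathcal{E}(\mathbf{k}')\big)\, \big|\langle \mathbf{k}|U|\mathbf{k}'\rangle\big|^2, \tag{16.14} $$

where $n_i$ is the number of impurities per unit volume,

$$ \langle \mathbf{k}|U|\mathbf{k}'\rangle = \int d\mathbf{r}\, \psi^*_{n\mathbf{k}}(\mathbf{r})\, U(\mathbf{r})\, \psi_{n\mathbf{k}'}(\mathbf{r}), \tag{16.15} $$

and the Bloch functions are taken to be normalized so that

$$ \int_{\text{cell}} d\mathbf{r}\, |\psi_{n\mathbf{k}}(\mathbf{r})|^2 = v_{\text{cell}}. \tag{16.16} $$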

Equation  (16.14)  can  be  derived  by  applying  the  “Golden  Rule”  of  first-order 
time-dependent  perturbation  theory12  to  the  scattering  of  a  Bloch  electron  by  each 
of  the  impurities.  It  is  considerably  harder  to  construct  a  more  fundamental  derivation 
by starting with the basic Hamiltonian for all the electrons and impurities, and
deriving  the  entire  Boltzmann  equation  with  a  collision  term  given  by  (16.8)  and 
(16.14).13 

We  shall  not  pursue  the  derivation  of  (16.14)  here,  since  we  shall  only  exploit  a 
few  very  general  properties  of  the  result: 

1. Because of the delta function in (16.14), $W_{\mathbf{k},\mathbf{k}'} = 0$ unless $\mathcal{E}(\mathbf{k}) = \mathcal{E}(\mathbf{k}')$; the scattering
is explicitly elastic.

2. $W_{\mathbf{k},\mathbf{k}'}$ is independent of the electronic distribution function g. This is a consequence
of the independent electron approximation: The way in which an electron


11  They  must  be  dilute  enough  that  one  can  treat  the  electrons  as  interacting  with  one  impurity  at 
a  time. 

12 See, for example, L. D. Landau and E. M. Lifshitz, Quantum Mechanics, Addison-Wesley, Reading,
Mass.,  1965,  Eq.  (43.1). 

13  One  of  the  earliest  full  analyses  was  given  by  J.  M.  Luttinger  and  W.  Kohn,  Phys.  Rev.  108,  590 
(1957)  and  109, 1892  (1958). 
interacts with an impurity is, aside from the restrictions imposed by the exclusion
principle, independent of the other electrons. This is the major simplifying feature
of impurity scattering. In electron-electron scattering, for example, $W_{\mathbf{k},\mathbf{k}'}$ depends on
the  distribution  function  g,  since  over  and  beyond  the  simple  restrictions  imposed 
by  the  exclusion  principle,  the  probability  of  an  electron  scattering  depends  on  what 
other  electrons  are  available  for  it  to  interact  with.  The  description  of  the  scattering 
by  the  thermal  vibrations  of  the  ions  is  also  more  complicated  since  W  will  depend 
on  the  properties  of  the  system  of  ions,  which  may  be  quite  complex. 

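3. W has the symmetry

$$ W_{\mathbf{k},\mathbf{k}'} = W_{\mathbf{k}',\mathbf{k}}, \tag{16.17} $$

which follows from the fact that U is Hermitian ($\langle \mathbf{k}|U|\mathbf{k}'\rangle = \langle \mathbf{k}'|U|\mathbf{k}\rangle^*$). It can be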
shown  that  this  symmetry  does  not  depend  on  the  impurity-electron  interaction  being 
weak,  but  follows  generally  provided  only  that  the  crystal  and  impurity  potentials 
are  real  and  invariant  under  spatial  inversions.  The  symmetry  (16.17)  is  usually  called 
“detailed  balancing”.  For  more  general  scattering  mechanisms  there  are  related,  but 
more  complex  symmetries,  which  are  of  great  importance  in  studies  of  the  approach 
to  thermodynamic  equilibrium. 

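The symmetry (16.17) simplifies the collision term (16.8) to

$$ \left(\frac{dg(\mathbf{k})}{dt}\right)_{\text{coll}} = -\int \frac{d\mathbf{k}'}{(2\pi)^3}\, W_{\mathbf{k},\mathbf{k}'}\,\big[g(\mathbf{k}) - g(\mathbf{k}')\big]. \tag{16.18} $$

Note that the quadratic terms in g appearing in (16.8) as a consequence of the exclusion
principle cancel identically as a result of the symmetry (16.17).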
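
The cancellation can be made explicit in one line. The sketch below assumes the usual form of the collision term (16.8), quoted here rather than rederived, in which each process carries a factor $g$ for the occupied initial level and $1 - g$ for the empty final level:

$$ \left(\frac{dg(\mathbf{k})}{dt}\right)_{\text{coll}} = -\int \frac{d\mathbf{k}'}{(2\pi)^3}\Big\{ W_{\mathbf{k},\mathbf{k}'}\, g(\mathbf{k})\,[1 - g(\mathbf{k}')] - W_{\mathbf{k}',\mathbf{k}}\, g(\mathbf{k}')\,[1 - g(\mathbf{k})] \Big\}. $$

Setting $W_{\mathbf{k}',\mathbf{k}} = W_{\mathbf{k},\mathbf{k}'}$, the integrand becomes

$$ W_{\mathbf{k},\mathbf{k}'}\Big\{ g(\mathbf{k}) - g(\mathbf{k})g(\mathbf{k}') - g(\mathbf{k}') + g(\mathbf{k}')g(\mathbf{k}) \Big\} = W_{\mathbf{k},\mathbf{k}'}\,\big[g(\mathbf{k}) - g(\mathbf{k}')\big], $$

so the two products $g(\mathbf{k})g(\mathbf{k}')$ cancel and only the linear difference survives.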

In  the  remainder  of  this  chapter  we  describe  some  typical  problems  whose  proper 
formulation requires a more accurate description of collisions than the relaxation-time
approximation can provide.


THE  WIEDEMANN-FRANZ  LAW 

The  derivation  of  the  Wiedemann-Franz  law  in  Chapter  13  appeared  to  be  quite 
general. When the problem is reexamined without using the relaxation-time approximation,
however, it can be shown that the law holds only when the energy of each
electron is conserved in each collision. The corresponding mathematical requirement
is that for any function $g(\mathbf{k})$ the scattering probability $W_{\mathbf{k},\mathbf{k}'}$ be such that

$$ \int d\mathbf{k}'\, W_{\mathbf{k},\mathbf{k}'}\, \mathcal{E}(\mathbf{k}')\, g(\mathbf{k}') = \mathcal{E}(\mathbf{k}) \int d\mathbf{k}'\, W_{\mathbf{k},\mathbf{k}'}\, g(\mathbf{k}'). \tag{16.19} $$

This is clearly satisfied when W has the energy-conserving form (16.14), but will not
hold if $W_{\mathbf{k},\mathbf{k}'}$ can be nonzero for values of k and k' with $\mathcal{E}(\mathbf{k}) \neq \mathcal{E}(\mathbf{k}')$.
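
That the energy-conserving form satisfies this condition can be checked directly. Write $W_{\mathbf{k},\mathbf{k}'} = w(\mathbf{k},\mathbf{k}')\,\delta\big(\mathcal{E}(\mathbf{k}) - \mathcal{E}(\mathbf{k}')\big)$, where $w$ is a shorthand introduced only for this check and stands for the remaining factors in (16.14). The delta function then allows $\mathcal{E}(\mathbf{k}')$ to be replaced by $\mathcal{E}(\mathbf{k})$ inside the integral:

$$ \int d\mathbf{k}'\, w(\mathbf{k},\mathbf{k}')\,\delta\big(\mathcal{E}(\mathbf{k}) - \mathcal{E}(\mathbf{k}')\big)\,\mathcal{E}(\mathbf{k}')\, g(\mathbf{k}') = \mathcal{E}(\mathbf{k}) \int d\mathbf{k}'\, w(\mathbf{k},\mathbf{k}')\,\delta\big(\mathcal{E}(\mathbf{k}) - \mathcal{E}(\mathbf{k}')\big)\, g(\mathbf{k}'). $$

If W has any weight at $\mathcal{E}(\mathbf{k}') \neq \mathcal{E}(\mathbf{k})$, this replacement is no longer allowed for arbitrary g.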

It  would  take  us  too  far  afield  to  demonstrate  analytically  that  the  condition  of 
elastic  scattering  (16.19)  is  enough  to  insure  the  Wiedemann-Franz  law.  However, 
the  physical  reason  for  this  is  not  hard  to  understand.  Since  the  charge  of  each 
electron is permanently $-e$, the only way collisions can degrade an electric current
is by changing the velocity of each electron. However, in a thermal current (Eq. (13.42))
the charge is replaced by $(\mathcal{E} - \mu)/T$. Therefore if energy is conserved in each collision
(as charge certainly is), then thermal currents will be degraded in precisely the same
manner and to the same extent as electric currents. If, however, collisions do not
conserve the energy $\mathcal{E}$ of each electron, then a second mechanism becomes available
for the degradation of a thermal current that has no electrical analogue: Collisions
can alter the electron’s energy $\mathcal{E}$ as well as its velocity. Since such inelastic collisions
will have a substantially different effect on thermal and electric currents,
there  is  no  longer  any  reason  to  expect  a  simple  relation  to  hold  between  electrical 
and  thermal  conductivities.14 

Evidently  the  Wiedemann-Franz  law  will  hold  to  a  good  approximation,  if  energy 
is  conserved  to  a  good  approximation.  The  crucial  requirement  is  that  the  change 
in  energy  of  each  electron  in  a  collision  should  be  small  compared  with  kBT.  It  turns 
out  that  the  scattering  by  thermal  vibrations  of  the  ions  can  satisfy  this  condition  at 
high  temperatures.  Since  such  scattering  is  the  dominant  high-temperature  source  of 
collisions,  the  Wiedemann-Franz  law  is  generally  well  obeyed  at  both  high  and  low15 
temperatures.  However,  in  the  intermediate  temperature  range  (roughly  ten  to  a  few 
hundred  degrees  K),  where  inelastic  collisions  are  both  prevalent  and  capable  of 
producing  electronic  energy  losses  of  order  kBT,  one  expects  and  observes  failures 
of  the  Wiedemann-Franz  law. 


MATTHIESSEN’S RULE

Suppose there are two physically distinguishable sources of scattering (for example,
scattering  by  impurities  and  scattering  by  other  electrons).  If  the  presence  of  one 
mechanism  does  not  alter  the  way  in  which  the  other  mechanism  functions,  then  the 
total  collision  rate  W  will  be  given  by  the  sums  of  the  collision  rates  due  to  the  separate 
mechanisms: 

$$ W = W^{(1)} + W^{(2)}. \tag{16.20} $$

In the relaxation-time approximation this immediately implies that

$$ \frac{1}{\tau} = \frac{1}{\tau^{(1)}} + \frac{1}{\tau^{(2)}}. \tag{16.21} $$

If, in addition, we assume a k-independent relaxation time for each mechanism, then,
since the resistivity is proportional to $1/\tau$, we will have

$$ \rho = \frac{m}{ne^2\tau} = \frac{m}{ne^2\tau^{(1)}} + \frac{m}{ne^2\tau^{(2)}} = \rho^{(1)} + \rho^{(2)}. \tag{16.22} $$

This asserts that the resistivity in the presence of several distinct scattering mechanisms
is simply the sum of the resistivities one would have if each alone were present.


14 One sometimes encounters the assertion that the Wiedemann-Franz law fails because the relaxation
time for thermal currents is different from the relaxation time for electric currents. This is, at best,
a misleading oversimplification. The Wiedemann-Franz law fails, if inelastic scattering is present, because
there are scattering processes that can degrade a thermal current without degrading an electric current.
The failure is due not to the comparative rates at which electrons experience collisions, but to the comparative
effectiveness of each single collision in degrading the two kinds of currents.

15  At  low  temperatures,  as  we  have  pointed  out,  the  dominant  source  of  collisions  is  elastic  impurity 
scattering. 
This proposition is known as Matthiessen’s rule. At first glance its utility might
seem  questionable,  since  it  is  difficult  to  imagine  how  one  might  remove  a  source  of 
scattering,  keeping  all  other  things  constant.  However,  it  does  make  certain  general 
assertions  of  principle  that  are  easily  tested.  For  example,  elastic  impurity  scattering 
should proceed at a temperature-independent rate (since neither the number of impurities
nor their interaction with electrons is appreciably affected by the temperature),
but the electron-electron scattering rate should go as $T^2$ (in the simplest theories: see
Chapter 17). Thus Matthiessen’s rule predicts a resistivity of the form $\rho = A + BT^2$
with  temperature-independent  coefficients  A  and  B,  if  impurity  and  electron-electron 
scattering  are  the  dominant  mechanisms. 

It is not hard to verify that Matthiessen’s rule breaks down even in the relaxation-time
approximation, if $\tau$ depends on k. For the conductivity $\sigma$ is then proportional
to some average, $\bar{\tau}$, of the relaxation time (see, for example, Eq. (13.25)). Thus the
resistivity, $\rho$, is proportional to $1/\bar{\tau}$, and Matthiessen’s rule requires that

$$ \frac{1}{\bar{\tau}} = \frac{1}{\bar{\tau}^{(1)}} + \frac{1}{\bar{\tau}^{(2)}}. \tag{16.23} $$

However, Eq. (16.21) gives only relations such as

$$ \overline{\left(\frac{1}{\tau}\right)} = \overline{\left(\frac{1}{\tau^{(1)}}\right)} + \overline{\left(\frac{1}{\tau^{(2)}}\right)}, \tag{16.24} $$

which are not equivalent to (16.23) unless $\tau^{(1)}$ and $\tau^{(2)}$ are independent of k.
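
The difference between averaging $\tau$ and averaging $1/\tau$ is easy to see numerically. The sketch below is an illustration only: it assumes a plain weighted mean over a handful of sampled k-points as a stand-in for the average that defines $\bar{\tau}$, uses made-up relaxation times, and works in units where $\rho = 1/\bar{\tau}$. The function names are hypothetical.

```python
def weighted_mean(values, weights):
    """Weighted average used as a stand-in for the average defining tau-bar."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)

def resistivities(tau1, tau2, weights):
    """Return (rho, rho1, rho2) in units where rho = 1 / tau_bar.

    tau1, tau2: relaxation times of the two mechanisms sampled on a set of
    k-points; the combined time obeys 1/tau = 1/tau1 + 1/tau2 at each k.
    """
    tau = [1.0 / (1.0 / a + 1.0 / b) for a, b in zip(tau1, tau2)]
    rho = 1.0 / weighted_mean(tau, weights)
    rho1 = 1.0 / weighted_mean(tau1, weights)
    rho2 = 1.0 / weighted_mean(tau2, weights)
    return rho, rho1, rho2

weights = [1.0, 1.0, 1.0, 1.0]

# k-independent relaxation times: the resistivities add.
rho, rho1, rho2 = resistivities([2.0] * 4, [3.0] * 4, weights)
print(rho, rho1 + rho2)

# k-dependent relaxation times (made-up numbers): they no longer add.
rho, rho1, rho2 = resistivities([1.0, 2.0, 4.0, 8.0], [8.0, 4.0, 2.0, 1.0], weights)
print(rho, rho1 + rho2)
```

In the second case the two mechanisms are strong at different k-points, and the combined resistivity comes out larger than the sum of the separate ones.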

A  more  realistic  picture  of  collisions  casts  even  graver  doubts  on  the  general 
validity  of  Matthiessen’s  rule,  for  the  assumption  that  the  scattering  rate  due  to  one 
mechanism  is  independent  of  the  presence  of  the  second  becomes  much  less  plausible 
as  soon  as  the  assumptions  of  the  relaxation-time  approximation  are  dropped.  The 
actual  rate  at  which  an  electron  experiences  collisions  depends  on  the  configuration 
of the other electrons, and this can be strongly affected by the presence of two competing
scattering mechanisms unless, by some good fortune, it happens that the
distribution  function  in  the  presence  of  each  separate  scattering  mechanism  is  the 
same. 

It  can,  however,  be  shown  without  making  the  relaxation-time  approximation, 
that  Matthiessen’s  rule  holds  as  an  inequality:16 

$$ \rho \geq \rho^{(1)} + \rho^{(2)}. \tag{16.25} $$

Quantitative  analytic  studies  of  the  extent  to  which  Matthiessen’s  rule  fails  are 
quite  complex.  The  rule  is  certainly  valuable  as  a  crude  guide  to  what  to  expect,  but 
one must always bear in mind the possibility of gross failures, a possibility that is
obscured  by  the  naive  relaxation-time  approximation. 

SCATTERING  IN  ISOTROPIC  MATERIALS 

It  is  sometimes  asserted  that  the  relaxation-time  approximation  can  be  justified  in 
isotropic  systems.  This  is  an  interesting  and  useful  observation,  but  one  must  be  aware 


16 See, for example, J. M. Ziman, Electrons and Phonons, Oxford, 1960, p. 286, and also Problem 4
below.
of its limitations. The point arises in the description of elastic impurity scattering in
an  isotropic  metal.  The  two  crucial  conditions17  are: 

(a) The energy $\mathcal{E}(\mathbf{k})$ must depend only on the magnitude $k$ of the vector $\mathbf{k}$.

(b) The probability of scattering between two levels $\mathbf{k}$ and $\mathbf{k}'$ must vanish unless
$k = k'$ (i.e., it must be elastic), and must depend only on the common value of
their energies and on the angle between $\mathbf{k}$ and $\mathbf{k}'$.

If  condition  (a)  holds,  then  in  the  relaxation-time  approximation  the  nonequilibrium 
distribution  function  in  the  presence  of  a  static,  spatially  uniform  electric  field  and 
temperature  gradient,  Eq.  (13.43),  has  the  general  form18 

$$ g(\mathbf{k}) = g^0(\mathbf{k}) + \mathbf{a}(\mathcal{E})\cdot\mathbf{k}, \tag{16.26} $$

where the vector function $\mathbf{a}$ depends on $\mathbf{k}$ only through its magnitude, i.e., only
through $\mathcal{E}(\mathbf{k})$, and $g^0(\mathbf{k})$ is the local equilibrium distribution function. When the
scattering is elastic impurity scattering and conditions (a) and (b) hold, one can show
that if the solution to the Boltzmann equation in the relaxation-time approximation
has the form (16.26),19 then it is also a solution to the full Boltzmann equation.

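To demonstrate this it suffices to show that the more correct Eq. (16.18) for
$(dg/dt)_{\text{coll}}$ reduces to the form (16.9) assumed in the relaxation-time approximation,
whenever the distribution function g has the form (16.26). We must therefore show
that it is possible to find a function $\tau(\mathbf{k})$ that does not depend on the distribution
function g, such that whenever g has the form (16.26) and the scattering is elastic
isotropic impurity scattering, then

$$ \int \frac{d\mathbf{k}'}{(2\pi)^3}\, W_{\mathbf{k},\mathbf{k}'}\,\big[g(\mathbf{k}) - g(\mathbf{k}')\big] = \frac{1}{\tau(\mathbf{k})}\,\big[g(\mathbf{k}) - g^0(\mathbf{k})\big]. \tag{16.27} $$

If we substitute the distribution function (16.26) into (16.27) and note that for
elastic scattering $W_{\mathbf{k},\mathbf{k}'}$ vanishes unless $\mathcal{E}(\mathbf{k}) = \mathcal{E}(\mathbf{k}')$, then the vector $\mathbf{a}(\mathcal{E}')$ can be replaced
by $\mathbf{a}(\mathcal{E})$ and removed from the integral, reducing the condition (16.27) to20

$$ \mathbf{a}(\mathcal{E})\cdot\int \frac{d\mathbf{k}'}{(2\pi)^3}\, W_{\mathbf{k},\mathbf{k}'}\,(\mathbf{k} - \mathbf{k}') = \frac{1}{\tau(\mathbf{k})}\,\mathbf{a}(\mathcal{E})\cdot\mathbf{k}. \tag{16.28} $$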


17 A more detailed analysis reveals that these requirements need hold only for levels within $O(k_B T)$
of the Fermi surface. This is because the final form of the distribution function differs from the local
equilibrium form only within this energy range; see, for example, Eq. (13.43). As a result, the analysis that
follows  may  apply  not  only  to  the  ideal  free  electron  gas,  but  also  to  the  alkali  metals,  whose  Fermi  surfaces 
are  remarkably  spherical,  provided  that  the  scattering  is  sufficiently  isotropic  near  the  Fermi  energy. 

18  This  form  also  holds  for  time-dependent  perturbations,  and  in  the  presence  of  spatially  uniform 
static  magnetic  fields.  If,  however,  any  of  the  external  fields  or  temperature  gradients  depend  on  position, 
then  (16.26)  does  not  hold,  and  the  conclusion  that  the  solution  to  the  Boltzmann  equation  has  the  form 
given  by  the  relaxation-time  approximation  can  no  longer  be  drawn. 

19 The distribution function constructed in Chapter 13 is a solution to the Boltzmann equation in
the relaxation-time approximation, as pointed out on page 320.

20 Note that the equilibrium distribution $g^0$ depends on k only through $\mathcal{E}(\mathbf{k})$, and therefore drops
out  of  (16.28)  when  the  scattering  is  elastic. 
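We next resolve the vector $\mathbf{k}'$ into its components parallel and perpendicular to $\mathbf{k}$:

$$ \mathbf{k}' = \mathbf{k}'_\parallel + \mathbf{k}'_\perp = (\hat{\mathbf{k}}\cdot\mathbf{k}')\,\hat{\mathbf{k}} + \mathbf{k}'_\perp. \tag{16.29} $$

Because the scattering is elastic and $W_{\mathbf{k},\mathbf{k}'}$ depends only on the angle between $\mathbf{k}$ and
$\mathbf{k}'$, $W_{\mathbf{k},\mathbf{k}'}$ cannot depend on $\mathbf{k}'_\perp$, and therefore $\int d\mathbf{k}'\, W_{\mathbf{k},\mathbf{k}'}\,\mathbf{k}'_\perp$ must vanish. Consequently

$$ \int d\mathbf{k}'\, W_{\mathbf{k},\mathbf{k}'}\,\mathbf{k}' = \int d\mathbf{k}'\, W_{\mathbf{k},\mathbf{k}'}\,\mathbf{k}'_\parallel = \hat{\mathbf{k}} \int d\mathbf{k}'\, W_{\mathbf{k},\mathbf{k}'}\,(\hat{\mathbf{k}}\cdot\hat{\mathbf{k}}')\,k'. \tag{16.30} $$

Finally, because $W_{\mathbf{k},\mathbf{k}'}$ vanishes unless the magnitudes of $\mathbf{k}$ and $\mathbf{k}'$ are equal, the factor
$k'$ in the last integrand in (16.30) can be replaced by $k$ and brought outside the integral,
where it can be combined with the unit vector $\hat{\mathbf{k}}$ to yield the vector $\mathbf{k}$:

$$ \int d\mathbf{k}'\, W_{\mathbf{k},\mathbf{k}'}\,\mathbf{k}' = \mathbf{k} \int d\mathbf{k}'\, W_{\mathbf{k},\mathbf{k}'}\,(\hat{\mathbf{k}}\cdot\hat{\mathbf{k}}'). \tag{16.31} $$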


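It follows from the identity (16.31) that the left side of Eq. (16.28) does indeed have
the same form as the right side, provided that $\tau(\mathbf{k})$ is defined by21

$$ \frac{1}{\tau(\mathbf{k})} = \int \frac{d\mathbf{k}'}{(2\pi)^3}\, W_{\mathbf{k},\mathbf{k}'}\,(1 - \hat{\mathbf{k}}\cdot\hat{\mathbf{k}}'). \tag{16.32} $$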


Thus the relaxation-time approximation (with a relaxation time given by (16.32))
provides  the  same  description  as  the  full  Boltzmann  equation,  when  applied  to 
spatially  homogeneous  disturbances  in  an  isotropic  metal  with  isotropic  elastic 
impurity  scattering. 

Note that the relaxation time given by (16.32) is a weighted average of the collision
probability in which forward scattering ($\mathbf{k} = \mathbf{k}'$) receives very little weight. If $\theta$ is the
angle between $\mathbf{k}$ and $\mathbf{k}'$, then $1 - \hat{\mathbf{k}}\cdot\hat{\mathbf{k}}' = 1 - \cos\theta \approx \theta^2/2$ for small angles. It is
not unreasonable that small-angle scattering should make a very small contribution
to the effective collision rate. If collisions were only in the forward direction (i.e., if
$W_{\mathbf{k},\mathbf{k}'}$ vanished unless $\mathbf{k} = \mathbf{k}'$) then they would have no consequences at all. When the
possible  changes  in  wave  vector  are  not  zero,  but  very  small,  then  the  distribution  of 
electronic  wave  vectors  will  be  only  slightly  affected  by  collisions.  A  single  collision, 
for  example,  would  certainly  not  obliterate  all  traces  of  the  fields  that  accelerated  the 
electron,  as  maintained  by  the  relaxation-time  approximation,  and  therefore  the 
effective  inverse  relaxation  time  (16.32)  is  very  much  smaller  than  the  actual  collision 
rate  (16.2)  when  the  scattering  is  predominantly  forward  scattering. 
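
The suppression of forward scattering by the weight $1 - \cos\theta$ can be illustrated with a short numerical sketch. It integrates an assumed angular profile over the unit sphere, once without and once with the weight appearing in (16.32). The Gaussian forward-peaked profile and its width are hypothetical choices, and radial factors and constant prefactors are left out.

```python
import math

def angular_rates(w, n=20000):
    """Integrate an angular scattering profile w(theta) over the unit sphere.

    Returns (total, transport): the total rate, proportional to the integral
    of w, and the transport rate, weighted by (1 - cos theta) as in (16.32).
    Radial factors and constant prefactors are omitted.
    """
    total = 0.0
    transport = 0.0
    d_theta = math.pi / n
    for i in range(n):
        theta = (i + 0.5) * d_theta          # midpoint rule
        d_omega = 2.0 * math.pi * math.sin(theta) * d_theta
        total += w(theta) * d_omega
        transport += w(theta) * (1.0 - math.cos(theta)) * d_omega
    return total, transport

def isotropic(theta):
    """Angle-independent scattering profile."""
    return 1.0

def forward_peaked(theta, width=0.1):
    """Hypothetical Gaussian profile concentrated at small angles."""
    return math.exp(-0.5 * (theta / width) ** 2)

for name, w in (("isotropic", isotropic), ("forward-peaked", forward_peaked)):
    total, transport = angular_rates(w)
    print(f"{name:15s} transport/total = {transport / total:.4f}")
```

For the isotropic profile the two rates coincide, while for the forward-peaked profile the weighted rate is only a small fraction of the total, in line with the small-angle estimate $\theta^2/2$.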

This  point  remains  valid  quite  generally:  Forward  scattering  will  contribute  less 
to  effective  “collision  rates”  than  wide-angle  scattering,  unless  a  property  is  being 
measured  that  depends  delicately  on  the  precise  direction  in  which  certain  electrons 
move.  We  shall  see  in  Chapter  26  that  this  point  is  of  importance  in  understanding 
the  temperature  dependence  of  the  DC  electrical  resistivity. 


21 Note that the relaxation time given by (16.32) can depend on the magnitude, but not the direction of k.


PROBLEMS

1. Let $h(\mathbf{k})$ be any one-electron property whose total density is

$$ H = \int \frac{d\mathbf{k}}{4\pi^3}\, h(\mathbf{k})\, g(\mathbf{k}), \tag{16.33} $$

where g is the electronic distribution function. If, for example, $h(\mathbf{k})$ is the electronic energy, $\mathcal{E}(\mathbf{k})$,
then H is the energy density u; if $h(\mathbf{k})$ is the electronic charge, $-e$, then H is the charge density,
$\rho$. The value of the density H in the neighborhood of a point changes because electrons move
into and out of the neighborhood, some as the result of the semiclassical equations of motion,
and some as a result of collisions. The change in H due to collisions is

$$ \left(\frac{dH}{dt}\right)_{\text{coll}} = \int \frac{d\mathbf{k}}{4\pi^3}\, h(\mathbf{k}) \left(\frac{\partial g}{\partial t}\right)_{\text{coll}}. \tag{16.34} $$


(a) Show from (16.8) that $(dH/dt)_{\text{coll}}$ vanishes provided that all collisions conserve h (i.e.,
provided that there is only scattering between levels k and k' with $h(\mathbf{k}) = h(\mathbf{k}')$).

(b) Show that if (16.8) is replaced by the relaxation-time approximation (16.9), then $(dH/dt)_{\text{coll}}$
vanishes only if the parameters $\mu(\mathbf{r}, t)$ and $T(\mathbf{r}, t)$, characterizing the local equilibrium distribution
f, yield an equilibrium value of H equal to the actual value (16.33).

(c) Deduce the equation of continuity, $\nabla\cdot\mathbf{j} + \partial\rho/\partial t = 0$, from the Boltzmann equation
(16.13).

(d) Deduce the equation of energy flow (13.83) from the Boltzmann equation (16.13), assuming
that $(du/dt)_{\text{coll}} = 0$.

2.  A  metal  is  perturbed  by  a  spatially  uniform  electric  field  and  temperature  gradient.  Making 
the relaxation-time approximation (16.9) (where $g^0$ is the local equilibrium distribution appropriate
to the imposed temperature gradient), solve the Boltzmann equation (16.13) to linear order
in  the  field  and  temperature  gradient,  and  verify  that  the  solution  is  identical  to  (13.43). 

3.  A  metal  at  constant  temperature  and  in  a  uniform  static  magnetic  field  is  perturbed  by  a 
uniform  static  electric  field. 

(a)  Making  the  relaxation-time  approximation  (16.9),  solve  the  Boltzmann  equation  (16.13) 
to  linear  order  in  the  electric  field  (treating  the  magnetic  field  exactly),  under  the  assumption 


$$ \mathcal{E}(\mathbf{k}) = \frac{\hbar^2 k^2}{2m^*}. \tag{16.35} $$


Verify  that  your  solution  is  of  the  form  (16.26). 

(b)  Construct  the  conductivity  tensor  from  your  solution,  and  verify  that  it  agrees  with  what 
one  finds  by  evaluating  (13.69)  and  (13.70)  for  a  single  free  electron  band. 

4.  Consider  the  Boltzmann  equation  (16.13)  for  a  metal  in  a  static  uniform  electric  field,  with 
a  collision  term  (16.18)  appropriate  to  elastic  impurity  scattering. 

(a)  Assuming  a  nonequilibrium  distribution  function  of  the  form 

$$ g(\mathbf{k}) = f(\mathbf{k}) + \delta g(\mathbf{k}), \tag{16.36} $$


328  Chapter  16  Beyond  the  Relaxation-Time  Approximation 


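where f is the equilibrium Fermi function and $\delta g(\mathbf{k})$ is of order E, derive to linear order in E
an integral equation obeyed by $\delta g$, and show that the conductivity can be written in the form

$$ \boldsymbol{\sigma} = e^2 \int \frac{d\mathbf{k}}{4\pi^3} \left(-\frac{\partial f}{\partial \mathcal{E}}\right) \mathbf{v}(\mathbf{k})\,\mathbf{u}(\mathbf{k}), \tag{16.37} $$

where $\mathbf{u}(\mathbf{k})$ is a solution to the integral equation

$$ \mathbf{v}(\mathbf{k}) = \int \frac{d\mathbf{k}'}{(2\pi)^3}\, W_{\mathbf{k},\mathbf{k}'}\,\big[\mathbf{u}(\mathbf{k}) - \mathbf{u}(\mathbf{k}')\big]. \tag{16.38} $$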

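(b) Let $\alpha(\mathbf{k})$ and $\gamma(\mathbf{k})$ be any two functions of k. Define

$$ (\alpha, \gamma) = e^2 \int \frac{d\mathbf{k}}{4\pi^3} \left(-\frac{\partial f}{\partial \mathcal{E}}\right) \alpha(\mathbf{k})\,\gamma(\mathbf{k}), \tag{16.39} $$

so that (16.37) can be written compactly as

$$ \sigma_{\mu\nu} = (v_\mu, u_\nu). \tag{16.40} $$

Define

$$ \{\alpha, \gamma\} = e^2 \int \frac{d\mathbf{k}}{4\pi^3} \left(-\frac{\partial f}{\partial \mathcal{E}}\right) \alpha(\mathbf{k}) \int \frac{d\mathbf{k}'}{(2\pi)^3}\, W_{\mathbf{k},\mathbf{k}'}\,\big[\gamma(\mathbf{k}) - \gamma(\mathbf{k}')\big]. \tag{16.41} $$

Show that $\{\alpha, \gamma\} = \{\gamma, \alpha\}$, and that Eq. (16.38) implies

$$ \{u_\mu, \gamma\} = (v_\mu, \gamma), \tag{16.42} $$

so that the conductivity can also be written in the form

$$ \sigma_{\mu\nu} = \{u_\mu, u_\nu\}. \tag{16.43} $$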


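(c) Prove for arbitrary $\alpha$ and $\gamma$ that

$$ \{\alpha, \alpha\} \geq \frac{\{\alpha, \gamma\}^2}{\{\gamma, \gamma\}}. \tag{16.44} $$

(Hint: Prove that $\{\alpha + \lambda\gamma, \alpha + \lambda\gamma\} \geq 0$ for arbitrary $\lambda$, and choose a $\lambda$ that minimizes the
left-hand side of this inequality.)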

(d) With the choice $\alpha = u_x$, deduce that $\sigma_{xx}$ satisfies the inequality

$$ \sigma_{xx} \geq \frac{(v_x, \gamma)^2}{\{\gamma, \gamma\}} \tag{16.45} $$

for arbitrary functions $\gamma$.

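(e) Suppose that $W = W^{(1)} + W^{(2)}$. Let $\gamma$ be $u_x$, where $\mathbf{u}$ is the solution to (16.38). Let $\sigma^{(1)}$
and $\sigma^{(2)}$ be the conductivities one would have if only $W^{(1)}$ or $W^{(2)}$ were present. Deduce from
(16.45), as it applies to $\sigma^{(1)}$ and $\sigma^{(2)}$, that

$$ \frac{1}{\sigma_{xx}} \geq \frac{1}{\sigma^{(1)}_{xx}} + \frac{1}{\sigma^{(2)}_{xx}}. \tag{16.46} $$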


The proper choice of the potential $U(\mathbf{r})$ appearing in the one-electron Schrödinger
equation

$$ -\frac{\hbar^2}{2m}\nabla^2\psi(\mathbf{r}) + U(\mathbf{r})\,\psi(\mathbf{r}) = \mathcal{E}\,\psi(\mathbf{r}) \tag{17.1} $$


is  a  subtle  problem.1  Underlying  this  problem  is  the  question  of  how  best  to  represent 
the  effects  of  electron-electron  interactions,  a  subject  we  have  so  far  altogether  ignored, 
by  working  in  the  independent  electron  approximation. 

From  a  fundamental  point  of  view  it  is  impossible  to  describe  electrons  in  a  metal 
correctly2  by  so  elementary  an  equation  as  (17.1),  however  ingenious  the  choice  of 
U(r),  because  of  the  enormously  complicating  effects  of  the  interactions  between 
electrons.  A  more  accurate  calculation  of  the  electronic  properties  of  a  metal  should 
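start with the Schrödinger equation for the N-particle wave function of all N electrons
in the metal,3 $\Psi(\mathbf{r}_1 s_1, \mathbf{r}_2 s_2, \ldots, \mathbf{r}_N s_N)$:

$$ H\Psi = \sum_{i=1}^{N}\left(-\frac{\hbar^2}{2m}\nabla_i^2\Psi - Ze^2\sum_{\mathbf{R}}\frac{1}{|\mathbf{r}_i - \mathbf{R}|}\Psi\right) + \frac{1}{2}\sum_{i\neq j}\frac{e^2}{|\mathbf{r}_i - \mathbf{r}_j|}\Psi = E\Psi. \tag{17.2} $$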


Here the negative potential-energy term represents the attractive electrostatic potentials
of the bare nuclei fixed at the points R of the Bravais lattice, and the last term
represents  the  interactions  of  the  electrons  with  each  other. 

One  has  no  hope  of  solving  an  equation  such  as  (17.2).  Further  progress  requires 
some  simplifying  physical  idea.  One  such  idea  is  suggested  by  asking  what  choice  of 
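$U(\mathbf{r})$ would make the one-electron equation (17.1) least unreasonable. Evidently $U(\mathbf{r})$
should include the potentials of the ions:

$$ U^{\text{ion}}(\mathbf{r}) = -Ze^2\sum_{\mathbf{R}}\frac{1}{|\mathbf{r} - \mathbf{R}|}. \tag{17.3} $$

In addition, however, we should like $U(\mathbf{r})$ to incorporate (at least approximately) the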
fact  that  the  electron  feels  the  electric  fields  of  all  the  other  electrons.  If  we  treated 
the  remaining  electrons  as  a  smooth  distribution  of  negative  charge  with  charge 
density  p,  the  potential  energy  of  the  given  electron  in  their  field  would  be 

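$$ U^{\text{el}}(\mathbf{r}) = -e\int d\mathbf{r}'\,\rho(\mathbf{r}')\,\frac{1}{|\mathbf{r} - \mathbf{r}'|}. \tag{17.4} $$

Furthermore, if we persisted in an independent electron picture, the contribution of
an electron in the level4 $\psi_i$ to the charge density would be

$$ \rho_i(\mathbf{r}) = -e\,|\psi_i(\mathbf{r})|^2. \tag{17.5} $$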


1  See  the  discussion  at  the  beginning  of  Chapter  11. 

2  Even  in  the  approximation  of  fixed  immovable  ions.  We  shall  retain  this  assumption  here,  relaxing 
it  in  Chapters  21-26. 

3 We include explicitly the dependence of $\Psi$ on the electron spin s, as well as the position r.

4  We  let  i  stand  for  both  the  spin  and  orbital  quantum  numbers  of  the  one-electron  level. 
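The total electronic charge density would then be

$$ \rho(\mathbf{r}) = -e\sum_i |\psi_i(\mathbf{r})|^2, \tag{17.6} $$

where the sum extends over all occupied one-electron levels in the metal.5

Placing (17.6) in (17.4) and letting $U = U^{\text{ion}} + U^{\text{el}}$ we arrive at the one-electron
equation:

$$ -\frac{\hbar^2}{2m}\nabla^2\psi_i(\mathbf{r}) + U^{\text{ion}}(\mathbf{r})\,\psi_i(\mathbf{r}) + \left[e^2\sum_j\int d\mathbf{r}'\,|\psi_j(\mathbf{r}')|^2\,\frac{1}{|\mathbf{r} - \mathbf{r}'|}\right]\psi_i(\mathbf{r}) = \mathcal{E}_i\,\psi_i(\mathbf{r}). \tag{17.7} $$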

The set of equations (17.7) (there is one for each occupied one-electron level $\psi_i(\mathbf{r})$)
is known as the Hartree equations. These nonlinear equations for the one-electron
wave functions and energies are solved, in practice, by iteration: A form is guessed
for $U^{\text{el}}$ (the term in brackets in (17.7)) on the basis of which the equations are solved.
A new $U^{\text{el}}$ is then computed from the resulting wave functions, $\psi_i(\mathbf{r})$, and a new
Schrödinger equation is solved. Ideally, the procedure is continued until further
iterations do not materially alter the potential.6
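
The structure of this iteration can be sketched on a deliberately tiny toy problem. The example below is not a Hartree calculation for a metal: it assumes a hypothetical two-site model with hopping t, in which the energy of each site is raised by u times the density on that site, and it uses a damped update (the mixing parameter is a common numerical device, not something taken from the text). All names and parameter values are illustrative.

```python
import math

def ground_state_density(e_a, e_b, t):
    """Occupation of site a in the ground state of the 2x2 Hamiltonian
    [[e_a, -t], [-t, e_b]] (the occupation of site b is 1 minus this)."""
    d = 0.5 * (e_a - e_b)
    r = math.sqrt(d * d + t * t)
    return 0.5 * (1.0 - d / r)

def self_consistent_density(eps_a, eps_b, t, u, mix=0.5, tol=1e-10, max_iter=1000):
    """Iterate: density -> potential -> ground state -> new density."""
    n_a = 0.5                                  # initial guess for the density
    for iteration in range(1, max_iter + 1):
        # Mean-field potential: each site energy is raised by u times its density.
        n_new = ground_state_density(eps_a + u * n_a, eps_b + u * (1.0 - n_a), t)
        if abs(n_new - n_a) < tol:             # potential no longer changes
            return n_new, iteration
        n_a = (1.0 - mix) * n_a + mix * n_new  # damped update for stability
    raise RuntimeError("self-consistency not reached")

n_a, iterations = self_consistent_density(eps_a=0.0, eps_b=0.5, t=1.0, u=2.0)
print(f"n_a = {n_a:.6f} after {iterations} iterations")
```

The loop stops when the density computed from the potential reproduces, to within the tolerance, the density that generated the potential, which is the sense of "self-consistent" here.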

The  Hartree  approximation  fails  to  represent  the  way  in  which  the  particular  (as 
opposed  to  the  average)  configuration  of  the  other  N  —  1  electrons  affects  the  electron 
under  consideration,  for  Eq.  (17.7)  describes  the  electron  as  interacting  only  with  the 
field  obtained  by  averaging  over  the  positions  of  the  remaining  electrons  (with  a 
weight  determined  by  their  wave  functions).  As  crude  an  approximation  as  this  is  to 
the full Schrödinger equation (17.2), it still leads to a mathematical task of considerable
numerical  complexity.  To  improve  upon  the  Hartree  approximation  is  quite  difficult. 

There  are,  however,  certain  other  important  physical  features  of  electron-electron 
interactions  that  cannot  be  treated  in  a  simple  self-consistent  field  approximation, 
but  are  nevertheless  fairly  well  understood.  In  this  chapter  we  shall  survey  the 
following: 

1.  The  extension  of  the  self-consistent  field  equations  to  include  what  is  known  as 
“exchange.” 

2.  The  phenomenon  of  “screening,”  which  is  of  importance  in  developing  a  still 
more  accurate  theory  of  electron-electron  interactions,  and  in  accounting  for  the 
response  of  metallic  electrons  to  charged  particles  such  as  ions,  impurities,  or 
other  electrons. 

3.  The  Fermi  liquid  theory  of  Landau,  which  provides  a  phenomenological  way  of 
predicting  the  qualitative  effects  of  electron-electron  interactions  on  the  electronic 
properties  of  metals,  as  well  as  offering  an  explanation  for  the  quite  extraordinary 
success  the  independent  electron  approximation  has  had. 

We  shall  not  discuss  any  of  the  many  attempts  to  develop  a  really  systematic  way 
of  treating  electron-electron  interactions.  Such  efforts  come  under  the  general  heading 


5  Although  the  electron  does  not  interact  with  itself,  it  is  not  necessary  to  exclude  its  level  from  the 
sum in (17.6), for the inclusion of one extra spatially extended level among the $10^{22}$ or so occupied levels
results  in  a  negligible  change  in  the  density. 

6 For this reason the Hartree approximation is also known as the “self-consistent field approximation.”
of “many-body problems,” and have been dealt with, in recent years, through “field
theoretic”  or  “Green’s  function”  methods. 


EXCHANGE:  THE  HARTREE-FOCK  APPROXIMATION 

The  Hartree  equations  (17.7)  have  a  fundamental  inadequacy  that  is  not  at  all  evident 
from the derivation we gave. The defect emerges if we return to the exact N-electron
Schrödinger equation and cast it into the equivalent variational form,7 which asserts
that a solution to $H\Psi = E\Psi$ is given by any state $\Psi$ that makes stationary the
quantity:

$$ \langle H\rangle_\Psi = \frac{(\Psi, H\Psi)}{(\Psi, \Psi)}, \tag{17.8} $$

where 

(¥,$)=£■■■£  [  dr,. . .  drN  ¥*(r1s1, .... rNsN)  Ofr . . . , r NsN).  (17.9) 

Si  SN  J 

In  particular,  the  ground-state  wave  function  is  that  ¥  that  minimizes  (17.8).  This 
property  of  the  ground  state  is  frequently  exploited  to  construct  approximate  ground 
states  by  minimizing  (17.8)  not  over  all  ¥,  but  over  a  limited  class  of  wave  functions 
chosen  to  have  a  more  tractable  form. 

It can be shown8 that the Hartree equations (17.7) follow from minimizing (17.8)
over all $\Psi$ of the form:

$$\Psi(\mathbf{r}_1 s_1, \mathbf{r}_2 s_2, \ldots, \mathbf{r}_N s_N) = \psi_1(\mathbf{r}_1 s_1)\, \psi_2(\mathbf{r}_2 s_2) \cdots \psi_N(\mathbf{r}_N s_N), \tag{17.10}$$

where the $\psi_i$ are a set of N orthonormal one-electron wave functions. Thus the
Hartree equations give the best approximation to the full N-electron wave function
that can be represented as a simple product of one-electron levels.

The wave function (17.10), however, is incompatible with the Pauli principle, which
requires the sign of $\Psi$ to change when any two of its arguments are interchanged:9

$$\Psi(\mathbf{r}_1 s_1, \ldots, \mathbf{r}_i s_i, \ldots, \mathbf{r}_j s_j, \ldots, \mathbf{r}_N s_N) = -\Psi(\mathbf{r}_1 s_1, \ldots, \mathbf{r}_j s_j, \ldots, \mathbf{r}_i s_i, \ldots, \mathbf{r}_N s_N). \tag{17.11}$$

Equation (17.11) cannot be satisfied for the product form (17.10) unless $\Psi$ vanishes
identically.

The simplest generalization of the Hartree approximation that incorporates the
antisymmetry requirement (17.11) is to replace the trial wave function (17.10) by a
Slater determinant of one-electron wave functions. This is a linear combination of the
product (17.10) and all other products obtainable from it by permutation of the $\mathbf{r}_j s_j$
among themselves, added together with weights +1 or −1 so as to guarantee
condition (17.11):

$$\Psi = \psi_1(\mathbf{r}_1 s_1)\psi_2(\mathbf{r}_2 s_2) \cdots \psi_N(\mathbf{r}_N s_N) - \psi_1(\mathbf{r}_2 s_2)\psi_2(\mathbf{r}_1 s_1) \cdots \psi_N(\mathbf{r}_N s_N) + \cdots. \tag{17.12}$$

7 See Appendix G. The discussion there is for the one-electron Schrödinger equation, but the general
case is, if anything, simpler.

8 We leave this as a straightforward exercise (Problem 1) for the reader.

9 The antisymmetry of the N-electron wave function is the fundamental manifestation of the Pauli
principle. The alternative statement of the principle, that no one-electron level can be multiply occupied,
can only be formulated in an independent electron approximation. There it follows directly from the fact
that (17.13) must vanish if any $\psi_i = \psi_j$. The Hartree state (17.10) is consistent (though not, like (17.13),
automatically so) with the prohibition on multiple occupation, provided that no two $\psi_i$ are the same.
However, it fails the more fundamental test of antisymmetry.


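This antisymmetrized product can be written compactly as the determinant of an
N × N matrix:10

$$\Psi(\mathbf{r}_1 s_1, \mathbf{r}_2 s_2, \ldots, \mathbf{r}_N s_N) =
\begin{vmatrix}
\psi_1(\mathbf{r}_1 s_1) & \psi_1(\mathbf{r}_2 s_2) & \cdots & \psi_1(\mathbf{r}_N s_N) \\
\psi_2(\mathbf{r}_1 s_1) & \psi_2(\mathbf{r}_2 s_2) & \cdots & \psi_2(\mathbf{r}_N s_N) \\
\vdots & \vdots & & \vdots \\
\psi_N(\mathbf{r}_1 s_1) & \psi_N(\mathbf{r}_2 s_2) & \cdots & \psi_N(\mathbf{r}_N s_N)
\end{vmatrix}. \tag{17.13}$$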
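The antisymmetrization in (17.12) and (17.13) can be checked numerically for a small case. The sketch below is illustrative only: it builds the signed sum over permutations directly for N = 3, using hypothetical one-dimensional orbitals sin(nπx) and ignoring spin (both are assumptions made for the example, not part of the text). It then confirms that exchanging two arguments flips the sign, as (17.11) requires, and that the function vanishes when two orbitals coincide.

```python
import math
from itertools import permutations


def permutation_sign(perm):
    """Return +1 for an even permutation of 0..n-1 and -1 for an odd one."""
    perm = list(perm)
    sign = 1
    for i in range(len(perm)):
        # Move the correct value into slot i by swaps; each swap flips the sign.
        while perm[i] != i:
            j = perm[i]
            perm[i], perm[j] = perm[j], perm[i]
            sign = -sign
    return sign


def slater(orbitals, coords):
    """Signed sum over permutations P of prod_i orbitals[i](coords[P(i)]), as in (17.12)."""
    n = len(orbitals)
    total = 0.0
    for perm in permutations(range(n)):
        term = float(permutation_sign(perm))
        for i, p in enumerate(perm):
            term *= orbitals[i](coords[p])
        total += term
    return total


# Hypothetical spinless orbitals on 0 < x < 1 (an assumption for this example).
orbitals = [lambda x, n=n: math.sin(n * math.pi * x) for n in (1, 2, 3)]
coords = [0.13, 0.42, 0.77]

psi = slater(orbitals, coords)
psi_swapped = slater(orbitals, [coords[1], coords[0], coords[2]])
psi_repeated = slater([orbitals[0], orbitals[0], orbitals[2]], coords)

print(psi, psi_swapped)   # equal magnitude, opposite sign
print(psi_repeated)       # zero up to rounding
```

The explicit permutation sum costs N! terms and is used here only because it mirrors (17.12) term by term; for anything beyond a handful of electrons one would evaluate the determinant (17.13) by elimination instead.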


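With a little bookkeeping (Problem 2) it can be shown that if the energy (17.8) is
evaluated in a state of the form (17.13), with orthonormal single electron wave
functions $\psi_1 \ldots \psi_N$, then the result is:

$$\langle H \rangle_\Psi = \sum_i \int d\mathbf{r}\, \psi_i^*(\mathbf{r}) \left( -\frac{\hbar^2}{2m}\nabla^2 + U^{\rm ion}(\mathbf{r}) \right) \psi_i(\mathbf{r})
+ \frac{1}{2} \sum_{i,j} \int d\mathbf{r}\, d\mathbf{r}'\, \frac{e^2}{|\mathbf{r} - \mathbf{r}'|}\, |\psi_i(\mathbf{r})|^2\, |\psi_j(\mathbf{r}')|^2
- \frac{1}{2} \sum_{i,j} \int d\mathbf{r}\, d\mathbf{r}'\, \frac{e^2}{|\mathbf{r} - \mathbf{r}'|}\, \delta_{s_i s_j}\, \psi_i^*(\mathbf{r})\psi_i(\mathbf{r}')\psi_j^*(\mathbf{r}')\psi_j(\mathbf{r}). \tag{17.14}$$

Notice that the last term in (17.14) is negative and involves the product $\psi_i^*(\mathbf{r})\psi_i(\mathbf{r}')$
in place of the usual one-electron combination $|\psi_i(\mathbf{r})|^2$. Minimizing (17.14) with respect
to the $\psi_i^*$ (Problem 2) leads to a generalization of the Hartree equations known as
the Hartree-Fock equations:

$$-\frac{\hbar^2}{2m}\nabla^2 \psi_i(\mathbf{r}) + U^{\rm ion}(\mathbf{r})\psi_i(\mathbf{r}) + U^{\rm el}(\mathbf{r})\psi_i(\mathbf{r})
- \sum_j \int d\mathbf{r}'\, \frac{e^2}{|\mathbf{r} - \mathbf{r}'|}\, \psi_j^*(\mathbf{r}')\psi_i(\mathbf{r}')\psi_j(\mathbf{r})\, \delta_{s_i s_j} = \varepsilon_i \psi_i(\mathbf{r}), \tag{17.15}$$

where $U^{\rm el}$ is defined in (17.4) and (17.6).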

These equations differ from the Hartree equations (17.7) by an additional term
on the left side, known as the exchange term. The complexity introduced by the
exchange term is considerable. Like the self-consistent field $U^{\rm el}$ (often referred to as
the direct term) it is nonlinear in $\psi$, but unlike the direct term it is not of the form
$V(\mathbf{r})\psi(\mathbf{r})$. Instead, it has the structure $\int V(\mathbf{r}, \mathbf{r}')\psi(\mathbf{r}')\, d\mathbf{r}'$; i.e., it is an integral operator.
As a result, the Hartree-Fock equations are in general quite intractable. The one
exception is the free electron gas. When the periodic potential is zero (or constant)
the Hartree-Fock equations can be solved exactly by choosing the $\psi_i$ to be a set of
orthonormal plane waves.11 Although the case of free electrons has dubious bearing
on the problem of electrons in a real metal, the free electron solution suggests a further
approximation that makes the Hartree-Fock equations in a periodic potential more
manageable. We therefore comment briefly on the free electron case.

10 Since a determinant changes sign when any two columns are interchanged, this ensures that the
condition (17.11) holds.

The familiar set of free electron plane waves,

$$\psi_i(\mathbf{r}) = \left( \frac{e^{i\mathbf{k}_i \cdot \mathbf{r}}}{\sqrt{V}} \right) \times \text{spin function}, \tag{17.16}$$

in which each wave vector less than $k_F$ occurs twice (once for each spin orientation)
in the Slater determinant, gives a solution to the Hartree-Fock equation for free
electrons. For if plane waves are indeed solutions, then the electronic charge density
that determines $U^{\rm el}$ will be uniform. However, in the free electron gas the ions are
represented by a uniform distribution of positive charge with the same density as
the electronic charge. Hence the potential of the ions is precisely canceled by the
direct term: $U^{\rm ion} + U^{\rm el} = 0$. Only the exchange term survives, which is easily
evaluated by writing the Coulomb interaction in terms of its Fourier transform12:

$$\frac{e^2}{|\mathbf{r} - \mathbf{r}'|} = 4\pi e^2\, \frac{1}{V} \sum_{\mathbf{q}} \frac{1}{q^2}\, e^{i\mathbf{q} \cdot (\mathbf{r} - \mathbf{r}')} \to 4\pi e^2 \int \frac{d\mathbf{q}}{(2\pi)^3}\, \frac{1}{q^2}\, e^{i\mathbf{q} \cdot (\mathbf{r} - \mathbf{r}')}. \tag{17.17}$$


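If (17.17) is substituted into the exchange term in (17.15) and the $\psi_i$ are all taken to
be plane waves of the form (17.16), then the left side of (17.15) assumes the form

$$\varepsilon(\mathbf{k}_i)\, \psi_i, \tag{17.18}$$

where

$$\varepsilon(\mathbf{k}) = \frac{\hbar^2 k^2}{2m} - \int_{k' < k_F} \frac{d\mathbf{k}'}{(2\pi)^3}\, \frac{4\pi e^2}{|\mathbf{k} - \mathbf{k}'|^2} = \frac{\hbar^2 k^2}{2m} - \frac{2e^2}{\pi}\, k_F\, F\!\left( \frac{k}{k_F} \right), \tag{17.19}$$

and

$$F(x) = \frac{1}{2} + \frac{1 - x^2}{4x} \ln \left| \frac{1 + x}{1 - x} \right|. \tag{17.20}$$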


This shows that plane waves do indeed solve (17.15), and that the energy of the
one-electron level with wave vector $\mathbf{k}$ is given by (17.19). The function F(x) is plotted
in Figure 17.1a, and the energy $\varepsilon(\mathbf{k})$ in Figure 17.1b.
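The behavior of F(x) and of the Hartree-Fock energy can be explored with a few lines of code. The following is a minimal sketch, assuming the form of F(x) in Eq. (17.20) and measuring the energy (17.19) in units of the free electron Fermi energy; the function names, the value r_s/a_0 = 4, and the finite-difference step sizes are choices made for this illustration.

```python
import math


def F(x):
    """F(x) = 1/2 + (1 - x^2)/(4x) * ln|(1 + x)/(1 - x)|, with its limits at x = 0 and x = 1."""
    x = abs(x)
    if x < 1e-8:
        return 1.0          # limit x -> 0
    if abs(x - 1.0) < 1e-12:
        return 0.5          # the logarithmic term vanishes at x = 1
    return 0.5 + (1.0 - x * x) / (4.0 * x) * math.log(abs((1.0 + x) / (1.0 - x)))


# 4 / (pi * k_F * a_0) = EXCHANGE_COEFF * (r_s / a_0), using k_F * r_s = (9*pi/4)**(1/3).
EXCHANGE_COEFF = 4.0 / (math.pi * (9.0 * math.pi / 4.0) ** (1.0 / 3.0))


def hf_energy(x, rs):
    """Hartree-Fock level energy in units of the free electron Fermi energy; rs = r_s / a_0."""
    return x * x - EXCHANGE_COEFF * rs * F(x)


def slope(x, rs, h):
    """Centered finite-difference estimate of d(energy)/dx with step h."""
    return (hf_energy(x + h, rs) - hf_energy(x - h, rs)) / (2.0 * h)


rs = 4.0
bandwidth = hf_energy(1.0, rs) - hf_energy(0.0, rs)
print(round(EXCHANGE_COEFF, 3))   # coefficient multiplying (r_s/a_0) F(x)
print(round(bandwidth, 2))        # bandwidth between k = 0 and k = k_F

# The slope at k = k_F keeps growing (logarithmically) as the step shrinks.
for h in (1e-2, 1e-4, 1e-6):
    print(h, slope(1.0, rs, h))
```

Because the divergence of the slope at x = 1 is only logarithmic, the finite-difference estimates grow slowly: shrinking the step by four orders of magnitude roughly doubles the slope rather than blowing it up.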

11 More complicated solutions, known as spin density waves, are also possible (Chapter 32).

12 See Problem 3.

Figure 17.1
(a) A plot of the function F(x), defined by Eq. (17.20). Although the slope of this function
diverges at x = 1, the divergence is logarithmic, and cannot be revealed by changing the
scale of the plot. At large values of x the behavior is $F(x) \to 1/3x^2$. (b) The Hartree-Fock
energy (17.19) may be written, in units of the free electron Fermi energy $\varepsilon_F^0 = \hbar^2 k_F^2/2m$,

$$\frac{\varepsilon_{\mathbf{k}}}{\varepsilon_F^0} = x^2 - 0.663 \left( \frac{r_s}{a_0} \right) F(x),$$

where $x = k/k_F$. This function is plotted here for $r_s/a_0 = 4$, and may be compared with
the free electron energy (white line). Note that in addition to depressing the free electron
energy substantially, the exchange term has led to a considerable increase in the bandwidth
(in these units from 1 to 2.33), an effect not corroborated by experiments such as soft X-ray
emission or photoelectron emission from metals, which purport to measure such bandwidths.

Several features of the energy (17.19) deserve comment:

1. Although the Hartree-Fock one-electron levels continue to be plane waves, the
energy of an electron in the level $e^{i\mathbf{k} \cdot \mathbf{r}}$ is now given by $\hbar^2 k^2/2m$ plus a term describing
the effects of the electron-electron interaction. To compute the contribution of these
interactions to the total energy of the N-electron system, we must sum this correction
over all $k < k_F$, multiply by 2 (for the two spin levels that are occupied for each $\mathbf{k}$),
and divide by 2 (because, in summing the interaction energy of a given electron over
all electrons, we are counting each electron pair twice). In this way we find that

$$E = 2 \sum_{k < k_F} \frac{\hbar^2 k^2}{2m} - \frac{e^2 k_F}{\pi} \sum_{k < k_F} \left[ 1 + \frac{k_F^2 - k^2}{2 k k_F} \ln \left| \frac{k_F + k}{k_F - k} \right| \right]. \tag{17.21}$$

We have already evaluated the first term in Chapter 2 (Eq. (2.31)). If we transform
the second term into an integral, it can be evaluated to give:

$$E = N \left[ \frac{3}{5}\, \varepsilon_F - \frac{3}{4}\, \frac{e^2 k_F}{\pi} \right]. \tag{17.22}$$

This result is conventionally written in terms of the rydberg ($e^2/2a_0$ = 1 Ry = 13.6 eV)
and the parameter $r_s/a_0$ (page 4):

$$\frac{E}{N} = \frac{e^2}{2a_0} \left[ \frac{3}{5} (k_F a_0)^2 - \frac{3}{2\pi} (k_F a_0) \right] = \left[ \frac{2.21}{(r_s/a_0)^2} - \frac{0.916}{(r_s/a_0)} \right] \text{Ry}. \tag{17.23}$$

Since $r_s/a_0$ in metals is in the range from 2 to 6, the second term in (17.23) is quite
comparable to the first in size, and indicates that electron-electron interactions cannot
be overlooked in any free electron estimate of the electronic energy of a metal.

2. With much labor the exact leading terms in a high-density (i.e., small $r_s/a_0$)
expansion of the ground-state energy of the electron gas have been calculated:13

$$\frac{E}{N} = \left[ \frac{2.21}{(r_s/a_0)^2} - \frac{0.916}{(r_s/a_0)} + 0.0622 \ln (r_s/a_0) - 0.096 + O(r_s/a_0) \right] \text{Ry}. \tag{17.24}$$

Note that the first two terms are just the Hartree-Fock result (17.23). Since $r_s/a_0$ is
not small in metals, this expansion is of doubtful relevance, but its derivation marked
one of the first systematic attempts to work out a more accurate theory of
electron-electron interactions. The next two terms in (17.24) and all other corrections to the
Hartree-Fock result are conventionally referred to as the correlation energy. Note
that the correlation energy is not a quantity with physical significance; it merely
represents the error incurred in making a fairly crude first-order approximation.14

13 M. Gell-Mann and K. Brueckner, Phys. Rev. 106, 364 (1957).

14 Indeed, the name “correlation energy” is something of a misnomer. The Hartree approximation
ignores electron correlations; i.e., the N-electron probability distribution factors into a product of N
one-electron distributions. The Hartree-Fock wave function (17.13) does not so factor; i.e., electron
correlations are introduced at this next level of approximation. Nevertheless the “correlation energy” is
defined to exclude the exchange contribution, containing only further corrections beyond that provided
by Hartree-Fock theory.

3. The average change in the energy of an electron from $\hbar^2 k^2/2m$ due to exchange
is just the second term in E/N; i.e.,

$$\langle \varepsilon^{\rm exchg} \rangle = -\frac{3}{4}\, \frac{e^2 k_F}{\pi} = -\frac{0.916}{(r_s/a_0)}\ \text{Ry}. \tag{17.25}$$

This form led Slater15 to suggest that in nonuniform systems and, in particular, in
the presence of the periodic potential of the lattice, one could simplify the
Hartree-Fock equations by replacing the exchange term in (17.15) by a local energy given by
twice (17.25) with $k_F$ evaluated at the local density; i.e., he proposed an equation
in which the effect of exchange was taken into account by merely adding to the
Hartree term $U^{\rm el}(\mathbf{r})$ an additional potential $U^{\rm exchg}(\mathbf{r})$, given by

$$U^{\rm exchg}(\mathbf{r}) = -2.95\, (a_0^3\, n(\mathbf{r}))^{1/3}\ \text{Ry}. \tag{17.26}$$

This procedure, gross and ad hoc though it is, is actually followed in many band
structure calculations. There have been some controversies16 over whether it is better
to average the free electron exchange over all $k$ or to evaluate it at $k = k_F$, but the
crude nature of the approximation makes the dispute one of limited content. It
is hard to say more for this simplification than that it approximates the effects of
exchange by introducing a potential that favors regions of high density, in a way
that roughly mimics the density dependence of the exchange term in the free electron
energy density.

4. Equation (17.19) has one rather alarming feature: The derivative $\partial\varepsilon/\partial k$ becomes
logarithmically infinite17 at $k = k_F$. Since $(1/\hbar)\, \partial\varepsilon/\partial k|_{k = k_F}$ is precisely the velocity
of those electrons most important for metallic properties, this is an unsettling
result. A singularity at $k = k_F$ in the one-electron energies makes the Sommerfeld
expansion (2.70) invalid, and leads in this case to an electronic heat capacity at low
temperatures going not as $T$, but as $T/|\ln T|$.

The singularity does not occur for a general noncoulombic potential, but can be
traced back to the divergence of the Fourier transform $4\pi e^2/k^2$ of the interaction
$e^2/r$, at $k = 0$. This in turn reflects the very long range of the inverse square force.
If the Coulomb interaction were replaced, for example, by one of the form $e^2 (e^{-k_0 r}/r)$,
then its Fourier transform18 would be $4\pi e^2/(k^2 + k_0^2)$, the $k = 0$ divergence would
be eliminated, and the unphysical singularity of the Hartree-Fock energies removed.
It can be argued (see below) that the potential appearing in the exchange term should
be modified in just this way to take into account the fields of electrons other than
the two at $\mathbf{r}$ and $\mathbf{r}'$, which rearrange themselves so as partially to cancel the fields the
two electrons exert on one another. This effect, known as “screening,” is of fundamental
importance not only for its effects on the electron-electron interaction energy, but,
more generally, in determining the behavior of any charge-carrying disturbance in
a metal.19
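The numerical constants quoted in the comments above (2.21 and 0.916 in the Hartree-Fock energy per electron, and 2.95 in Slater's local exchange potential) follow from elementary free electron relations and can be reproduced directly. The sketch below is a check under stated assumptions: it uses k_F r_s = (9π/4)^(1/3) and e² = 2a_0 Ry, takes F(x) in the form 1/2 + ((1 − x²)/4x) ln|(1 + x)/(1 − x)|, and estimates the integral of x²F(x) with a simple midpoint rule; all function and constant names are invented for the illustration.

```python
import math


def F(x):
    """F(x) = 1/2 + (1 - x^2)/(4x) * ln|(1 + x)/(1 - x)|, with its limits at x = 0 and x = 1."""
    x = abs(x)
    if x < 1e-8:
        return 1.0
    if abs(x - 1.0) < 1e-12:
        return 0.5
    return 0.5 + (1.0 - x * x) / (4.0 * x) * math.log(abs((1.0 + x) / (1.0 - x)))


# k_F * r_s = (9*pi/4)**(1/3), from n = k_F**3 / (3*pi**2) = 3 / (4*pi*r_s**3).
KF_RS = (9.0 * math.pi / 4.0) ** (1.0 / 3.0)

# E/N = [(3/5)(k_F a_0)^2 - (3/(2*pi))(k_F a_0)] Ry, with k_F a_0 = KF_RS / (r_s/a_0).
KINETIC_COEFF = 0.6 * KF_RS ** 2                 # multiplies 1/(r_s/a_0)^2
EXCHANGE_COEFF = 3.0 * KF_RS / (2.0 * math.pi)   # multiplies 1/(r_s/a_0)


def hf_energy_per_electron(rs):
    """Hartree-Fock E/N in rydbergs for the free electron gas; rs = r_s / a_0."""
    return KINETIC_COEFF / rs ** 2 - EXCHANGE_COEFF / rs


def integral_x2_F(n=20000):
    """Midpoint-rule estimate of the integral of x^2 F(x) over 0 < x < 1."""
    h = 1.0 / n
    return h * sum(((i + 0.5) * h) ** 2 * F((i + 0.5) * h) for i in range(n))


# Twice the average exchange energy is -(3/pi)(k_F a_0) Ry; with k_F = (3 pi^2 n)^(1/3)
# the coefficient of (a_0^3 n)^(1/3) in the local exchange potential is:
SLATER_COEFF = (3.0 / math.pi) * (3.0 * math.pi ** 2) ** (1.0 / 3.0)


def slater_exchange(n_a0_cubed):
    """Slater exchange potential in rydbergs; the argument is the dimensionless a_0^3 * n."""
    return -SLATER_COEFF * n_a0_cubed ** (1.0 / 3.0)


print(round(KINETIC_COEFF, 2), round(EXCHANGE_COEFF, 3), round(SLATER_COEFF, 2))
print(integral_x2_F())   # the exchange sum reduces to this integral, whose value is 1/4
for rs in (2.0, 4.0, 6.0):
    # Ratio of exchange to kinetic magnitude grows linearly with r_s/a_0.
    print(rs, hf_energy_per_electron(rs), EXCHANGE_COEFF * rs / KINETIC_COEFF)
```

The value 1/4 for the integral is what turns the sum over occupied levels into the factor 3/4 in the exchange energy per electron, and evaluating the Slater potential at the uniform density 3/(4π r_s³) returns exactly twice the average exchange energy, as the construction requires.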

SCREENING  (GENERAL) 

The phenomenon of screening is one of the simplest and most important manifestations
of electron-electron interactions. We consider here only screening in a free
electron gas. The detailed theory of screening in the presence of a realistic periodic
potential is much more complex, and one is often forced to use the free electron
form of the theory even in discussions of real metals.

15 J. C. Slater, Phys. Rev. 81, 385 (1951); 82, 538 (1951); 91, 528 (1953).

16 See, for example, W. Kohn and L. J. Sham, Phys. Rev. 140, A1133 (1965), and R. Gaspar, Acta
Phys. Acad. Sci. Hung. 3, 263 (1954).

17 See Figure 17.1.

18 Problem 3.

19 The ions in a metal are an important case and will be referred to in the context of dynamic screening
in Chapter 26.

Suppose a positively charged particle is placed at a given position in the electron
gas and rigidly held there. It will then attract electrons, creating a surplus of negative
charge in its neighborhood, which reduces (or screens) its field. In treating this
screening it is convenient to introduce two electrostatic potentials. The first, $\phi^{\rm ext}$,
arises solely from the positively charged particle itself, and therefore satisfies Poisson’s
equation in the form:

$$-\nabla^2 \phi^{\rm ext}(\mathbf{r}) = 4\pi \rho^{\rm ext}(\mathbf{r}), \tag{17.27}$$

where $\rho^{\rm ext}(\mathbf{r})$ is the particle’s charge density.20 The second, $\phi$, is the full physical
potential, produced by both the positively charged particle and the cloud of screening
electrons it induces. It therefore satisfies

$$-\nabla^2 \phi(\mathbf{r}) = 4\pi \rho(\mathbf{r}), \tag{17.28}$$

where $\rho$ is the full charge density,

$$\rho(\mathbf{r}) = \rho^{\rm ext}(\mathbf{r}) + \rho^{\rm ind}(\mathbf{r}), \tag{17.29}$$

and $\rho^{\rm ind}$ is the charge density induced in the electron gas by the presence of the external
particle.

By analogy with the theory of dielectric media, one assumes that $\phi$ and $\phi^{\rm ext}$ are
linearly related by an equation of the form21

$$\phi^{\rm ext}(\mathbf{r}) = \int d\mathbf{r}'\, \epsilon(\mathbf{r}, \mathbf{r}')\, \phi(\mathbf{r}'). \tag{17.30}$$

In a spatially uniform electron gas $\epsilon$ can depend only on the separation between the
points $\mathbf{r}$ and $\mathbf{r}'$, but not on their absolute position:

$$\epsilon(\mathbf{r}, \mathbf{r}') = \epsilon(\mathbf{r} - \mathbf{r}'). \tag{17.31}$$

Thus (17.30) assumes the form

$$\phi^{\rm ext}(\mathbf{r}) = \int d\mathbf{r}'\, \epsilon(\mathbf{r} - \mathbf{r}')\, \phi(\mathbf{r}'), \tag{17.32}$$

which implies22 that the corresponding Fourier transforms satisfy

$$\phi^{\rm ext}(\mathbf{q}) = \epsilon(\mathbf{q})\, \phi(\mathbf{q}), \tag{17.33}$$

where the Fourier transforms are defined by

$$\epsilon(\mathbf{q}) = \int d\mathbf{r}\, e^{-i\mathbf{q} \cdot \mathbf{r}}\, \epsilon(\mathbf{r}), \tag{17.34}$$

$$\epsilon(\mathbf{r}) = \int \frac{d\mathbf{q}}{(2\pi)^3}\, e^{i\mathbf{q} \cdot \mathbf{r}}\, \epsilon(\mathbf{q}), \tag{17.35}$$

with similar equations for $\phi$ and $\phi^{\rm ext}$.

The quantity $\epsilon(\mathbf{q})$ is called the (wave vector dependent) dielectric constant of the
metal.23 When written in the form

$$\phi(\mathbf{q}) = \frac{1}{\epsilon(\mathbf{q})}\, \phi^{\rm ext}(\mathbf{q}), \tag{17.36}$$

equation (17.33) asserts that the $\mathbf{q}$th Fourier component of the total potential present
in the electron gas is just the $\mathbf{q}$th Fourier component of the external potential, reduced
by the factor $1/\epsilon(\mathbf{q})$. This kind of relation is familiar in elementary discussions of
dielectrics, where, however, the fields are generally uniform so that the dependence
on wave vector does not come into play.

20 The term “external” and superscript “ext” used to describe the applied charge are not meant to
suggest that the charge is placed outside of the metal (it is in fact inside the metal) but refer only to its
origin from some source of charge external to the system of electrons.

21 The potential $\phi^{\rm ext}$ is analogous to the electric displacement $\mathbf{D}$ (whose sources are the “free” charges
extrinsic to the medium); the potential $\phi$ is analogous to the electric field $\mathbf{E}$, which arises from the total
charge distribution, including both the “free” charges and the “bound” charges induced in the medium.
The relation $\mathbf{D}(\mathbf{r}) = \int d\mathbf{r}'\, \epsilon(\mathbf{r} - \mathbf{r}')\, \mathbf{E}(\mathbf{r}')$ (or the corresponding relation (17.32)) reduces to the more familiar
local relation $\mathbf{D}(\mathbf{r}) = \epsilon \mathbf{E}(\mathbf{r})$, with the dielectric constant $\epsilon$ being given by $\int d\mathbf{r}\, \epsilon(\mathbf{r})$, when $\mathbf{D}$ and $\mathbf{E}$ are
spatially uniform fields (or, more generally, when the fields vary slowly on the scale of some $r_0$ for which
$\epsilon(\mathbf{r}) \approx 0$, $r > r_0$.)

The quantity that turns out to be the most natural to calculate directly is not the
dielectric constant $\epsilon(\mathbf{q})$ but the charge density $\rho^{\rm ind}(\mathbf{r})$ induced in the electron gas by
the total potential $\phi(\mathbf{r})$. We shall examine below how this can be calculated. When
$\rho^{\rm ind}$ and $\phi$ are linearly related (as they will be for sufficiently weak $\phi$), then their
Fourier transforms will satisfy a relation of the form

$$\rho^{\rm ind}(\mathbf{q}) = \chi(\mathbf{q})\, \phi(\mathbf{q}). \tag{17.37}$$

We can relate $\epsilon$ (the quantity of direct physical interest) to $\chi$ (the quantity that
emerges naturally from a calculation) as follows:

The Fourier transforms of the Poisson equations (17.27) and (17.28) are

$$q^2 \phi^{\rm ext}(\mathbf{q}) = 4\pi \rho^{\rm ext}(\mathbf{q}), \qquad q^2 \phi(\mathbf{q}) = 4\pi \rho(\mathbf{q}). \tag{17.38}$$

Together with (17.29) and (17.37) these give

$$\frac{q^2}{4\pi} \left( \phi(\mathbf{q}) - \phi^{\rm ext}(\mathbf{q}) \right) = \chi(\mathbf{q})\, \phi(\mathbf{q}), \tag{17.39}$$

or

$$\phi(\mathbf{q}) = \phi^{\rm ext}(\mathbf{q}) \Big/ \left( 1 - \frac{4\pi}{q^2}\, \chi(\mathbf{q}) \right). \tag{17.40}$$

Comparing this with (17.36) leads to the relation

$$\epsilon(\mathbf{q}) = 1 - \frac{4\pi}{q^2}\, \chi(\mathbf{q}) = 1 - \frac{4\pi}{q^2}\, \frac{\rho^{\rm ind}(\mathbf{q})}{\phi(\mathbf{q})}. \tag{17.41}$$

22 Through the convolution theorem of Fourier analysis. We follow the usual physicists’ practice of
using the same symbol for a function and its Fourier transform, distinguishing the two by the symbol
used for the argument.

23 In elementary discussions of electrostatics it is sometimes asserted that the dielectric constant of
a metal is infinite; i.e., that the charge is free to move and the medium is therefore infinitely polarizable.
We shall find that the form of $\epsilon(\mathbf{q})$ is consistent with this, for in the limit of a spatially uniform applied
field ($q \to 0$) $\epsilon(\mathbf{q})$ does indeed become infinite. (See Eq. (17.51).)

Except for the assumption that the externally applied charge is weak enough to
produce only a linear response in the electron gas, the analysis up to this point has
been exact (though little more than a series of definitions). Serious approximations
become necessary when one tries to calculate $\chi$. Two widely prevalent theories of $\chi$
are employed, both simplifications of a general Hartree calculation of the charge
induced by the impurity. The first, the Thomas-Fermi method, is basically the
classical (more precisely, the semiclassical) limit of the Hartree theory; the second,
the Lindhard method, also known as the random phase approximation (or RPA), is
basically an exact Hartree calculation of the charge density in the presence of the
self-consistent field of the external charge plus electron gas, except that the Hartree
calculation is simplified from the start by the fact that $\rho^{\rm ind}$ is only required to linear
order in $\phi$.

The Thomas-Fermi method has the advantage that it is applicable even when a
linear relation between $\rho^{\rm ind}$ and $\phi$ does not hold; it has the disadvantage that it is
reliable only for very slowly varying external potentials. When the Thomas-Fermi
result is linearized, it is identical to the Lindhard result at small values of $q$, and less
accurate than the Lindhard result when $q$ is not small. We describe the two cases
separately below.


THOMAS-FERMI  THEORY  OF  SCREENING 

In principle, to find the charge density in the presence of the total potential $\phi =
\phi^{\rm ext} + \phi^{\rm ind}$ we must solve the one-electron Schrödinger equation,24

$$-\frac{\hbar^2}{2m} \nabla^2 \psi_i(\mathbf{r}) - e\phi(\mathbf{r})\, \psi_i(\mathbf{r}) = \varepsilon_i\, \psi_i(\mathbf{r}), \tag{17.42}$$

and then construct the electronic density from the one-electron wave functions using
(17.6). The Thomas-Fermi approach is based on a simplification in this procedure
that can be made when the total potential $\phi(\mathbf{r})$ is a very slowly varying function of
$\mathbf{r}$. Here we mean “slowly varying” in precisely the same sense as in Chapters 2 and
12; i.e., we assume it is meaningful to specify the energy vs. wave vector relation of
an electron at the position $\mathbf{r}$, and we take this relation to be

$$\varepsilon(\mathbf{k}) = \frac{\hbar^2 k^2}{2m} - e\phi(\mathbf{r}). \tag{17.43}$$

Thus the energy is modified from its free electron value by the total local potential.

24 Because $\phi$ is the total potential, arising from both the external charge and the charge density it
induces in the electron gas, Eq. (17.42) is implicitly treating electron-electron interactions in the Hartree
approximation. The self-consistency problem (at least in the linearized version of the theory) is contained
in the stipulation that $\phi$ is related to the electronic charge density $\rho^{\rm ind}$ determined by the solutions to
(17.42), through Eqs. (17.36) and (17.41).

Evidently (17.43) makes sense only in terms of wave packets. These will have a
typical spread in position at least of the order of $1/k_F$. We must therefore require
that $\phi(\mathbf{r})$ vary slowly on the scale of a Fermi wavelength. In terms of Fourier
components this means that the calculation will be reliable only for values of $\chi(\mathbf{q})$ with
$q \ll k_F$. We shall verify this limitation explicitly when we turn to the more accurate
Lindhard approach.

Thus we assume that the solutions to Eq. (17.42) describe a set of electrons with
energies of the simple classical form (17.43). To calculate the charge density produced
by these electrons we place their energies into the expression (2.58) for the electronic
number density, to find (with $\beta = 1/k_B T$)

$$n(\mathbf{r}) = \int \frac{d\mathbf{k}}{4\pi^3}\, \frac{1}{\exp \left[ \beta \left( (\hbar^2 k^2/2m) - e\phi(\mathbf{r}) - \mu \right) \right] + 1}. \tag{17.44}$$

The induced charge density is just $-en(\mathbf{r}) + en_0$, where the second term is the charge
density of the uniform positive background. The number density of the background
is just the density of the electronic system when $\phi^{\rm ext}$, and hence $\phi$, vanishes:25

$$n_0(\mu) = \int \frac{d\mathbf{k}}{4\pi^3}\, \frac{1}{\exp \left[ \beta \left( (\hbar^2 k^2/2m) - \mu \right) \right] + 1}. \tag{17.45}$$

We combine (17.44) and (17.45) to write,

$$\rho^{\rm ind}(\mathbf{r}) = -e \left[ n_0(\mu + e\phi(\mathbf{r})) - n_0(\mu) \right]. \tag{17.46}$$

This is the basic equation of nonlinear Thomas-Fermi theory.

In the present case we assume that $\phi$ is small enough for (17.46) to be expanded
to give in leading order

$$\rho^{\rm ind}(\mathbf{r}) = -e^2\, \frac{\partial n_0}{\partial \mu}\, \phi(\mathbf{r}). \tag{17.47}$$


Comparing (17.47) with (17.37) we find that $\chi(\mathbf{q})$ is given by the constant

$$\chi(\mathbf{q}) = -e^2\, \frac{\partial n_0}{\partial \mu}, \quad \text{independent of } \mathbf{q}. \tag{17.48}$$

Substituting this in (17.41) gives the Thomas-Fermi dielectric constant26

$$\epsilon(\mathbf{q}) = 1 + \frac{4\pi e^2}{q^2}\, \frac{\partial n_0}{\partial \mu}. \tag{17.49}$$

It is customary to define a Thomas-Fermi wave vector $k_0$ by:

$$k_0^2 = 4\pi e^2\, \frac{\partial n_0}{\partial \mu}, \tag{17.50}$$

so that

$$\epsilon(\mathbf{q}) = 1 + \frac{k_0^2}{q^2}. \tag{17.51}$$

25 The values of the chemical potential $\mu$ appearing in (17.44) and (17.45) will be the same under
the assumption that $\phi(\mathbf{r})$ is appreciable only in a finite region of the electron gas, outside of which the
electron density is negligibly perturbed from its equilibrium value.

26 As anticipated, this form for the dielectric constant does indeed become infinite as $q \to 0$. (See
footnote 23.)

To illustrate the significance of $k_0$, consider the case where the external potential
is that of a point charge:

$$\phi^{\rm ext}(\mathbf{r}) = \frac{Q}{r}, \qquad \phi^{\rm ext}(\mathbf{q}) = \frac{4\pi Q}{q^2}. \tag{17.52}$$

The total potential in the metal will then be

$$\phi(\mathbf{q}) = \frac{1}{\epsilon(\mathbf{q})}\, \phi^{\rm ext}(\mathbf{q}) = \frac{4\pi Q}{q^2 + k_0^2}. \tag{17.53}$$

The Fourier transform can be inverted to give (see Problem 3)

$$\phi(\mathbf{r}) = \int \frac{d\mathbf{q}}{(2\pi)^3}\, e^{i\mathbf{q} \cdot \mathbf{r}}\, \frac{4\pi Q}{q^2 + k_0^2} = \frac{Q}{r}\, e^{-k_0 r}. \tag{17.54}$$

Thus the total potential is of the coulombic form times an exponential damping
factor that reduces it to a negligible size at distances greater than order $1/k_0$. This
form is known as a screened Coulomb potential27 or (from an analogous form in
meson theory) a Yukawa potential.

We have thus extracted the anticipated result that the electrons screen out the
field of the external charge. In addition, we have an explicit expression for the
characteristic distance beyond which the disturbance is effectively screened. To estimate $k_0$
note that for a free electron gas, when $T \ll T_F$, $\partial n_0/\partial \mu$ is simply the density of levels
at the Fermi energy, $g(\varepsilon_F) = m k_F/\hbar^2 \pi^2$ (Eq. (2.64)). Therefore

$$\frac{k_0^2}{k_F^2} = \frac{4}{\pi}\, \frac{m e^2}{\hbar^2 k_F} = \frac{4}{\pi}\, \frac{1}{k_F a_0} = \left( \frac{16}{3\pi^2} \right)^{2/3} \left( \frac{r_s}{a_0} \right),$$

$$k_0 = 0.815\, k_F \left( \frac{r_s}{a_0} \right)^{1/2} = \frac{2.95}{(r_s/a_0)^{1/2}}\ \text{Å}^{-1}. \tag{17.55}$$

Since $r_s/a_0$ is about 2 to 6 at metallic densities, $k_0$ is of the order of $k_F$; i.e., disturbances
are screened in a distance which is similar to the interparticle spacing. Thus the
electrons are highly effective in shielding external charges.
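To put numbers to the Thomas-Fermi estimate, the following sketch tabulates the screening wave vector and screening length over the metallic range of r_s/a_0, and shows how strongly the Yukawa form suppresses the bare Coulomb potential at a distance r_s. It is a rough illustration only: the Bohr radius value, the free electron relation k_F r_s = (9π/4)^(1/3), and all function names are assumptions introduced here.

```python
import math

BOHR_RADIUS_ANGSTROM = 0.529177               # a_0 in angstroms
KF_RS = (9.0 * math.pi / 4.0) ** (1.0 / 3.0)  # k_F * r_s for a free electron gas


def k0_over_kF(rs):
    """Ratio k_0 / k_F = sqrt(4 / (pi k_F a_0)); rs = r_s / a_0."""
    return math.sqrt(4.0 * rs / (math.pi * KF_RS))


def k0_inverse_angstrom(rs):
    """Thomas-Fermi wave vector k_0 in inverse angstroms; rs = r_s / a_0."""
    k_F = KF_RS / (rs * BOHR_RADIUS_ANGSTROM)
    return k0_over_kF(rs) * k_F


def screened_potential(r, Q, k0):
    """Yukawa form (Q / r) exp(-k0 r) of the screened point-charge potential."""
    return Q * math.exp(-k0 * r) / r


for rs in (2.0, 3.0, 4.0, 5.0, 6.0):
    k0 = k0_inverse_angstrom(rs)
    r = rs * BOHR_RADIUS_ANGSTROM             # evaluate at the distance r_s
    suppression = screened_potential(r, 1.0, k0) / (1.0 / r)
    print(rs, round(k0_over_kF(rs), 3), round(k0, 3), round(1.0 / k0, 3), suppression)
```

The last column is the ratio of screened to bare potential at r = r_s; it is already a few percent or less across the metallic range, which is the quantitative content of the statement that disturbances are screened within about one interparticle spacing.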


27 This form appears in the theory of electrolytes as originally given by P. Debye and E. Hückel,
Phys. Z. 24, 185, 305 (1923).




LINDHARD  THEORY  OF  SCREENING 


In the Lindhard approach28 one returns to the Schrödinger equation (17.42) and
does not make a semiclassical approximation, requiring a slowly varying $\phi$. Instead,
one exploits from the outset the fact that the induced density is required only to
linear order in the total potential $\phi$. It is then a routine matter to solve (17.42) to
linear order by perturbation theory. Once one knows the electronic wave functions
to linear order in $\phi$ one can also compute the linear change in electronic charge density
via (17.6). The procedure is straightforward (Problem 5) and here we quote only the
result. Equation (17.48) of linearized Thomas-Fermi theory must be generalized to

$$\chi(\mathbf{q}) = -e^2 \int \frac{d\mathbf{k}}{4\pi^3}\, \frac{f_{\mathbf{k} - \frac{1}{2}\mathbf{q}} - f_{\mathbf{k} + \frac{1}{2}\mathbf{q}}}{\hbar^2 (\mathbf{k} \cdot \mathbf{q})/m}, \tag{17.56}$$

where $f_{\mathbf{k}}$ denotes the equilibrium Fermi function for a free electron with energy
$\hbar^2 k^2/2m$: $f_{\mathbf{k}} = 1/\{\exp [\beta (\hbar^2 k^2/2m - \mu)] + 1\}$.

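Note that when $q$ is small compared to $k_F$ the numerator of the integrand can be
expanded about its value at $q = 0$:

$$f_{\mathbf{k} \pm \frac{1}{2}\mathbf{q}} = f_{\mathbf{k}} \mp \frac{\hbar^2}{2}\, \frac{\mathbf{k} \cdot \mathbf{q}}{m}\, \frac{\partial}{\partial \mu} f_{\mathbf{k}} + O(q^2). \tag{17.57}$$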


The term linear in $q$ in this expansion gives the Thomas-Fermi result (17.48). Thus,
as expected, in the limit of a slowly varying disturbance Lindhard theory reduces
to Thomas-Fermi theory.29 However, when $q$ becomes comparable to $k_F$ there is
considerably more structure in the Lindhard dielectric constant. At $T = 0$ the
integrals in (17.56) can be performed explicitly to give

$$\chi(\mathbf{q}) = -e^2 \left( \frac{m k_F}{\hbar^2 \pi^2} \right) \left[ \frac{1}{2} + \frac{1 - x^2}{4x} \ln \left| \frac{1 + x}{1 - x} \right| \right], \qquad x = \frac{q}{2k_F}. \tag{17.58}$$

The quantity in square brackets, which is 1 at $x = 0$, is the Lindhard correction30 to
the Thomas-Fermi result. Note that at $q = 2k_F$ the dielectric constant $\epsilon = 1 - 4\pi\chi/q^2$
is not analytic. As a result it can be shown that at large distances the screened potential
$\phi$ of a point charge now has a term that goes (at $T = 0$) as:

$$\phi(\mathbf{r}) \sim \frac{1}{r^3} \cos 2k_F r. \tag{17.59}$$


Thus  the  screening  at  large  distances  has  considerably  more  structure  than  the  simple 
Yukawa  potential  predicted  by  the  Thomas-Fermi  theory,  with  a  much  more  weakly 
decaying  oscillatory  term.  Depending  on  the  context  these  oscillations  go  under 
the  name  of  Friedel  oscillations  or  Ruderman-Kittel  oscillations.  We  comment  on 
them  further  in  Chapter  26. 
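The relation between the two static theories can be made concrete by comparing their dielectric constants as functions of q. The sketch below assumes the T = 0 Lindhard result in the form of the Thomas-Fermi χ multiplied by F(q/2k_F), with F(x) = 1/2 + ((1 − x²)/4x) ln|(1 + x)/(1 − x)|, so that ε − 1 is k_0²/q² in Thomas-Fermi theory and (k_0²/q²)F(q/2k_F) in Lindhard theory. Wave vectors are measured in units of k_F, and the value k_0 = 1.63 (roughly r_s/a_0 = 4) is an illustrative choice.

```python
import math


def F(x):
    """F(x) = 1/2 + (1 - x^2)/(4x) * ln|(1 + x)/(1 - x)|, with its limits at x = 0 and x = 1."""
    x = abs(x)
    if x < 1e-8:
        return 1.0
    if abs(x - 1.0) < 1e-12:
        return 0.5
    return 0.5 + (1.0 - x * x) / (4.0 * x) * math.log(abs((1.0 + x) / (1.0 - x)))


def epsilon_thomas_fermi(q, k0):
    """Thomas-Fermi dielectric constant 1 + k0^2 / q^2."""
    return 1.0 + (k0 / q) ** 2


def epsilon_lindhard(q, k0, kF):
    """Static T = 0 Lindhard dielectric constant 1 + (k0^2 / q^2) F(q / 2 kF)."""
    return 1.0 + (k0 / q) ** 2 * F(q / (2.0 * kF))


kF = 1.0     # wave vectors in units of k_F
k0 = 1.63    # illustrative value, roughly 0.815 * sqrt(4)

for q in (0.01, 0.5, 1.0, 2.0, 3.0, 6.0):
    tf = epsilon_thomas_fermi(q, k0)
    lind = epsilon_lindhard(q, k0, kF)
    # Third column: Lindhard screening strength relative to Thomas-Fermi.
    print(q, tf, lind, (lind - 1.0) / (tf - 1.0))
```

The ratio in the last column is just F(q/2k_F): it starts at 1 for small q, where the two theories agree, falls to 1/2 at q = 2k_F, and decays at larger q, showing that Thomas-Fermi theory overestimates the screening of short-wavelength disturbances.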


28  J.  Lindhard,  Kgl.  Danske  Videnskab .  Selskab  M at.-Fys.  Medd.  28,  No.  8  (1954). 

29  Indeed,  the  Thomas-Fermi  x(q)  can  be  characterized  as  the  limit  of  the  Lindhard  /(q)  as  q  -*■  0. 

30  The  function  in  square  brackets  happens  to  be  the  function  F{x )  appearing  in  the  Hartree-Fock 
energy  (17.19)  for  free  electrons  and  plotted  in  Figure  17.1a. 
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FREQUENCY-DEPENDENT  LINDHARD  SCREENING 

If the external charge density has time dependence $e^{-i\omega t}$, then the induced potential
and charge density will also have such a time dependence, and the dielectric constant
will depend on frequency as well as on wave vector. In the limiting case, where
collisions can be ignored, the Lindhard argument can be straightforwardly generalized
by using time-dependent rather than stationary perturbation theory. One finds that
the static result (17.56) need be modified only by the addition of the quantity $\hbar\omega$ to
the denominator of the integrand.31 This more general form is of considerable
importance in the theory of lattice vibrations in metals, as well as in the theory of
superconductivity. Here we note only that when q approaches zero at fixed $\omega$, the
Lindhard dielectric constant,

$$\epsilon(\mathbf{q}, \omega) = 1 + \frac{4\pi e^2}{q^2}\int\frac{d\mathbf{k}}{4\pi^3}\,\frac{f_{\mathbf{k}-\frac{1}{2}\mathbf{q}} - f_{\mathbf{k}+\frac{1}{2}\mathbf{q}}}{\hbar^2\mathbf{k}\cdot\mathbf{q}/m + \hbar\omega},$$  (17.60)


reduces  to  the  Drude  result  (1.37),  which  we  derived  under  the  assumption  of  a 
spatially  uniform  disturbance.  Thus  the  more  sophisticated  Lindhard  approach  is 
consistent  with  more  elementary  investigations  in  the  regimes  in  which  they  are 
applicable. 
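
The small-q limit can be checked in a few lines. What follows is a sketch of one possible route, not necessarily the argument the authors have in mind. It assumes the frequency-dependent Lindhard form with denominator $\hbar^2\mathbf{k}\cdot\mathbf{q}/m + \hbar\omega$, uses the fact that the integral of $f_{\mathbf{k}}$ over $d\mathbf{k}/4\pi^3$ is the electron density n, and introduces the usual plasma frequency $\omega_p^2 = 4\pi ne^2/m$. The integration variable is shifted by $\pm\mathbf{q}/2$ in the two terms, and only the leading power of q is kept.

```latex
\begin{align*}
I(\mathbf{q},\omega)
 &\equiv \int \frac{d\mathbf{k}}{4\pi^3}\,
   \frac{f_{\mathbf{k}-\frac12\mathbf{q}} - f_{\mathbf{k}+\frac12\mathbf{q}}}
        {\hbar^2\mathbf{k}\cdot\mathbf{q}/m + \hbar\omega} \\
 &= \int \frac{d\mathbf{k}}{4\pi^3}\, f_{\mathbf{k}}
   \left[\frac{1}{a + b} - \frac{1}{a - b}\right],
   \qquad a = \hbar\omega + \frac{\hbar^2\mathbf{k}\cdot\mathbf{q}}{m},\quad
          b = \frac{\hbar^2 q^2}{2m}, \\
 &= \int \frac{d\mathbf{k}}{4\pi^3}\, f_{\mathbf{k}}\,\frac{-2b}{a^2 - b^2}
 \;\longrightarrow\; -\frac{\hbar^2 q^2/m}{(\hbar\omega)^2}\, n
 = -\frac{n q^2}{m\omega^2} \qquad (q \to 0), \\
\epsilon(\mathbf{q}\to 0,\omega)
 &= 1 + \frac{4\pi e^2}{q^2}\left(-\frac{n q^2}{m\omega^2}\right)
 = 1 - \frac{\omega_p^2}{\omega^2}, \qquad \omega_p^2 = \frac{4\pi n e^2}{m}.
\end{align*}
```

The factors of $q^2$ cancel, leaving the collisionless Drude form, which depends on frequency alone.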


SCREENING  THE  HARTREE-FOCK  APPROXIMATION 

We  have  discussed  screening  by  metallic  electrons  of  an  externally  imposed  charge 
distribution.  However,  screening  will  also  affect  the  interaction  of  two  electrons  with 
each other, since from the point of view of the remaining electrons these two can be
considered as external charges. If one returns to the Hartree-Fock equations and
takes this point of view, one can make an important improvement. One cannot
tamper with the Hartree self-consistent field term, since this is the term that gives
rise to the screening in the first place. However, it is tempting32 to replace the electron-
electron interaction that occurs in the exchange term by its screened form, multiplying
$1/(\mathbf{k} - \mathbf{k}')^2$ by the inverse dielectric constant $1/\epsilon(\mathbf{k} - \mathbf{k}')$ in (17.19). This eliminates
the singularity responsible for the anomalous divergence in the one-electron velocity
$\mathbf{v}(\mathbf{k}) = (1/\hbar)(\partial\mathcal{E}(\mathbf{k})/\partial\mathbf{k})$ at $k = k_F$, for in the neighborhood of q = 0 the screened
interaction approaches not $e^2/q^2$ but $e^2/k_0^2$. If one now calculates the value of v(k)
at  k  =  kF,  one  finds  that  for  values  of  rs  typical  of  metallic  densities  the  velocity 
differs  from  its  free  electron  value  by  only  about  5  percent.  Thus  screening  has 
characteristically  reduced  the  importance  of  electron-electron  interactions.33 


31  When  the  denominator  vanishes,  the  integral  is  made  unambiguous  by  the  requirement  that  it 
be evaluated by giving $\omega$ a vanishingly small positive imaginary part.

32  One  of  the  successes  of  the  Green's  function  approach  is  the  justification  in  a  systematic  way  of 
this  apparently  ad  hoc  introduction  of  screening  into  the  exchange  term. 

33  In  the  present  case  the  reduction  is  rather  dramatic,  from  a  divergence  to  a  minor  correction  of 
a  few  percent. 
FERMI LIQUID THEORY

We  conclude  this  chapter  with  a  brief  look  at  some  deep  and  subtle  arguments, 
primarily  due  to  Landau,34  that  (a)  explain  the  remarkable  success  of  the  independent 
electron  approximation  in  spite  of  the  strength  of  electron-electron  interactions  and 
(b)  indicate  how,  in  many  cases,  particularly  in  the  calculation  of  transport  properties, 
the  consequences  of  electron-electron  interactions  can  qualitatively  be  taken  into 
account.  Landau’s  approach  is  known  as  Fermi  liquid  theory.  It  was  designed  to 
deal  with  the  liquid  state  of  the  isotope  of  helium  of  mass  number  3,  but  is  increasingly 
being  applied  to  the  theory  of  electron-electron  interactions  in  metals.35 

We  first  observe  that  up  to  this  point  our  analysis  of  electron-electron  interactions 
has  led  to  a  substantially  modified  energy  vs.  wave  vector  relation  for  the  one-electron 
levels  (e.g.,  Eq.  (17.19)),  but  has  not  in  any  substantial  way  challenged  the  basic 
structure  of  the  independent  electron  model,  in  which  the  electronic  properties  of  a 
metal  are  viewed  as  arising  from  the  occupation  of  a  specified  set  of  one-electron 
levels.  Thus  even  in  the  Hartree-Fock  approximation  we  continue  to  describe  the 
stationary electronic states by specifying which one-electron levels $\psi_i$ are present in
the Slater determinant (17.13). The N-electron wave function therefore has exactly
the same structure as it would have for noninteracting electrons, the only modification
being that the form of the one-electron wave functions may be affected by the
interactions.36 It is far from clear that this is a sensible way to describe the stationary
states of the N-electron system. Suppose, for example, that the net electron-electron
interaction were attractive, and so much so that pairs of electrons formed bound
states.37 Then the natural way to describe the electrons in a metal would be in terms
of electron pairs. Such a metal could no more be adequately described by the stationary
states of a set of independent single electrons than a gas of oxygen molecules
could  be  described  in  terms  of  independent  oxygen  atoms. 

Even  if  nothing  as  drastic  as  pair  formation  happens,  it  is  still  far  from  obvious 
that  an  independent  electron  description,  with  suitably  modified  energies,  will  be 
anywhere  near  the  mark  in  describing  the  electrons  in  an  actual  metal.  There  is, 
however,  reason  to  expect  that  this  may  be  the  case  for  electrons  with  energies  near 
the  Fermi  energy.38  The  argument,  due  to  Landau,  can  be  divided  into  two  stages. 
The  first  is  fairly  straightforward,  but  the  second  is  very  subtle  indeed. 


34  L.  D.  Landau,  Sov.  Phys.  JETP  3,  920  (1957);  5, 101  (1957);  and  8,  70  (1959). 

35  A  thorough  and  fairly  elementary  survey  of  the  theory  of  charged  Fermi  liquids  up  to  1966 
can be found in The Theory of Quantum Liquids I, D. Pines and P. Nozieres, W. A. Benjamin, Menlo
Park,  California,  1966. 

36  For  free  electrons,  not  even  this  change  is  made.  The  wave  functions  continue  to  be  plane  waves. 

37  Something  like  this  actually  happens  in  a  superconductor.  See  Chapter  34. 

38  There  is  no  justification  for  the  independent  electron  picture  when  the  electronic  energies  are  far 
from  the  Fermi  energy,  but  happily,  as  we  have  seen  in  Chapters  2, 12,  and  13,  many  of  the  most  interesting 
electronic  properties  of  a  metal  are  almost  completely  determined  by  electrons  within  kBT  of  the  Fermi 
energy.  However,  any  property  involving  electronic  levels  well  below  or  high  above  the  Fermi  energy 
(such  as  soft  X-ray  emission,  photoelectric  emission,  or  optical  absorption)  may  very  well  be  substantially 
affected  by  electron-electron  interactions. 
FERMI LIQUID THEORY: CONSEQUENCES OF THE
EXCLUSION  PRINCIPLE  ON  ELECTRON-ELECTRON 
SCATTERING  NEAR  THE  FERMI  ENERGY 

Consider  a  set  of  noninteracting  electrons.  If  we  imagine  gradually  turning  on  the 
interactions  between  electrons,  they  will  lead  to  two  kinds  of  effects: 

1.  The  energies  of  each  one-electron  level  will  be  modified.39  This  is  the  kind  of 
effect  illustrated  by  the  Hartree-Fock  approximation  and  its  refinements.  We 
shall  return  to  it  below. 

2.  Electrons  will  be  scattered  in  and  out  of  the  single  electron  levels,  which  are  no 
longer  stationary.  This  does  not  happen  in  the  Hartree-Fock  approximation, 
where  one-electron  levels  continue  to  give  valid  stationary  states  of  the  interacting 
system.  Whether  this  scattering  is  serious  enough  to  invalidate  the  independent 
electron  picture  depends  on  how  rapid  the  rate  of  scattering  is.  If  it  is  sufficiently 
low,  we  could  introduce  a  relaxation  time  and  treat  the  scattering  in  the  same  way 
as  the  other  scattering  mechanisms  we  have  discussed  in  our  theories  of  transport 
processes.  If  it  should  happen  (and  we  shall  see  that  it  usually  does)  that  the 
electron-electron  relaxation  time  is  much  larger  than  other  relaxation  times,  then 
we  can  safely  ignore  it  altogether  and  use  the  independent  electron  model  with 
considerably  more  confidence,  subject  only  to  modifications  required  by  the 
altered  energy  vs.  k  relation.40 

One  naively  might  expect  the  electron-electron  scattering  rate  to  be  quite  high, 
since  the  Coulomb  interaction,  even  when  screened,  is  rather  strong.  However,  the 
exclusion  principle  comes  dramatically  to  the  rescue  by  reducing  the  scattering  rate 
quite  spectacularly  in  many  cases  of  major  interest.  This  reduction  occurs  when  the 
electronic  configuration  differs  only  slightly  from  its  thermal  equilibrium  form  (as  is 
the  case  in  all  of  the  transport  processes  we  have  investigated  in  Chapter  13).  To 
illustrate the effect of the exclusion principle on the scattering rate, suppose, for
example, that the N-electron state consists of a filled Fermi sphere (thermal equilibrium
at T = 0) plus a single excited electron in a level with $\mathcal{E}_1 > \mathcal{E}_F$. In order for this
electron to be scattered, it must interact with an electron of energy $\mathcal{E}_2$, which must
be less than $\mathcal{E}_F$, since only electronic levels with energies less than $\mathcal{E}_F$ are occupied.
The exclusion principle requires that these two electrons can only scatter into
unoccupied levels, whose energies $\mathcal{E}_3$ and $\mathcal{E}_4$ must therefore be greater than $\mathcal{E}_F$. Thus
we require that

$$\mathcal{E}_2 < \mathcal{E}_F, \quad \mathcal{E}_3 > \mathcal{E}_F, \quad \mathcal{E}_4 > \mathcal{E}_F.$$  (17.61)

In addition, energy conservation requires that

$$\mathcal{E}_1 + \mathcal{E}_2 = \mathcal{E}_3 + \mathcal{E}_4.$$  (17.62)
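
How restrictive the exclusion-principle conditions and energy conservation are together can be seen by brute-force counting. The sketch below is an illustration under stated assumptions, not part of the original argument: energies are measured from the Fermi energy in units of a grid step, the density of levels is taken as constant, and the function name is hypothetical. For an excited electron n grid steps above the Fermi energy, it counts the pairs (partner level below the Fermi energy, first final level above it) for which the second final level, fixed by energy conservation, also lies above the Fermi energy.

```python
def count_allowed(n):
    """Count allowed (E2, E3) pairs for an electron n grid steps above the Fermi energy.

    Energies are measured from the Fermi energy in units of the grid step, and
    levels sit at half-integer positions so none lies exactly at the Fermi energy.
    A constant density of levels is assumed.
    """
    count = 0
    for i in range(2 * n):            # partner level E2 below the Fermi energy
        e2 = -(i + 0.5)
        for j in range(2 * n):        # first final level E3 above the Fermi energy
            e3 = j + 0.5
            e4 = n + e2 - e3          # second final level from energy conservation
            if e4 > 0:                # E4 must also be unoccupied
                count += 1
    return count

if __name__ == "__main__":
    for n in (50, 100, 200):
        print(n, count_allowed(n), count_allowed(2 * n) / count_allowed(n))
```

Doubling the excitation energy roughly quadruples the count, so under these assumptions the phase space available for scattering shrinks as the square of the distance from the Fermi energy, and partner levels deeper than that distance below the Fermi energy do not contribute at all.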


39  We  defer,  for  the  moment,  the  question  of  whether  it  makes  sense  to  speak  of  “one-electron  levels” 
at  all,  when  the  interaction  is  turned  on.  (This  is,  of  course,  the  central  problem,  which  is  why  the  argument 
is  so  subtle.) 

40 And subject to the delicate change in point of view associated with the introduction of “quasiparticles”
(see below).
Using the form (17.55) for $k_0$ we can write this as

$$\frac{1}{\tau} \propto (k_BT)^2\left(\frac{\pi^2\hbar^2}{mk_F}\right)^2.$$  (17.65)
To establish the form of the proportionality constant we appeal to dimensional
analysis. We have left at our disposal only the temperature-independent quantities
characterizing a noninteracting electron gas: $k_F$, m, and $\hbar$. We can construct a quantity
with dimensions of inverse time by multiplying (17.65) by $m^3/\hbar^7$, to get

$$\frac{1}{\tau} = A\,\frac{1}{\hbar}\,\frac{(k_BT)^2}{\mathcal{E}_F}.$$  (17.66)

Since no dimensionless factor can be constructed out of $k_F$, m, and $\hbar$, (17.66) is the
only possible form. We take the dimensionless number A to be of order unity to
within a power or two of ten.

At room temperature $k_BT$ is of the order of $10^{-2}$ eV, and $\mathcal{E}_F$ is of the order of
electron volts. Therefore $(k_BT)^2/\mathcal{E}_F$ is of the order of $10^{-4}$ eV, which leads to a lifetime
$\tau$ of the order of $10^{-10}$ second. In Chapter 1 we found that typical metallic relaxation
times at room temperature were of the order of $10^{-14}$ second. We therefore conclude
that at room temperature electron-electron scattering proceeds at a rate $10^4$ times
slower than the dominant scattering mechanism. This is a sufficiently large factor to
allow for the power or two of ten error that might easily have crept into our crude
dimensional analysis; there is no doubt that at room temperature electron-electron
scattering is of little consequence in a metal. Since the electron-electron relaxation
time increases as $1/T^2$ with falling temperature, it is quite possible that it can be of
little consequence at all temperatures. It is certainly necessary to go to very low
temperatures (to eliminate thermal scattering by the ionic vibrations) in very pure
specimens (to eliminate impurity scattering) before one can hope to see effects of
electron-electron scattering, and indications are only just emerging that it may be
possible under these extreme conditions to see the characteristic $T^2$ dependence.
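
The arithmetic of this estimate can be reproduced from the dimensional form 1/τ = A (k_BT)²/(ħ ℰ_F). The sketch below is illustrative only: it assumes A = 1 and a Fermi energy of 5 eV, and the function name is hypothetical. With these particular inputs the lifetime comes out at a few times 10⁻¹² second. Since A is uncertain by a power or two of ten, this agrees with the order-of-magnitude lifetime quoted in the text only within that uncertainty, but in any case it is far longer than the 10⁻¹⁴ second typical of the dominant scattering mechanisms.

```python
HBAR_EV_S = 6.582119569e-16      # reduced Planck constant in eV s
K_B_EV_PER_K = 8.617333262e-5    # Boltzmann constant in eV / K

def ee_lifetime(temperature_k, fermi_energy_ev, prefactor=1.0):
    """Electron-electron lifetime in seconds from 1/tau = A (k_B T)^2 / (hbar E_F).

    prefactor is the unknown dimensionless number A (assumed to be 1 by default).
    """
    thermal_energy = K_B_EV_PER_K * temperature_k
    rate = prefactor * thermal_energy ** 2 / (HBAR_EV_S * fermi_energy_ev)
    return 1.0 / rate

if __name__ == "__main__":
    for temperature in (300.0, 30.0, 3.0):
        tau = ee_lifetime(temperature, fermi_energy_ev=5.0)
        print(f"T = {temperature:6.1f} K   tau_ee ~ {tau:.2e} s")
```

Each tenfold drop in temperature lengthens the lifetime a hundredfold, which is the $1/T^2$ growth described above.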

Therefore,  at  least  for  levels  within  kBT  of  the  Fermi  energy,  electron-electron 
interactions  do  not  appear  to  invalidate  the  independent  electron  picture.  However, 
there is a serious gap in this argument, which brings us to the subtle part of Landau's
theory. 


FERMI  LIQUID  THEORY:  QUASIPARTICLES 

The  above  argument  indicates  that  if  the  independent  electron  picture  is  a  good 
first  approximation,  then  at  least  for  levels  near  the  Fermi  energy,  electron-electron 
scattering  will  not  invalidate  that  picture  even  if  the  interactions  are  strong.  However, 
if  the  electron-electron  interactions  are  strong  it  is  not  at  all  likely  that  the  independent 
electron  approximation  will  be  a  good  first  approximation,  and  it  is  therefore  not 
clear  that  our  argument  has  any  relevance. 

Landau  cut  this  Gordian  knot  by  acknowledging  that  the  independent  electron 
picture  was  not  a  valid  starting  point.  He  emphasized,  however,  that  the  argument 
described  above  remains  applicable,  provided  that  an  independent  something  picture 
is  still  a  good  first  approximation.  He  christened  the  “somethings”  quasiparticles  (or 
quasielectrons).  If  the  quasiparticles  obey  the  exclusion  principle,  then  the  argument 
we  have  given  works  as  well  for  them  as  it  does  for  independent  electrons,  acquiring 
thereby  a  much  wider  validity,  provided  that  we  can  explain  what  a  quasiparticle 
might  be.  Landau’s  definition  of  a  quasiparticle  is  roughly  this: 
Suppose that as the electron-electron interactions are turned on, the states (at
least the low-lying ones) of the strongly interacting N-electron system evolve in a
continuous way from, and therefore remain in a one-to-one correspondence with,
the states of the noninteracting N-electron system. We can specify the excited states
of the noninteracting system by specifying how they differ from the ground state, i.e.,
by listing those wave vectors $\mathbf{k}_1, \mathbf{k}_2, \ldots, \mathbf{k}_n$ above $k_F$ that describe occupied levels,
and those, $\mathbf{k}_1', \mathbf{k}_2', \ldots, \mathbf{k}_m'$, below $k_F$, that describe unoccupied levels.41 We then
describe such a state by saying that m electrons have been excited out of the one-
electron levels $\mathbf{k}_1', \ldots, \mathbf{k}_m'$, and n excited electrons are present in the one-electron
levels $\mathbf{k}_1, \ldots, \mathbf{k}_n$. The energy of the excited state is just the ground state energy plus
$\mathcal{E}(\mathbf{k}_1) + \cdots + \mathcal{E}(\mathbf{k}_n) - \mathcal{E}(\mathbf{k}_1') - \cdots - \mathcal{E}(\mathbf{k}_m')$, where, for free electrons, $\mathcal{E}(\mathbf{k}) = \hbar^2k^2/2m$.
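
The bookkeeping just described is simple enough to write down directly. The following is a minimal sketch with hypothetical function names, in units where $\hbar^2/2m = 1$ and $k_F = 1$ (both assumptions made only for the example): an excited state is specified by the list of occupied wave vectors above $k_F$ and the list of emptied wave vectors below it, and its energy relative to the ground state is the sum over the first list minus the sum over the second.

```python
def free_electron_energy(k):
    """E(k) = hbar^2 k^2 / 2m in units where hbar^2 / 2m = 1; k is a 3-component tuple."""
    return sum(component * component for component in k)

def excitation_energy(occupied_above, emptied_below, energy=free_electron_energy):
    """Energy of an excited state relative to the ground state.

    occupied_above: wave vectors above k_F that are occupied.
    emptied_below:  wave vectors below k_F that have been vacated.
    energy:         the E vs. k relation (free-electron form by default).
    """
    gained = sum(energy(k) for k in occupied_above)
    lost = sum(energy(k) for k in emptied_below)
    return gained - lost

if __name__ == "__main__":
    # One electron moved from just below to just above the Fermi surface (k_F = 1 assumed).
    print(excitation_energy([(1.1, 0.0, 0.0)], [(0.9, 0.0, 0.0)]))
```

Passing a different `energy` function changes the dispersion relation while leaving the labeling of states by wave vectors untouched.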

We now define quasiparticles implicitly, by asserting that the corresponding state
of the interacting system is one in which m quasiparticles have been excited out of
levels with wave vectors $\mathbf{k}_1', \ldots, \mathbf{k}_m'$ and n excited quasiparticles are present in levels
with wave vectors $\mathbf{k}_1, \ldots, \mathbf{k}_n$. We say that the energy of the state is the ground-state
energy plus $\mathcal{E}(\mathbf{k}_1) + \cdots + \mathcal{E}(\mathbf{k}_n) - \mathcal{E}(\mathbf{k}_1') - \cdots - \mathcal{E}(\mathbf{k}_m')$, where the quasiparticle $\mathcal{E}$
vs. k relation is, in general, quite difficult to determine.

It  is  not  clear,  of  course,  whether  this  is  a  consistent  thing  to  do,  for  it  implies 
that  the  excitation  spectrum  for  the  interacting  system,  though  numerically  different 
from that of the free system, nevertheless has a free electron type of structure. However,
we can now return to the argument of the preceding section and point out that this is
at  least  a  consistent  possibility,  for  if  the  spectrum  does  have  a  structure  like  the  free 
electron  spectrum,  then  because  of  the  exclusion  principle  quasiparticle-quasiparticle 
interactions  will  not  drastically  alter  that  structure,  at  least  for  quasiparticles  near 
the  Fermi  surface. 

This  glimmering  of  an  idea  is  a  long  way  from  a  coherent  theory.  In  particular, 
we  must  reexamine  the  rules  for  constructing  quantities  like  electric  and  thermal 
currents  from  the  distribution  function,  once  we  acknowledge  that  it  is  describing 
not  electrons,  but  quasiparticles.  Remarkably,  these  rules  turn  out  to  be  very  similar 
(but  not  identical)  to  what  we  would  do  if  we  were,  in  fact,  dealing  with  electrons 
and not quasiparticles. We cannot hope to give an adequate account of this extraordinary
subject here, and must refer the reader to the papers of Landau34 and the
book  by  Pines  and  Nozieres35  for  a  fuller  description. 

The  term  “normal  Fermi  system”  is  used  to  refer  to  those  systems  of  interacting 
particles  obeying  Fermi-Dirac  statistics,  for  which  the  quasiparticle  representation 
is  valid.  It  can  be  shown  by  a  difficult  and  ingenious  argument  of  Landau’s  based 
on Green’s function methods, that to all orders of perturbation theory (in the interaction)
every interacting Fermi system is normal. This does not mean, however, that
all electronic systems in metals are normal, for it is now well known that the superconducting
ground state, as well as several kinds of magnetically ordered ground
states,  cannot  be  constructed  in  a  perturbative  way  from  the  free  electron  ground 


41 Note that if we are comparing the N-electron excited state to an N-electron ground state, then
n and m must be the same. They need not be the same if we are comparing the excited state of the
N-electron system to an N'-electron ground state. Note also that although we use a language appropriate
to free electrons in describing the occupancy of levels, we could make the same points for a Fermi surface
of general shape.
state. We can therefore only say that if a Fermi system is not normal, it is probably
doing  something  else  quite  interesting  and  dramatic  in  its  own  right. 

FERMI LIQUID THEORY: THE f-FUNCTION

Finally,  assuming  we  are  dealing  with  a  normal  Fermi  system,  we  comment  briefly 
on  the  remaining  effects  of  electron-electron  interactions  on  the  electronic  behavior. 
If a quasiparticle picture is valid, then the primary effect of electron-electron interactions
is simply to alter the excitation energies $\mathcal{E}(\mathbf{k})$ from their free electron values.
Landau pointed out that this has an important implication for the structure of
transport theories. When electric or thermal currents are carried in a metal, the
electronic distribution function g(k) will differ from its equilibrium form f(k). For
truly independent electrons this has no bearing on the form of the $\mathcal{E}$ vs. k relation,
but since the quasiparticle energy is a consequence of electron-electron interactions,
it may well be altered when the configuration of the other electrons is changed.
Landau noted that if the distribution function differed from its equilibrium form by
$\delta n(\mathbf{k}) = g(\mathbf{k}) - f(\mathbf{k})$, then in a linearized theory42 this would imply a change in the
quasiparticle energy of the form43

$$\delta\mathcal{E}(\mathbf{k}) = \frac{1}{V}\sum_{\mathbf{k}'} f(\mathbf{k}, \mathbf{k}')\,\delta n(\mathbf{k}').$$  (17.67)

This is precisely the state of affairs prevailing in Hartree-Fock theory, where f(k, k')
has the explicit form $4\pi e^2/(\mathbf{k} - \mathbf{k}')^2$. In a more accurate screened Hartree-Fock
theory, f would have the form $4\pi e^2/[(\mathbf{k} - \mathbf{k}')^2 + k_0^2]$. In general neither of these
approximate forms is correct, and the exact f-function is difficult to compute. Nevertheless,
the existence of the relation (17.67) must be allowed for in a correct transport
theory. It is beyond the scope of this book to carry out such a program. However,
one of its most important consequences is that for time-independent processes the
f-function drops completely out of the transport theory, and electron-electron interactions
are of importance only insofar as they affect the scattering rate. This means, in
particular, that stationary processes in a magnetic field at high $\omega_c\tau$ will be completely
unaffected  by  electron-electron  interactions  and  correctly  given  by  the  independent 
electron  theory.  These  are  precisely  the  processes  that  give  valuable  and  extensive 
information  about  the  Fermi  surface,  so  that  a  major  stumbling  block  to  one’s  faith 
in  the  absolute  validity  of  that  information  can  be  removed. 

Although the f-function is beyond reliable computational techniques, one can try
to deduce how its mere existence should affect various frequency-dependent transport
properties. In most cases the effects appear to be small, and quite difficult to disentangle
from band structure effects. However, attempts have recently been made to
measure properties that do depend in a critical way on the f-function, in an effort
to  extract  its  values  from  experiments.44 


42  Such  as  almost  all  the  transport  theories  used  in  practice. 

43  It  is  conventional  to  exclude  from  (17.67)  the  contribution  to  the  change  in  energy  associated  with 
the  macroscopic  electromagnetic  field  produced  by  the  currents  or  charge  densities  associated  with  the 
deviation from equilibrium; i.e., the f-function describes the exchange and correlation effects. Self-consistent
field  effects  are  explicitly  dealt  with  separately,  in  the  usual  way. 

44 See, for example, P. M. Platzman, W. M. Walsh, Jr., and E-Ni Foo, Phys. Rev. 172, 689 (1968).
FERMI LIQUID THEORY: CONCLUDING RULES OF THUMB

In  summary,  the  independent  electron  picture  is  quite  likely  to  be  valid: 

1. Provided that we are dealing only with electrons within $k_BT$ of $\mathcal{E}_F$.

2. Provided that we remember, when pressed, that we are not describing simple
electrons anymore, but quasiparticles.

3. Provided that we allow for the effects of interaction on the $\mathcal{E}$ vs. k relation.

4. Provided that we allow for the possibility of an f-function in our transport
theories.

PROBLEMS 


1.  Derivation  of  the  Hartree  Equations  from  the  Variational  Principle 

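(a) Show that the expectation value of the Hamiltonian (17.2) in a state of the form (17.10) is45

$$\langle H\rangle = \sum_i\int d\mathbf{r}\,\psi_i^*(\mathbf{r})\left(-\frac{\hbar^2}{2m}\nabla^2 + U^{\mathrm{ion}}(\mathbf{r})\right)\psi_i(\mathbf{r}) + \frac{1}{2}\sum_{i\neq j}\int d\mathbf{r}\,d\mathbf{r}'\,\frac{e^2}{|\mathbf{r} - \mathbf{r}'|}\,|\psi_i(\mathbf{r})|^2\,|\psi_j(\mathbf{r}')|^2,$$  (17.68)

provided that all the $\psi_i$ satisfy the normalization condition $\int d\mathbf{r}\,|\psi_i|^2 = 1$.

(b) Expressing the constraint of normalization for each $\psi_i$ with a Lagrange multiplier $\mathcal{E}_i$, and
taking $\delta\psi_i$ and $\delta\psi_i^*$ as independent variations, show that the stationary condition

$$\delta_i\langle H\rangle = 0$$  (17.69)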


leads  directly  to  the  Hartree  equations  (17.7). 


2.  Derivation  of  the  Hartree-Fock  Equations  from  the  Variational  Principle 

(a)  Show  that  the  expectation  value  of  the  Hamiltonian  (17.2)  in  a  state  of  the  form  (17.13) 
is  given  by  (17.14). 

(b)  Show  that  when  applied  to  Eq.  (17.14)  the  procedure  described  in  Problem  1(b)  now  leads 
to  the  Hartree-Fock  equations  (17.15). 


3.  Properties  of  the  Coulomb  and  Screened  Coulomb  Potentials 

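(a) From the integral representation of the delta function,

$$\delta(\mathbf{r}) = \int\frac{d\mathbf{k}}{(2\pi)^3}\,e^{i\mathbf{k}\cdot\mathbf{r}},$$  (17.70)

and the fact that the Coulomb potential $\phi(\mathbf{r}) = -e/r$ satisfies Poisson’s equation,

$$-\nabla^2\phi(\mathbf{r}) = -4\pi e\,\delta(\mathbf{r}),$$  (17.71)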


45 Note the restriction (somewhat pedantic when large numbers of levels appear in the sum) $i \neq j$.
Such a restriction is not present in the more general Hartree-Fock energy of Eq. (17.14) because there the
direct and exchange terms for i = j cancel identically.
argue that the electron-electron pair potential, $V(\mathbf{r}) = -e\phi(\mathbf{r}) = e^2/r$, can be written in the form

$$V(\mathbf{r}) = \int\frac{d\mathbf{k}}{(2\pi)^3}\,e^{i\mathbf{k}\cdot\mathbf{r}}\,V(\mathbf{k}),$$  (17.72)

where the Fourier transform V(k) is given by

$$V(\mathbf{k}) = \frac{4\pi e^2}{k^2}.$$  (17.73)

(b) Show that the Fourier transform of the screened Coulomb interaction $V_s(\mathbf{r}) = (e^2/r)e^{-k_0r}$
is

$$V_s(\mathbf{k}) = \frac{4\pi e^2}{k^2 + k_0^2},$$  (17.74)

by substituting (17.74) into the Fourier integral

$$V_s(\mathbf{r}) = \int\frac{d\mathbf{k}}{(2\pi)^3}\,e^{i\mathbf{k}\cdot\mathbf{r}}\,V_s(\mathbf{k}),$$  (17.75)

and evaluating that integral in spherical coordinates. (The radial integral is best done as a contour
integral.)

(c) Deduce from (17.74) that $V_s(\mathbf{r})$ satisfies

$$(-\nabla^2 + k_0^2)V_s(\mathbf{r}) = 4\pi e^2\delta(\mathbf{r}).$$  (17.76)


4.  Hartree-Fock  Effective  Mass  near  k  =  0 

Show that near the band minimum (k = 0) the Hartree-Fock one-electron energy (17.19) is
parabolic in k:

$$\mathcal{E}(\mathbf{k}) \approx \frac{\hbar^2k^2}{2m^*},$$  (17.77)

where

$$\frac{m^*}{m} = \frac{1}{1 + 0.22\,(r_s/a_0)}.$$  (17.78)


5.  Calculation  of  the  Lindhard  Response  Function 
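Using the formula of first-order stationary perturbation theory,

$$\psi_{\mathbf{k}} = \psi_{\mathbf{k}}^0 + \sum_{\mathbf{k}'}\frac{1}{\mathcal{E}_{\mathbf{k}} - \mathcal{E}_{\mathbf{k}'}}\,(\psi_{\mathbf{k}'}^0, V\psi_{\mathbf{k}}^0)\,\psi_{\mathbf{k}'}^0,$$  (17.79)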


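and expressing the charge density as

$$\rho(\mathbf{r}) = -e\sum_{\mathbf{k}} f_{\mathbf{k}}\,|\psi_{\mathbf{k}}(\mathbf{r})|^2 = \rho^0(\mathbf{r}) + \rho^{\mathrm{ind}}(\mathbf{r}),$$  (17.80)

(where $f_{\mathbf{k}}$ is the equilibrium Fermi distribution), show that the Fourier transform of the charge
induced to first order in a total potential $\phi$ is given by

$$\rho^{\mathrm{ind}}(\mathbf{q}) = -e^2\int\frac{d\mathbf{k}}{4\pi^3}\,\frac{f_{\mathbf{k}-\frac{1}{2}\mathbf{q}} - f_{\mathbf{k}+\frac{1}{2}\mathbf{q}}}{\hbar^2(\mathbf{k}\cdot\mathbf{q}/m)}\,\phi(\mathbf{q}).$$  (17.81)

(Equation (17.56) then follows from the definition (17.37) of $\chi(\mathbf{q})$.)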
Because we have been interested primarily in bulk properties, we have ignored
surfaces, working with the idealized model of an infinitely extended solid.1 Our
justification for this was that of the $10^{24}$ atoms in a macroscopic crystal of typically
$10^8$ atoms on a side, only about one in $10^8$ reside near the surface.
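
The order of magnitude of this fraction is easy to check. The sketch below is a deliberately simple illustration with a hypothetical function name: it assumes a simple cubic block of atoms and counts as "surface" only the outermost atomic layer, neither of which is specified in the text.

```python
def surface_fraction(atoms_per_side):
    """Fraction of atoms in the outermost layer of a cubic block.

    Assumes a simple cubic arrangement with atoms_per_side atoms along each edge;
    the interior is the block with one layer stripped from every face.
    """
    total = atoms_per_side ** 3
    interior = (atoms_per_side - 2) ** 3
    return (total - interior) / total

if __name__ == "__main__":
    for n in (10 ** 2, 10 ** 5, 10 ** 8):
        print(f"{n:.0e} atoms per side: surface fraction ~ {surface_fraction(n):.1e}")
```

For large blocks the fraction approaches 6 divided by the number of atoms per side, so a crystal $10^8$ atoms on a side has a few atoms in $10^8$ at its surface.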

In limiting ourselves to bulk properties we are neglecting the increasingly important
field of surface physics, which deals with phenomena such as catalysis or crystal
growth,  which  are  entirely  determined  by  the  interaction  of  surface  atoms  with  atoms 
impinging  on  the  crystal.  Since  the  microscopic  structure  of  the  surfaces  of  most 
specimens  tends  to  be  irregular  and  difficult  to  ascertain,  the  field  of  surface  physics 
is  quite  complex,  with  nothing  like  the  variety  of  simple  and  experimentally  verifiable 
models  available  in  the  physics  of  bulk  solids.  We  shall  have  nothing  to  say,  even  in 
this  chapter,  about  such  surface  phenomena,  limiting  ourselves  to  a  description  of 
some  of  the  important  tools  for  determining  surface  structure. 

However,  even  if  we  are  interested  only  in  bulk  properties,  we  are  still  forced  to 
deal  with  the  surface  whenever  a  measurement  (e.g.,  the  application  of  a  voltmeter) 
removes  an  electron  from  the  solid.  The  energy  required  to  extract  an  electron, 
although  it  originates  deep  in  the  interior,  is  determined  by  surface  as  well  as  bulk 
conditions.  This  is  because  there  are  distortions  in  the  electronic  charge  distribution 
near  the  surface  that,  because  of  the  long  range  of  the  Coulomb  potential,  affect  the 
energies  of  levels  far  inside.  Such  effects  are  crucial  to  the  understanding  of  contact 
potentials  (see  below),  thermionic  emission  (the  boiling  of  electrons  out  of  a  metal 
at  high  temperatures),  the  photoelectric  effect  (the  ejection  of  electrons  by  incident 
photons),  or  any  other  phenomena  in  which  electrons  are  removed  from  a  solid  or 
pass  from  one  solid  to  another. 

In  describing  such  phenomena  a  crucial  role  is  played  by  the  work  function,  defined 
as  the  minimum  energy  required  to  remove  an  electron  from  the  interior  of  a  solid 
to  a  position  just  outside.  “Just  outside”  means  a  distance  from  the  surface  that  is 
large  on  the  atomic  scale,  but  small  compared  with  the  linear  dimensions  of  the 
crystal  and  will  be  given  a  more  detailed  specification  in  the  discussion  below. 


EFFECT  OF  THE  SURFACE  ON  THE  BINDING  ENERGY  OF  AN 
ELECTRON:  THE  WORK  FUNCTION 

To  illustrate  how  the  surface  affects  the  energy  required  to  remove  an  electron,  let 
us compare the periodic potential $U^{\mathrm{inf}}(\mathbf{r})$ of an idealized infinite crystal, with the
potential $U^{\mathrm{fin}}(\mathbf{r})$ appearing in the one-electron Schrödinger equation for a finite
specimen of the same substance. For simplicity we consider only crystals from the
cubic system that possess inversion symmetry. In the infinite (or periodically extended)
crystal we can represent $U^{\mathrm{inf}}$ as a sum of contributions from primitive Wigner-Seitz
cells about each lattice point:

$$U^{\mathrm{inf}}(\mathbf{r}) = \sum_{\mathbf{R}} v(\mathbf{r} - \mathbf{R}),$$  (18.1)


1 For mathematical convenience, we have also generally replaced the infinite solid with the periodically
repeated solid portrayed by the Born-von Karman boundary conditions.
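where

$$v(\mathbf{r}) = -e\int_C d\mathbf{r}'\,\rho(\mathbf{r}')\,\frac{1}{|\mathbf{r} - \mathbf{r}'|}.$$  (18.2)

The integration in (18.2) is over a Wigner-Seitz cell C centered at the origin; $\rho(\mathbf{r})$ is the
total charge density, electronic and ionic.2

At distances from the cell that are large compared with its dimensions we can
make the multipole expansion of electrostatics, writing

$$\frac{1}{|\mathbf{r} - \mathbf{r}'|} = \frac{1}{r} + \frac{\mathbf{r}'\cdot\hat{\mathbf{r}}}{r^2} + \frac{3(\mathbf{r}'\cdot\hat{\mathbf{r}})^2 - r'^2}{2r^3} + \frac{1}{r}\,O\!\left(\frac{r'}{r}\right)^3,$$  (18.3)

to find that

$$v(\mathbf{r}) = -e\frac{Q}{r} - e\frac{\mathbf{p}\cdot\hat{\mathbf{r}}}{r^2} + O\!\left(\frac{1}{r^3}\right),$$  (18.4)

where

$$Q = \int_C d\mathbf{r}'\,\rho(\mathbf{r}')$$  (18.5)

is the total charge of the cell, and

$$\mathbf{p} = \int_C d\mathbf{r}'\,\mathbf{r}'\,\rho(\mathbf{r}')$$  (18.6)

is its total dipole moment.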

Since the crystal is electrically neutral and $\rho(\mathbf{r})$ has the periodicity of the Bravais
lattice, each primitive cell must be electrically neutral: $Q = 0$. Furthermore, in a
crystal with inversion symmetry the contribution of the Wigner-Seitz cell to the dipole
moment vanishes. As a consequence of cubic symmetry the coefficient of the $1/r^3$
term (the quadrupole potential) will also vanish,3 and since inversion symmetry also
requires the coefficient of $1/r^4$ to vanish, we can conclude that the contribution of a
Wigner-Seitz cell to $v(\mathbf{r})$ decreases as $1/r^5$ (i.e., quite rapidly) at large distances from
the cell.
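The symmetry argument above can be checked numerically on a toy charge distribution. The sketch below assumes a hypothetical cell built from point charges (a central charge plus six equal charges along the cubic axes), chosen only because it is neutral and has cubic and inversion symmetry; it is not a model of any real crystal. The function name and the stretched "tetragonal" comparison are likewise illustrative.

```python
# Toy check: a neutral point-charge cluster with cubic symmetry has zero total
# charge, zero dipole moment, and zero traceless quadrupole tensor.
# The cluster is an illustrative assumption, not taken from the text.

def multipole_moments(charges):
    """charges: list of (q, (x, y, z)). Returns (Q, p, quad)."""
    Q = sum(q for q, _ in charges)
    p = [sum(q * r[i] for q, r in charges) for i in range(3)]
    quad = [[0.0] * 3 for _ in range(3)]
    for q, r in charges:
        r2 = sum(c * c for c in r)
        for i in range(3):
            for j in range(3):
                delta = 1.0 if i == j else 0.0
                # Integrand of the quadrupole coefficient: 3 r_i r_j - r^2 delta_ij
                quad[i][j] += q * (3.0 * r[i] * r[j] - r2 * delta)
    return Q, p, quad

a = 1.0  # arbitrary length unit
cubic_cell = [(6.0, (0.0, 0.0, 0.0))] + [
    (-1.0, tuple(s * a if k == axis else 0.0 for k in range(3)))
    for axis in range(3)
    for s in (+1, -1)
]
Q, p, quad = multipole_moments(cubic_cell)

# Stretching the cluster along z destroys cubic symmetry (inversion symmetry
# survives), and a quadrupole moment appears.
tetragonal_cell = [(q, (x, y, 1.5 * z)) for q, (x, y, z) in cubic_cell]
_, _, quad_tet = multipole_moments(tetragonal_cell)
```

For the cubic cluster every entry of `quad` vanishes, while the stretched cluster keeps a zero dipole moment but acquires a nonzero `quad_tet`, which is why the non-cubic case requires the extra care mentioned in footnote 3.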


2 The one-electron Schrödinger equation we have in mind is therefore the self-consistent Hartree
equation discussed in Chapters 11 and 17.

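3 This follows from the fact that $\int_C d\mathbf{r}'\, r_i' r_j'\, \rho(\mathbf{r}')$ must vanish when $i \neq j$, and must equal its average
value, $\frac{1}{3}\int_C d\mathbf{r}'\, r'^2 \rho(\mathbf{r}')$, when $i = j$. Consequently the first term in

$$\int_C d\mathbf{r}'\, \left[3(\mathbf{r}' \cdot \hat{\mathbf{r}})^2 - r'^2\right] \rho(\mathbf{r}')$$

must cancel the second.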

If  the  crystal  does  not  have  cubic  symmetry  our  general  conclusions  are  unaffected,  but  considerable 
care  must  be  exercised  in  dealing  with  the  quadrupole  term.  A  1/r3  dependence  by  itself  does  not  diminish 
quite  rapidly  enough  with  distance  to  insure  that  remote  cells  are  without  mutual  influence,  and  the  angular 
dependence  of  the  quadrupole  potential  must  also  be  taken  into  account.  This  makes  the  discussion  much 
more  technical,  and  is  not,  for  our  purposes,  worth  the  trouble. 
Thus the contribution to $U^{\text{inf}}(\mathbf{r})$ from cells far (on the atomic scale) from the point
$\mathbf{r}$ is negligibly small, and $U^{\text{inf}}(\mathbf{r})$ is very well approximated by the contribution from
cells within a few lattice constants of $\mathbf{r}$.

Now consider a finite crystal. Suppose that we could represent the ionic configuration
by simply occupying some finite region V of the Bravais lattice occupied
in the infinite crystal. Suppose, furthermore, that the electronic charge density in the
Wigner-Seitz cell about each ion remained completely undistorted from the form it
takes in the infinite crystal, even in cells near the surface (Figure 18.1a). Under these
assumptions each occupied cell would continue to contribute $v(\mathbf{r} - \mathbf{R})$ to the potential,
and we would have:

$$U^{\text{fin}}(\mathbf{r}) = \sum_{\mathbf{R} \text{ in } V} v(\mathbf{r} - \mathbf{R}). \tag{18.7}$$


Figure  18.1 

(a) The electric charge density near the surface of a finite crystal if there
were no distortion in cells near the surface. The density is plotted along
a line of ions. Vertical dashed lines indicate cell boundaries. (b) The
form of the crystal potential U (or the electrostatic potential $\phi = -U/e$)
determined by the charge density in (a), along the same line.
Far from the crystal U and $\phi$ drop to zero. The (negative) Fermi energy
is indicated on the vertical axis. The shading below the Fermi energy is
meant to suggest the filled electronic levels in the metal. Since the lowest
electronic levels outside the metal have zero energy, an energy
$W = -\mathcal{E}_F$ must be supplied to remove an electron.

If (18.7) were correct, then at points $\mathbf{r}$ far inside the crystal on the atomic scale, $U^{\text{fin}}$
would differ from $U^{\text{inf}}$ only because the finite crystal lacked cells at sites $\mathbf{R}$ far from
$\mathbf{r}$. Since such cells make a negligible contribution to the potential at $\mathbf{r}$, $U^{\text{fin}}(\mathbf{r})$ would
be indistinguishable from $U^{\text{inf}}(\mathbf{r})$ when $\mathbf{r}$ was more than a few lattice constants inside
the crystal. Furthermore, when $\mathbf{r}$ was more than a few lattice constants outside of the
crystal, $U^{\text{fin}}(\mathbf{r})$ would be vanishingly small, because of the rapidly diminishing inverse
fifth-power dependence of the contribution to $U^{\text{fin}}$ from each occupied cell of the
cubic crystal (Figure 18.1b).

As a result, the energy of the highest occupied electronic level well within the
crystal would still be $\mathcal{E}_F$, where $\mathcal{E}_F$ is the Fermi energy calculated for the ideal infinite
crystal with the periodic potential $U^{\text{inf}}$. Furthermore, the lowest energy of an electronic
level outside of the crystal would be zero (since $U^{\text{fin}}$ would approach zero outside of
the crystal, and the kinetic energy of a free electron can be made arbitrarily small).
Therefore if there were no distortion in the charge distribution in surface cells, the
minimum energy required to remove an electron from the interior of the crystal to
a point just outside the crystal would be4

$$W = 0 - \mathcal{E}_F = -\mathcal{E}_F. \tag{18.8}$$

This  conclusion  is  incorrect.  The  actual  charge  distribution  in  cells  near  the  surface 
of  a  finite  crystal  does  differ  from  the  charge  distribution  of  cells  in  the  interior.  For 
one  thing  the  positions  of  the  surface  ions  will,  in  general,  be  slightly  displaced  from 
their  ideal  Bravais  lattice  positions.  Furthermore  the  electronic  charge  distribution 
in  cells  near  the  surface  need  not  have  the  symmetry  of  the  Bravais  lattice  (Figure 
18.2a).  Such  cells  will,  in  general,  have  a  nonvanishing  electric  dipole  moment  and 
may  even  yield  a  nonvanishing  net  electrical  surface  charge. 

The  particular  way  in  which  the  charge  distribution  in  cells  near  the  surface  differs 
from  that  in  the  bulk  depends  on  such  details  as  whether  the  surface  is  plane  or  rough, 
and,  if  plane,  on  the  orientation  of  the  plane  with  respect  to  the  crystallographic  axes. 
Determining  this  distorted  charge  distribution  for  various  types  of  surfaces  is  a 
difficult  problem  in  surface  physics  that  we  shall  not  explore.  Our  primary  interest 
is  in  the  consequences  of  such  a  distortion. 

We  first  consider  the  case  in  which  the  distortion  of  the  surface  cells  does  not 
result  in  a  net  macroscopic  charge  per  unit  area  of  metallic  surface.  If  we  require  the 
metal  as  a  whole  to  be  electrically  neutral,  this  will  be  the  case  if  all  its  surfaces  have 
equivalent  structures,  either  because  they  are  crystallographically  equivalent  planes 
or,  if  rough,  because  they  have  been  prepared  by  identical  processes.  At  distances  far 
(on  the  atomic  scale)  from  such  an  electrically  neutral  surface,  the  charge  distributions 
of  the  individual  distorted  surface  cells  will  continue  to  yield  no  net  macroscopic 
electric  field.5  However,  within  the  surface  layer  in  which  the  cells  are  distorted,  this 
distortion  will  give  rise  to  appreciable  electric  fields,  against  which  an  amount  of 
work $W_s = \int e\mathbf{E} \cdot d\boldsymbol{\ell}$ must be performed in moving an electron through the layer.

The  value  of  Ws  is  determined  by  the  manner  in  which  the  charge  distribution  in 
the  surface  cells  differs  from  that  of  the  bulk,  which,  in  turn,  depends  on  the  type  of 
surface  being  considered.  In  some  models5  the  distortion  in  the  charge  of  surface  cells 
is  represented  as  a  uniform  macroscopic  surface  density  of  dipoles,  and  with  this  model 


4 Since electrons are bound to the metal, work must be done to extract them, so $\mathcal{E}_F$ must be negative.
This is not hard to reconcile with the convention of free electron theory that $\mathcal{E}_F = \hbar^2 k_F^2/2m$. The point
is that in theories of bulk properties that use the model of an infinite metal, there is no reason to consider
any particular value for the arbitrary additive constant in the electronic energy; we implicitly committed
ourselves to a particular choice for that constant by assigning the energy zero to the lowest electronic
level. With that convention, the potential energy for an electron outside of the crystal must be large and
positive (larger, in fact, than $\mathcal{E}_F$) if electrons are indeed to be bound to the metal. In the present discussion,
however, we adopted the familiar electrostatic convention of taking the potential to be zero at large
distances from a finite metallic specimen. To be consistent with this alternative convention, it is necessary
to add a large negative constant to the energy of each electronic level within the metal. One can regard
that negative constant as providing a crude representation of the attractive potential of the lattice of ions.
The value of that constant has no effect on the determination of bulk properties, but in comparing energies
inside and outside of the metal one must either introduce such a term explicitly, or abandon the convention
that the potential is zero far from the metal.

5 See, for example, Problem 1a.
Figure 18.2

(a) The actual form of the electric charge density near the surface
of a crystal (neglecting possible slight displacements of the ions
near the surface from their sites in the infinite crystal). Note the
electron deficiency in the two cells nearest the surface and the
presence of electronic charge in the first “cell” on the vacuum
side of the surface. It is this kind of distortion that produces the
“double layer” described below.

(b) The form of the crystal potential U determined by the charge
density in (a). If the additive constant is chosen so that U
resembles the potential of Fig. 18.1b far inside the crystal, then
outside of the crystal U will not approach zero, but the value
$W_s$ equal to the work that must be done to carry an electron
through the electric field in the double layer. The lowest levels
outside the crystal now have an energy $W_s$, and therefore an
energy $W = -\mathcal{E}_F + W_s$ must be supplied to remove an electron.

in  mind,  the  surface  layer  (quite  generally)  is  often  referred  to  as  the  “double  layer.” 

The  work  Ws  done  by  the  field  prevailing  in  the  double  layer  must  be  added  to  the 
expression  (18.8)  that  gives  the  work  function  when  the  distortion  of  the  surface  cells 
is  ignored.  The  correct  work  function  is  therefore  given  by6 

$$W = -\mathcal{E}_F + W_s. \tag{18.9}$$

The corresponding form for the crystal potential $U(\mathbf{r})$ is shown in Fig. 18.2b.
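To get a feeling for the size of the double-layer term, one can model the layer very crudely as two opposite sheets of charge, for which elementary electrostatics gives a potential step equal to (surface charge density) x (separation) / $\epsilon_0$ in SI units. The sketch below makes that assumption; the sheet density, the separation, and the bulk Fermi energy are hypothetical inputs chosen only to show the order of magnitude, not values taken from the text.

```python
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m
E_CHARGE = 1.602176634e-19   # elementary charge, C

def double_layer_step_eV(sigma_e_per_A2, separation_A):
    """Magnitude of W_s (eV) for two opposite charge sheets (crude double layer).

    sigma_e_per_A2: sheet charge density in electrons per square angstrom
    separation_A:   distance between the sheets in angstroms
    """
    sigma = sigma_e_per_A2 * E_CHARGE / 1.0e-20   # C/m^2
    d = separation_A * 1.0e-10                    # m
    delta_phi = sigma * d / EPS0                  # potential step in volts
    return delta_phi  # an electron crossing it changes energy by this many eV

def work_function_eV(fermi_energy_eV, w_s_eV):
    """Eq. (18.9): W = -E_F + W_s, with E_F negative on the text's convention."""
    return -fermi_energy_eV + w_s_eV

# Hypothetical numbers: 0.01 electron per square angstrom displaced by 1 angstrom.
w_s = double_layer_step_eV(0.01, 1.0)
# Hypothetical bulk Fermi energy of -3.0 eV relative to the vacuum zero.
w_total = work_function_eV(-3.0, w_s)
```

With these inputs `w_s` comes out somewhat under 2 eV, so even a small rearrangement of charge in the surface cells shifts the work function by an amount comparable to the work function itself.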

If  the  crystal  faces  are  not  equivalent,  then  there  is  nothing  to  prohibit  a  net 
macroscopic  surface  charge  developing  in  each  face  in  addition  to  the  double  layers, 
provided  the  total  charge  on  all  surfaces  of  the  crystal  vanishes.  Indeed,  it  is  easy  to 
see  that  small  but  nonvanishing  surface  charges  must  arise  for  the  following  reason: 

Consider a crystal with two inequivalent faces F and F'. Their work functions $W$
and $W'$ need not be the same, since the contributions $W_s$ and $W_s'$ to the work functions

6 Like Eq. (18.8), Eq. (18.9) assumes that $\mathcal{E}_F$ has been calculated for the infinite crystal with a particular
choice for the additive constant in the periodic potential, namely the one that insures that for a finite
crystal where distortion of the surface cell charge distributions is not taken into account, U vanishes at
large distances from the crystal.
will arise from double layers of different internal structures. Consider now extracting
an electron at the Fermi level through face F, and then bringing it back in through
face F' to an interior level that is again at the Fermi energy (Figure 18.3). The total
work done in such a cycle must vanish, if energy is to be conserved. However, the
work done in extracting and reintroducing the electron is $W - W'$, which need not
vanish if the surfaces are not equivalent. There must therefore be an electric field
outside of the metal against which a compensating amount of work is done as the
electron is carried from face F to face F'; i.e., the two faces of the crystal must be at
different electrostatic potentials $\phi$ and $\phi'$ satisfying

$$-e(\phi - \phi') = W - W'. \tag{18.10}$$


Figure  18.3 

Zero  total  work  is  done  in  taking  an  electron  from 
an  interior  level  at  the  Fermi  energy  over  the  path 
shown,  returning  it  at  the  end  to  an  interior  level  at 
the  Fermi  energy.  That  work,  however,  is  the  sum 
of three contributions: $W$ (in going from 1 to 2),
$e(\phi - \phi')$ (in going from 2 to 3, where $\phi$ and $\phi'$ are
the electrostatic potentials just outside faces F
and F'), and $-W'$ (in going from 3 back to 1).




Since  the  double  layers  cannot  yield  macroscopic  fields  outside  of  the  metal  these 
fields  must  arise  from  net  macroscopic  distributions  of  electric  charge  on  the  surfaces.7 8 
The  quantity  of  charge  that  must  be  redistributed  among  the  surfaces  to  produce  such 
exterior  fields  is  minute,  compared  with  the  amount  of  charge  that  is  redistributed 
among  neighboring  surface  cells  in  establishing  the  double  layer.  Correspondingly, 
the  electric  field  prevailing  inside  the  double  layer  is  immense  compared  with  the 
exterior  electric  field  arising  from  the  net  surface  charge.9 
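The disparity between the two fields follows from the length scales alone, as footnote 9 indicates. A rough numerical sketch, assuming a potential drop of one volt in both cases, a double layer about one nanometer thick, and a crystal face about one centimeter across (all three numbers are illustrative assumptions, as is the use of the SI relation $\sigma = \epsilon_0 E$ for the net surface charge):

```python
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m
E_CHARGE = 1.602176634e-19   # elementary charge, C

potential_drop_V = 1.0       # assumed: comparable drops of order a volt
layer_thickness_m = 1.0e-9   # assumed double-layer thickness (a few lattice constants)
face_size_m = 1.0e-2         # assumed macroscopic face dimension (1 cm)

field_in_layer = potential_drop_V / layer_thickness_m   # V/m
field_outside = potential_drop_V / face_size_m          # V/m
field_ratio = field_in_layer / field_outside

# Net surface charge needed to sustain the exterior field (sigma = eps0 * E),
# expressed in electrons per square angstrom.
net_charge_e_per_A2 = EPS0 * field_outside / E_CHARGE * 1.0e-20
```

On these assumptions the field inside the double layer exceeds the exterior field by a factor of about ten million, and the net surface charge is of order $10^{-11}$ electrons per square angstrom, far below the fractions of an electron per surface cell that are rearranged within the double layer itself.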

When  all  surfaces  of  the  solid  are  not  equivalent,  the  work  function  is  defined  for 
a  particular  surface  to  include  only  the  work  that  must  be  done  against  the  field  in 
its  double  layer  (a  feature  intrinsic  to  that  surface),  but  not  the  additional  work  that 
must  be  done  against  exterior  fields  that  may  be  present  due  to  the  redistribution  of 
surface charges (a quantity that depends on what other surfaces happen to be
exposed).  Because  such  exterior  fields  are  minute  compared  with  the  fields  in  the 
double  layers,  one  insures  that  only  the  latter  fields  contribute  to  the  work  function 
of  a  surface  by  defining  it  to  be  the  minimum  work  required  to  remove  an  electron 
through  that  surface  to  a  point  that  is  far  away  on  the  atomic  scale  (so  that  the  electron 


7  The  condition  that  the  crystal  as  a  whole  be  neutral  requires  only  that  the  sum  over  all  faces  of 
the  macroscopic  surface  charge  on  each  should  vanish. 

8  See  Problem  lb. 

9  The  potential  drop  from  face  to  face  is  comparable  to  the  potential  drop  across  the  double  layers 
(see  Eq.  (18.10)).  The  former  drop,  however,  takes  place  over  macroscopic  distances  (of  the  order  of  the 
dimensions  of  the  crystal  faces)  while  the  latter  occurs  over  microscopic  distances  (of  the  order  of  the 
thickness  of  the  double  layer — i.e.,  a  few  lattice  constants). 
has passed through the entire double layer) but not far on the scale of the dimensions
of  the  macroscopic  crystal  faces  (so  that  fields  existing  outside  of  the  crystal  do 
negligible  work  on  the  electron10). 


CONTACT  POTENTIALS 

Suppose  two  metals  are  connected  in  a  way  that  permits  electrons  to  flow  freely  from 
one  to  the  other.  When  equilibrium  is  attained,  the  electrons  in  each  metal  must  be  at 
the  same  chemical  potential.  This  is  achieved  by  a  momentary  flow  of  charge  from  the 
surface  of  one  to  the  surface  of  the  other.  The  surface  charge  on  each  metal  gives  rise 
to  a  potential  in  the  interior  that  uniformly  shifts  all  of  the  bulk  levels  together  with 
the  chemical  potential  (so  that  bulk  properties  in  the  interior  remain  unaltered). 

Because  charge  has  been  transferred,  the  two  metals  will  no  longer  be  at  the  same 
electrostatic  potential.  The  potential  difference  between  any  two  faces  of  the  two 
metals  can  be  expressed  in  terms  of  their  work  functions  by  the  same  argument  we 
used  to  find  the  potential  difference  between  two  inequivalent  faces  of  a  single  metallic 
specimen (Figure 18.3). One again observes that if an electron at the Fermi level11
is extracted through a face of the first metal (with work function $W$), and reintroduced
through a face of the second metal (with work function $W'$) at the (same) Fermi level,
then if energy is to be conserved, there must be an exterior electric field that does
work $W - W'$ on the electron, which requires, in turn, a potential difference between
the two faces given by

$$-e(\phi - \phi') = W - W'. \tag{18.11}$$

The  two  metals  (before  and  after  their  electrons  have  come  to  equilibrium)  are 
depicted  schematically  in  Figure  18.4.  Since  some  contact  must  be  made  between  the 
metals  to  permit  their  electrons  to  come  to  equilibrium,  the  potential  difference 
(18.11)  is  known  as  a  contact  potential. 
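As a small worked instance of (18.11), the following sketch evaluates the contact potential for a few pairs of metals using work functions from Table 18.1. The function and dictionary names are ours; since the work functions are in electron volts, dividing by e gives the potential difference directly in volts.

```python
# Work functions in eV, taken from Table 18.1.
WORK_FUNCTION_EV = {"Cs": 1.81, "Zn": 4.24, "Cu": 4.4, "W": 4.5}

def contact_potential_V(w_eV, w_prime_eV):
    """Return phi - phi' in volts from Eq. (18.11): -e(phi - phi') = W - W'."""
    return -(w_eV - w_prime_eV)

# Tungsten (unprimed) against cesium (primed).
dphi_W_Cs = contact_potential_V(WORK_FUNCTION_EV["W"], WORK_FUNCTION_EV["Cs"])
# Copper (unprimed) against zinc (primed).
dphi_Cu_Zn = contact_potential_V(WORK_FUNCTION_EV["Cu"], WORK_FUNCTION_EV["Zn"])
```

The metal with the larger work function ends up at the lower electrostatic potential just outside its surface, by a couple of volts for the tungsten-cesium pair and by a fraction of a volt for copper-zinc. Given the few-percent uncertainty of the tabulated values, the smaller difference is only a rough estimate.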


THE  MEASUREMENT  OF  WORK  FUNCTIONS  VIA  THE 
MEASUREMENT  OF  CONTACT  POTENTIALS 

Equation (18.11) suggests that a simple way to measure the work function of a metal12
would  be  to  measure  the  contact  potential  between  it  and  a  metal  of  known  work 
function.  This  cannot  be  done  simply  by  connecting  a  galvanometer  across  two  faces, 
for  if  it  could  one  would  have  produced  a  flow  of  current  in  a  circuit  without  a 
sustaining  source  of  energy. 


10  Even  if  all  faces  are  equivalent,  the  interaction  of  the  removed  electron  with  those  remaining  in 
the  metal  will  induce  macroscopic  surface  charges  (giving  rise  to  the  “image  charge”  of  electrostatics) 
whose contribution to the work function is also made negligible by this definition.

11  In  metals  at  room  temperature  and  below,  the  chemical  potential  differs  negligibly  from  the 
Fermi  energy.  See  Eq.  (2.77). 

12  When  speaking  of  the  work  function  of  a  metal  without  reference  to  a  particular  crystal  face  one 
has  in  mind  the  value  appropriate  to  some  rough  (on  the  microscopic  scale)  face,  which  therefore  represents 
some  average  value  of  the  work  functions  for  crystallographically  well-defined  faces. 
Figure 18.4

(a) The crystal potential U (or electrostatic potential $\phi = U/(-e)$) for a metal with
work function $W$ and Fermi energy $\mathcal{E}_F$. (The figure is essentially that of Figure 18.2b.)
(b) A similar plot for a second metal with work function $W'$ and Fermi energy $\mathcal{E}_F'$,
electrically insulated from the first. In (c) the two metals have been joined together by
a conducting wire, so that charge can pass freely from one to the other. The only result
is to introduce small amounts of net surface charge in each metal, sufficient to shift
uniformly the level structures in (a) and (b) so as to bring the two Fermi levels into
coincidence. Because of the slight surface charges on the metals the potentials outside
them are no longer strictly constant, and there is a potential drop from one metal
to the other given by $-e(\phi - \phi') = W - W'$.


However,  a  simple  method  due  to  Kelvin  can  be  used  to  measure  contact  potentials. 
Suppose  the  two  samples  are  arranged  so  that  the  two  crystal  faces  form  a  plane 
parallel  capacitor.  If  there  is  a  potential  difference  V  between  the  faces,  then  there 
will  be  a  charge  per  unit  area  given  by 


$$\sigma = \frac{E}{4\pi} = \frac{V}{4\pi d}, \tag{18.12}$$


where  d  is  the  distance  between  the  faces.  If  the  plates  are  connected  and  no  external 
voltage  imposed,  the  potential  difference  will  just  be  the  contact  potential  Vc.  As  the 
distance d between plates is varied, the contact potential remains unchanged, and
therefore charge must flow between the faces to maintain the relation

$$\sigma = \frac{V_c}{4\pi d}. \tag{18.13}$$


By  measuring  the  flow  of  charge  one  can  measure  the  contact  potential.  One  can 
simplify  the  procedure  by  adding  to  the  circuit  an  external  potential  bias  and 
adjusting  it  so  that  no  current  flows  when  d  is  varied  (Figure  18.5).  When  this  is 
achieved,  the  bias  will  be  equal  and  opposite  to  the  contact  potential. 
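The nulling procedure can be mimicked in a few lines. The sketch below is a simulation under stated assumptions, not a description of a real apparatus: it uses the SI parallel-plate relation $\sigma = \epsilon_0 V/d$ in place of the Gaussian form (18.12), takes the total potential difference to be the contact potential plus the bias, and locates the null by bisection. The contact potential value and all function names are hypothetical.

```python
EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m

def charge_flow_per_area(v_contact, v_bias, d_start, d_end):
    """Change in surface charge density (C/m^2) when the gap goes d_start -> d_end.

    Uses sigma = EPS0 * V / d with V the total potential difference
    (contact potential plus external bias).
    """
    v_total = v_contact + v_bias
    return EPS0 * v_total * (1.0 / d_end - 1.0 / d_start)

def find_null_bias(measure, lo=-5.0, hi=5.0, tol=1e-9):
    """Bisect on the bias until the measured charge flow changes sign."""
    f_lo = measure(lo)
    if f_lo * measure(hi) > 0:
        raise ValueError("null not bracketed by [lo, hi]")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = measure(mid)
        if f_lo * f_mid <= 0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)

V_CONTACT = -0.16  # hypothetical contact potential in volts, "unknown" to the experimenter

def measure(bias):
    # Simulated experiment: move the plates from 1 mm to 0.5 mm apart.
    return charge_flow_per_area(V_CONTACT, bias, 1.0e-3, 0.5e-3)

null_bias = find_null_bias(measure)
```

The bias returned is equal and opposite to `V_CONTACT`, which is how the measurement recovers the contact potential without needing to know the plate area or the absolute spacing.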




Figure  18.5 

How  to  measure  a  contact  potential.  As  the  distance  between  two 
parallel  plane  faces  is  varied,  the  capacitance  will  vary.  Since  the 
potential  difference  is  fixed  at  the  contact  potential,  variable 
capacitance  implies  variable  charge  density  on  the  faces.  To  permit 
the  charge  on  the  faces  to  readjust  as  the  separation  between  them 
is  changed,  a  current  must  flow  in  the  wire  joining  them.  The 
measurement  can  be  made  simpler  by  adding  an  external  potential 
bias  and  adjusting  it  so  that  no  current  flows  through  the  ammeter 
A  when  d  is  varied.  When  this  situation  is  achieved,  the  bias  will 
just  cancel  the  contact  potential. 


OTHER  WAYS  TO  MEASURE  WORK  FUNCTIONS:  THERMIONIC 
EMISSION 

There  are  several  other  ways  of  measuring  work  functions.  One  method  exploits  the 
photoelectric  effect,  measuring  the  minimum  photon  energy  necessary  to  eject  an 
electron  through  the  crystal  face,  and  setting  that  equal  to  W. 
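The photoelectric threshold is easily converted to a wavelength, since a photon of wavelength $\lambda$ has energy $hc/\lambda$. A minimal sketch, using the standard value of $hc$ in eV nm and work functions from Table 18.1 (the function name is ours):

```python
HC_EV_NM = 1239.841984  # h*c in eV nm

def threshold_wavelength_nm(work_function_eV):
    """Longest photon wavelength (nm) able to eject an electron: lambda = hc / W."""
    return HC_EV_NM / work_function_eV

lambda_Cs = threshold_wavelength_nm(1.81)  # cesium
lambda_W = threshold_wavelength_nm(4.5)    # tungsten
```

For cesium the threshold lies in the visible red, while for tungsten it lies in the ultraviolet, which is one reason alkali metals are convenient photoemitters.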

Another  method,  of  some  importance  in  designing  cathode-ray  tubes,  measures 
the  temperature  dependence  of  the  electron  current  flowing  from  the  face  of  a  hot 
metal.  To  understand  such  thermionic  emission,  first  consider  the  idealized  case  in 
which  a  metal  surface  is  in  thermal  equilibrium  with  a  dilute  electron  gas  outside 
of  the  metal  (Figure  18.6).  At  temperature  T  the  electronic  distribution  function  is 

Figure  18.6 

A  simple  model  of  thermionic 
emission.  The  current  flowing 
out  of  a  metal  when  electrons 
that  escape  are  continually 
swept  away  is  calculated  by 
assuming  that  the  metal  is  in 
thermal  equilibrium  with  a 
dilute  gas  of  free  electrons, 
and identifying the thermionic
current with that carried
by  those  electrons  moving 
away  from  the  surface  (solid 
circles). 
$$f(\mathbf{k}) = \frac{1}{\exp[(\mathcal{E}_n(\mathbf{k}) - \mu)/k_B T] + 1}. \tag{18.14}$$

Inside the metal $\mathcal{E}_n(\mathbf{k})$ is determined by the band structure.13 Outside of the metal,
$\mathcal{E}_n(\mathbf{k})$ should be taken to have the free particle form

$$\frac{\hbar^2 k^2}{2m} - e\phi, \tag{18.15}$$

where $\phi$ is the local value of the electrostatic potential.14 If the additive constant in
the periodic potential is defined according to the convention of Figure 18.2b (so that
the expression (18.9) gives the work function), then outside of the double layer we
have (see Figure 18.2b):

$$-e\phi = W_s. \tag{18.16}$$

Therefore the distribution of external electrons beyond the double layer will be

$$f(\mathbf{k}) = \frac{1}{\exp[(\hbar^2 k^2/2m + W_s - \mu)/k_B T] + 1}. \tag{18.17}$$

However, Eq. (18.9) permits this to be written in the form15

$$f(\mathbf{k}) = \frac{1}{\exp[(\hbar^2 k^2/2m + W)/k_B T] + 1}, \tag{18.18}$$

where $W$ is the work function of the surface.

Since work functions are typically a few electron volts in size (see Table 18.1),
$W/k_B$ is of order $10^4$ K. Therefore at temperatures below several thousand degrees
(18.18) reduces to:16

$$f(\mathbf{k}) = \exp[-(\hbar^2 k^2/2m + W)/k_B T]. \tag{18.19}$$
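How good the replacement of (18.18) by (18.19) is can be checked directly. The sketch below compares the two forms for an electron with negligible kinetic energy, which is the least favorable case; the choice of W = 4.5 eV and T = 2000 K is an illustrative assumption, and the function names are ours.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant, eV/K

def fermi_dirac(x):
    """Form (18.18) with x = (kinetic energy + W) / (k_B T)."""
    return 1.0 / (math.exp(x) + 1.0)

def boltzmann(x):
    """Form (18.19) with the same x."""
    return math.exp(-x)

def relative_error(work_function_eV, temperature_K, kinetic_eV=0.0):
    """Fractional error made by using (18.19) in place of (18.18)."""
    x = (kinetic_eV + work_function_eV) / (K_B_EV * temperature_K)
    return (boltzmann(x) - fermi_dirac(x)) / fermi_dirac(x)

w_over_kb_K = 4.5 / K_B_EV          # W / k_B expressed as a temperature
err = relative_error(4.5, 2000.0)   # hypothetical tungsten-like surface at 2000 K
```

Algebraically the fractional error is just $e^{-x}$, so it is of the same tiny size as the occupation itself and shrinks further for electrons with appreciable kinetic energy.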

The electronic current density flowing away from the surface is given by adding
the contributions from all electrons with positive $v_x = \hbar k_x/m$, where the positive
x-direction is taken to be the direction of the normal out of the surface:

$$j = -e \int_{k_x > 0} \frac{d\mathbf{k}}{4\pi^3}\, v_x f(\mathbf{k}) = e^{-W/k_B T}\,(-e) \int_{k_x > 0} \frac{d\mathbf{k}}{4\pi^3}\, \frac{\hbar k_x}{m}\, e^{-\hbar^2 k^2/2mk_B T}. \tag{18.20}$$


The integration is elementary, giving for the current per unit area emitted by the
surface,

$$j = -\frac{em}{2\pi^2\hbar^3}(k_B T)^2 e^{-W/k_B T}. \tag{18.21}$$


13 The derivation given here does not require the bands to be free electron-like. The result (18.21)
is independent of the details of the band structure.

14  Ignore  for  the  moment  the  contribution  of  the  dilute  electron  gas  itself  to  this  potential.  The  current 
calculated  by  ignoring  such  complications  (known  as  space-charge  effects)  is  called  the  saturation  current. 

15  See  footnote  11. 

16 The experimental confirmation of the Maxwell-Boltzmann form (18.19) of this distribution function
was a considerable obstruction on the road to the discovery that metallic electrons obey Fermi-Dirac
statistics.
This result, known as the Richardson-Dushman equation, asserts that if $\ln(j/T^2)$
is plotted against $1/k_B T$, the resulting curve will be a straight line with slope $-W$.
In this way the absolute work function can be determined.
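The straight-line analysis just described amounts to a linear least-squares fit. The sketch below generates synthetic emission data from the form (18.21), using the magnitude of j and an assumed work function and prefactor, and then recovers the work function from the slope. No measured data are involved, and the function name is ours.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant, eV/K

def fit_work_function(temperatures_K, current_densities):
    """Least-squares slope of ln(j/T^2) against 1/(k_B T); returns W = -slope in eV."""
    xs = [1.0 / (K_B_EV * T) for T in temperatures_K]
    ys = [math.log(j / T**2) for j, T in zip(current_densities, temperatures_K)]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    slope = (sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
             / sum((x - x_mean) ** 2 for x in xs))
    return -slope

# Synthetic "measurements": assumed W = 4.5 eV and an arbitrary prefactor.
W_TRUE_EV = 4.5
PREFACTOR = 1.2e6
temps = [1500.0, 1750.0, 2000.0, 2250.0, 2500.0]
currents = [PREFACTOR * T**2 * math.exp(-W_TRUE_EV / (K_B_EV * T)) for T in temps]

W_fit = fit_work_function(temps, currents)
```

Because only the slope is used, the fit does not depend on the prefactor, so the method remains usable when effects such as surface reflection reduce the current by a temperature-independent factor.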

In practice, the neglected space-charge effects are made unimportant by the application
of a small electric field that sweeps away the electrons as they are emitted.
In  addition,  for  our  model  to  be  applicable,  the  flow  of  electrons  away  from  the  metal 
must  be  dominated  by  electrons  that  have  originated  within  the  metal,  and  not  by 
electrons  from  the  gas  outside,  which  have  been  reflected  back  after  striking  the 
surface.  If  surface  scattering  is  important,  the  current  will  be  less  than  predicted  by 
(18.21). 
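To see the magnitudes implied by (18.21), one can evaluate the prefactor in SI units and the resulting saturation current density. In the sketch below the constants are standard values, while the work function (4.5 eV) and temperature (2500 K) are illustrative choices; the function returns the magnitude of j.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge, C
M_E = 9.1093837015e-31       # electron mass, kg
K_B = 1.380649e-23           # Boltzmann constant, J/K
HBAR = 1.054571817e-34       # reduced Planck constant, J s

# Prefactor of T^2 in Eq. (18.21): e m k_B^2 / (2 pi^2 hbar^3), in A m^-2 K^-2.
RICHARDSON_CONSTANT = E_CHARGE * M_E * K_B**2 / (2.0 * math.pi**2 * HBAR**3)

def saturation_current_density(work_function_eV, temperature_K):
    """Magnitude of j from Eq. (18.21), in A/m^2."""
    exponent = -work_function_eV * E_CHARGE / (K_B * temperature_K)
    return RICHARDSON_CONSTANT * temperature_K**2 * math.exp(exponent)

j_hot = saturation_current_density(4.5, 2500.0)
j_cooler = saturation_current_density(4.5, 2000.0)
```

The prefactor is about $1.2 \times 10^6$ A m$^{-2}$ K$^{-2}$, and the exponential makes the current extremely sensitive to temperature: lowering T from 2500 K to 2000 K reduces j by well over an order of magnitude.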

MEASURED WORK FUNCTIONS OF SELECTED METALS

In  Table  18.1  we  list  the  work  functions  for  some  typical  metals  as  determined  by 
these  three  methods.  Generally  speaking,  the  various  methods  all  give  results  that 
differ by about 5 percent. Since the variation in work function over different
crystallographic faces can easily be of this size, it is not worth quoting the separate results.
Nor  should  a  number  for  the  metal  as  a  whole  be  relied  on  to  more  than  a  few  percent. 


Table 18.1

WORK FUNCTIONS OF TYPICAL METALS

METAL   W (eV)      METAL   W (eV)      METAL   W (eV)
Li      2.38        Ca      2.80        In      3.8
Na      2.35        Sr      2.35        Ga      3.96
K       2.22        Ba      2.49        Tl      3.7
Rb      2.16        Nb      3.99        Sn      4.38
Cs      1.81        Fe      4.31        Pb      4.0
Cu      4.4         Mn      3.83        Bi      4.4
Ag      4.3         Zn      4.24        Sb      4.08
Au      4.3         Cd      4.1         W       4.5
Be      3.92        Hg      4.52
Mg      3.64        Al      4.25

Source: V. S. Fomenko, Handbook of Thermionic Properties, G. V. Samsanov, ed.,
Plenum Press Data Division, New York, 1966. (Values given are the author's
distillation of many different experimental determinations.)


We conclude our discussion of surfaces with brief descriptions of two of the major
techniques  used  to  probe  surface  structure. 

LOW-ENERGY  ELECTRON  DIFFRACTION 

The  structure  of  the  surface  of  a  crystal  specimen  with  a  good  plane  surface  (on  the 
microscopic  level)  can  be  studied  by  the  technique  of  low-energy  electron  diffraction 
(LEED).  The  basis  for  the  method  is  very  much  the  same  as  in  the  theory  of  X-ray 
diffraction,  modified  to  take  into  account  the  fact  that  the  diffracting  surface  is  only 
periodic  in  two  dimensions  (i.e.,  in  its  own  plane).  Elastically  scattered  electrons  are 
more  suitable  for  the  study  of  surfaces  than  X  rays,  because  they  penetrate  only  a 
very short distance into the solid, so that the diffraction pattern is determined almost
entirely  by  the  surface  atoms. 

The energies of the electrons required for such a study are easily estimated. A free
electron with wave vector $\mathbf{k}$ has energy

$$\mathcal{E} = \frac{\hbar^2 k^2}{2m} = (k a_0)^2\ \text{Ry} = 13.6\,(k a_0)^2\ \text{eV}, \tag{18.22}$$

from which it follows that the de Broglie wavelength of an electron is related to its
energy in electron volts by

$$\lambda = \frac{12.3}{(E_{\text{eV}})^{1/2}}\ \text{\AA}. \tag{18.23}$$

Since  the  electron  wavelengths  must  be  of  the  order  of  a  lattice  constant  or  less, 
energies  of  some  tens  of  electron  volts  or  higher  are  necessary. 
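The estimate (18.23) follows from (18.22) with $\lambda = 2\pi/k$, and is easily tabulated. A short sketch using standard values of the Rydberg and the Bohr radius (the function name and the sample energies are ours):

```python
import math

RYDBERG_EV = 13.605693      # Rydberg energy, eV
BOHR_RADIUS_A = 0.529177    # Bohr radius, angstroms

def de_broglie_wavelength_A(energy_eV):
    """Wavelength in angstroms of a free electron, from E = (k a0)^2 Ry and lambda = 2 pi / k."""
    k_a0 = math.sqrt(energy_eV / RYDBERG_EV)
    return 2.0 * math.pi * BOHR_RADIUS_A / k_a0

lambda_1eV = de_broglie_wavelength_A(1.0)
lambda_20eV = de_broglie_wavelength_A(20.0)
lambda_150eV = de_broglie_wavelength_A(150.0)
```

At 1 eV the wavelength reproduces the coefficient in (18.23); it falls to a few angstroms at 20 eV and to about one angstrom at 150 eV, which is the range needed to resolve typical lattice constants.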

To understand qualitatively the pattern produced in an electron diffraction measurement,
suppose the scattering is elastic,17 and that the incident and scattered
electron wave vectors are $\mathbf{k}$ and $\mathbf{k}'$. Suppose, furthermore, that the crystal surface is
a lattice plane perpendicular to the reciprocal lattice vector $\mathbf{b}_3$ (see page 90). Choose
a set of primitive vectors including $\mathbf{b}_3$ for the reciprocal lattice, and primitive vectors
$\mathbf{a}_i$ for the direct lattice, satisfying

$$\mathbf{a}_i \cdot \mathbf{b}_j = 2\pi\delta_{ij}. \tag{18.24}$$

If  the  electron  beam  penetrates  so  little  that  only  scattering  from  the  surface  plane 
is  significant,  then  the  condition  for  constructive  interference  is  that  the  change  q  in 
the  wave  vector  of  the  scattered  electron  satisfy 


$$\mathbf{q} \cdot \mathbf{d} = 2\pi \times \text{integer}, \quad \mathbf{q} = \mathbf{k}' - \mathbf{k}, \tag{18.25}$$

for  all  vectors  d  joining  lattice  points  in  the  plane  of  the  surface  (cf.  Eq.  (6.5)). 
Since  such  d  are  perpendicular  to  b3,  they  can  be  written  as: 

$$\mathbf{d} = n_1\mathbf{a}_1 + n_2\mathbf{a}_2. \tag{18.26}$$


Writing  q  in  the  general  form 

$$\mathbf{q} = \sum_{i=1}^{3} q_i \mathbf{b}_i, \tag{18.27}$$

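we find from (18.24) that $\mathbf{q} \cdot \mathbf{d} = 2\pi(n_1 q_1 + n_2 q_2)$, so that conditions (18.25) and (18.26) require

$$q_1 = \text{integer}, \quad q_2 = \text{integer}, \quad q_3 = \text{arbitrary}. \tag{18.28}$$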

Since  b3  is  normal  to  the  surface,  these  conditions  will  be  satisfied  by  discrete  lines18 
in  q-space  perpendicular  to  the  crystal  surface  (as  opposed  to  discrete  points  in  the 
case  of  diffraction  by  a  three-dimensional  lattice).  Consequently,  even  when  the 


17  In  fact,  however,  the  elastically  scattered  component  is  generally  a  very  small  fraction  of  the  total 
reflected  flux  of  electrons. 

18  Sometimes  referred  to  as  “rods”  in  the  literature. 
energy conservation condition $k = k'$ is added, there will always be nontrivial
solutions  unless  the  incident  wave  vector  is  too  small. 
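The geometry of the allowed beams can be made concrete for a simple case. The sketch below assumes a square surface lattice of side a, normal incidence, and purely elastic scattering from the surface plane. The component of $\mathbf{q}$ parallel to the surface must then be a vector $\mathbf{G}$ of the two-dimensional reciprocal lattice, and a beam emerges only if $|\mathbf{G}| < k$. The lattice constant, the energies, and the function name are illustrative assumptions.

```python
import math

RYDBERG_EV = 13.605693      # Rydberg energy, eV
BOHR_RADIUS_A = 0.529177    # Bohr radius, angstroms

def leed_beams(lattice_constant_A, energy_eV, max_index=5):
    """Diffracted beams for a square surface lattice at normal incidence.

    Returns {(h, l): angle in degrees from the surface normal}. The parallel
    momentum transfer is G = (2 pi / a)(h, l); energy conservation |k'| = |k|
    then requires |G| < k for the beam to propagate away from the surface.
    """
    k = math.sqrt(energy_eV / RYDBERG_EV) / BOHR_RADIUS_A   # 1/angstrom
    g = 2.0 * math.pi / lattice_constant_A
    beams = {}
    for h in range(-max_index, max_index + 1):
        for l in range(-max_index, max_index + 1):
            g_par = g * math.hypot(h, l)
            if g_par < k:
                beams[(h, l)] = math.degrees(math.asin(g_par / k))
    return beams

beams_100eV = leed_beams(3.0, 100.0)   # hypothetical a = 3 angstroms
beams_10eV = leed_beams(3.0, 10.0)
```

At 100 eV this hypothetical surface gives 21 beams, while at 10 eV the incident wave vector is smaller than the shortest nonzero surface reciprocal lattice vector and only the specular (0, 0) beam survives, illustrating the proviso that the incident wave vector must not be too small.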

Equation  (18.28)  (together  with  an  experimental  arrangement  that  selects  out 
the  elastically  scattered  component)  enables  one  to  deduce  the  surface  Bravais  lattice 
from  the  structure  of  the  reflected  pattern.  If  scattering  from  more  than  one  surface 
layer is important, the general structure of the pattern will be unchanged, for
lower-lying planes will produce a weaker version of the same pattern (attenuated because
only  a  smaller  fraction  of  the  beam  can  penetrate  to  the  next  layer). 

Much  more  information  is  contained  in  the  detailed  distribution  of  scattered 
electrons  than  the  mere  arrangement  of  atoms  in  the  surface  plane,  but  to  extract 
such  information  is  a  difficult  problem,  whose  solution  remains  elusive. 


THE  FIELD  ION  MICROSCOPE 

Elastic  low-energy  electron  diffraction  reveals  the  structure  of  the  Fourier  transform  of 
the  surface  charge  density— i.e.,  its  shape  in  k-space.  The  structure  in  real  space  can  be 
seen  by  the  technique  of  the  field  ion  microscope.  The  surface  must  be  that  of  a  sharply 
pointed  specimen  (Figure  18.7),  with  a  point  that  is  more  or  less  hemispherical  on 
the  atomic  scale,  and  with  a  radius  of  order  a  few  thousand  angstroms.  The  specimen  is 
placed  in  a  high  vacuum  facing  an  electrode.  A  large  voltage  is  applied  between  the 


Figure  18.7 

Schematic  representation  of  a  field  ion  microscope. 
The  sample  (pointed  cone)  is  at  a  positive  potential 
with respect to the plate, so that field lines point radially
outward. A neutral helium atom (empty circle)
will  be  attracted  to  the  region  of  high  field  by  the 
induced  dipole  interaction.  A  helium  ion  (circle 
with  plus)  will  be  strongly  repelled  along  field  lines. 
The  field  will  only  be  strong  enough  to  ionize  helium 
atoms  in  the  immediate  vicinity  of  the  tip.  The  basic 
assumption  is  that  most  of  the  helium  atoms  are 
ionized  in  the  immediate  neighborhood  of  surface 
atoms,  where  the  field  is  at  its  strongest.  Since  the 
variation  of  the  field  near  the  surface  reflects  the 
variation  of  the  atomic  structure,  the  pattern  of  ions 
impinging  on  the  plate  should  give  a  representation 
of  the  atomic  structure  of  the  tip. 


specimen  and  the  electrode,  with  a  polarity  chosen  to  make  the  specimen  positive. 
Neutral helium atoms are then introduced into the vacuum chamber, where they are
polarized by the field. The interaction of the field with this induced dipole moment
draws  the  helium  atoms  into  the  region  of  strongest  field — i.e.,  toward  the  tip  of  the 
specimen. At a few atomic spacings from the tip the field becomes so strong that an
electron can be torn from the helium atom. When this happens, the atom becomes
a  positively  charged  helium  ion,  and  is  energetically  repelled  from  the  tip  toward  the 
electrode.  If  the  field  strength  is  adjusted  so  that  ionization  occurs  only  very  close 
to  the  surface,  one  might  hope  that  the  angular  distribution  of  ions  fleeing  from  the 
tip  would  reflect  the  microscopic  structure  of  the  surface— via  the  structure  of  the 
field  in  the  immediate  vicinity  of  the  surface — magnified  by  the  ratio  of  the  radius 
of  the  collecting  electrode  to  the  radius  of  the  hemispherical  sample. 
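To get a feeling for the size of this magnification, here is a small Python sketch. The electrode radius of 10 cm, the tip radius of 2000 angstroms, and the atomic spacing of 3 angstroms are assumed values chosen only for illustration, and the function names are our own.

```python
ANGSTROM_IN_CM = 1.0e-8

def fim_magnification(electrode_radius_cm, tip_radius_angstrom):
    """Ratio of collecting-electrode radius to tip radius (dimensionless)."""
    return electrode_radius_cm / (tip_radius_angstrom * ANGSTROM_IN_CM)

def image_separation_cm(spacing_angstrom, magnification):
    """Distance on the electrode between images of two points on the tip."""
    return spacing_angstrom * ANGSTROM_IN_CM * magnification

m = fim_magnification(10.0, 2000.0)     # assumed geometry
print(m, image_separation_cm(3.0, m))   # magnification, image spacing in cm
```

With these assumed numbers the magnification is about 5 x 10^5, so that atoms 3 angstroms apart on the tip are imaged roughly 0.15 mm apart on the electrode.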

In  fact,  the  pictures  so  obtained  not  only  reflect  the  detailed  symmetry  of  the  crystal, 
but  actually  indicate  the  positions  of  individual  atoms  (Figure  18.8).  The  technique 
can  be  used  to  study  the  behavior  of  individual  atomic  impurities. 


Figure  18.8 

A  field  ion  micrograph  of  a 
gold  tip.  In  this  example  the 
imaging  gas  is  neon  rather 
than helium. (From R. S.
Averbach and D. N. Seidman,
Surface Science 40, 249
(1973).  We  thank  Professor 
Seidman  for  giving  us  the 
original  micrograph.) 


ELECTRONIC  SURFACE  LEVELS 

Any  attempt  to  describe  a  solid  surface  in  detail  must  use  the  fact  that  in  addition 
to the Bloch solutions to the one-electron Schrödinger equation for the familiar
periodically  extended  crystal,  there  are  other  solutions  with  complex  wave  vectors 
describing  electronic  levels  that  are  localized  in  the  neighborhood  of  the  surface  of 
a  real  crystal.  We  have  properly  neglected  such  levels  in  our  earlier  discussions  of 
bulk  properties.  The  number  of  surface  levels  compared  with  the  number  of  Bloch 
levels  turns  out  to  be  at  most  of  the  order  of  the  number  of  surface  atoms  compared 
with the number of atoms in the entire crystal, i.e., about one in $10^8$ for a macroscopic
specimen. As a result the surface levels make a negligible contribution to bulk properties
except for exceedingly small specimens. However, they are of considerable importance
in  determining  the  structure  of  the  crystal  surface.  For  example,  they  must  play  a 
role  in  any  genuinely  microscopic  calculation  of  the  structure  of  the  surface  dipole 
layer. 
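The order of magnitude quoted above can be checked with a crude count, sketched here in Python. We assume a cube-shaped specimen with one atom per site of a simple cubic array, which is our own simplification; the function name is likewise hypothetical.

```python
def surface_fraction(n_atoms):
    """Rough fraction of atoms lying on the faces of a cube-shaped crystal."""
    atoms_per_edge = n_atoms ** (1.0 / 3.0)
    # Six faces of atoms_per_edge**2 atoms each; edge double counting ignored.
    return 6.0 * atoms_per_edge ** 2 / n_atoms

print(surface_fraction(1.0e24))  # macroscopic specimen
print(surface_fraction(1.0e6))   # specimen only 100 atoms on a side
```

For 10^24 atoms the fraction is several parts in 10^8, while for a specimen only 100 atoms on a side it rises to several percent, which is why surface levels matter only for exceedingly small specimens.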

To  understand  qualitatively  how  these  surface  levels  arise,  we  reexamine  our 
derivation  of  Bloch’s  theorem  in  Chapter  8. 

The  argument  leading  to  the  Bloch  form 

$$\psi(\mathbf{r}) = e^{i\mathbf{k}\cdot\mathbf{r}}\, u(\mathbf{r}), \qquad u(\mathbf{r} + \mathbf{R}) = u(\mathbf{r}), \tag{18.29}$$

did  not  require  the  wave  vector  k  to  be  real.  This  further  restriction  emerged  from  an 
application  of  the  Born-von  Karman  periodic  boundary  condition.  This  boundary 
condition,  however,  is  an  artifact  of  the  infinite  crystal.  If  it  is  abandoned,  we  can 
find many more solutions to the infinite crystal Schrödinger equation, having the
form 

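$$\psi(\mathbf{r}) = \left[e^{i\mathbf{k}\cdot\mathbf{r}}\, u(\mathbf{r})\right] e^{-\boldsymbol{\kappa}\cdot\mathbf{r}}, \tag{18.30}$$

where k is now the real part of the Bloch wave vector, which may also have an
imaginary part $\boldsymbol{\kappa}$.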

The wave function (18.30) grows without bound in the direction opposite to $\boldsymbol{\kappa}$,
and decays exponentially in the direction of $\boldsymbol{\kappa}$. Since the electronic density is everywhere
finite, such levels have no relevance for an infinite crystal. If, however, there
is a plane surface perpendicular to $\boldsymbol{\kappa}$, then one might attempt to join a solution
of the form (18.30) within the crystal, that grows exponentially as the surface is
approached, with one that is exponentially damped outside of the crystal (Figure 18.9).
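The growth and decay can be made explicit in one line. The following is only a sketch, using nothing beyond the form (18.30) and the periodicity of u, with $\boldsymbol{\kappa}$ denoting the imaginary part of the wave vector:

```latex
\begin{aligned}
\psi(\mathbf{r} + \mathbf{R}) &= e^{i\mathbf{k}\cdot\mathbf{R}}\, e^{-\boldsymbol{\kappa}\cdot\mathbf{R}}\, \psi(\mathbf{r}), \\
|\psi(\mathbf{r} + \mathbf{R})|^2 &= e^{-2\boldsymbol{\kappa}\cdot\mathbf{R}}\, |\psi(\mathbf{r})|^2 .
\end{aligned}
```

Translating through Bravais lattice vectors with $\boldsymbol{\kappa}\cdot\mathbf{R} \to -\infty$ therefore makes the electronic density arbitrarily large, which is acceptable only if the crystal terminates before that happens.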


Figure 18.9

Wave function for a one-electron surface level plotted in a direction x, perpendicular to the
surface. Note that $\psi$ decays exponentially outside and has an exponentially decreasing envelope
inside.

In general, for a fixed component of k parallel to the surface this matching will be
possible only for a discrete set of $\boldsymbol{\kappa}$ (as is the case for any problem concerning localized
levels).

To  explore  this  problem  further  would  take  us  well  beyond  the  scope  of  this  book, 
for  we  should  first  have  to  reexamine  the  entire  discussion  of  Bloch  functions  without 
the  restriction  that  the  wave  vector  k  be  real,  and  then  discuss  the  problem  of  how 
such  Bloch  functions  with  complex  wave  vectors  can  be  matched  onto  exponentially 
decaying  levels  in  empty  space.  Features  of  such  solutions  in  the  nearly  free  electron 
approximation  are  explored  in  Problem  2. 
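For readers who want to see numbers, the closed-form solution stated in Problem 2(d) for a surface at a = 0 can be evaluated directly. The Python sketch below does so in units where hbar^2/2m = 1; these units, the function name, and the sample values of U0, UK, and K are our own assumptions. As a consistency check it also substitutes the resulting energy into the two-plane-wave secular equation with the complex normal wave vector component K/2 - iq.

```python
import math

def surface_level(U0, UK, K, k_par=0.0):
    """Evaluate the a = 0 surface level of Problem 2(d), with hbar^2/2m = 1.

    U0, UK : zeroth and K-th Fourier components of the periodic potential.
    K      : magnitude of the reciprocal lattice vector normal to the surface.
    k_par  : magnitude of the wave vector component parallel to the surface.
    """
    E0 = (K / 2.0) ** 2
    sec2 = -(U0 + UK) / E0
    if sec2 < 1.0:
        raise ValueError("no surface level: need -(U0 + UK) >= E0")
    delta = math.acos(1.0 / math.sqrt(sec2))           # branch 0 <= delta < pi/2
    q = -0.25 * K * (UK / E0) * math.sin(2.0 * delta)  # decay constant inside
    p = 0.5 * K * math.tan(delta) - q                  # decay constant outside
    energy = k_par ** 2 - p ** 2                       # free-space value outside

    # Free-electron energies at the complex wave vectors k and k - K.
    eps_k = k_par ** 2 + complex(K / 2.0, -q) ** 2
    eps_k_minus_K = k_par ** 2 + complex(-K / 2.0, -q) ** 2
    # Determinant of the secular equation; it should vanish for a true level.
    residual = abs((energy - eps_k - U0) * (energy - eps_k_minus_K - U0) - UK ** 2)
    return {"delta": delta, "q": q, "p": p, "energy": energy, "residual": residual}

print(surface_level(U0=-2.0, UK=-1.0, K=2.0))
```

Both decay constants come out positive for this choice, so the level is damped on either side of the surface, and its energy is negative, i.e., below the lowest level outside the crystal.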


PROBLEMS 


1.  Some  Problems  in  Electrostatics  Bearing  on  Contact  Potentials  and  the  Double 
Layer 

(a)  Consider  a  plane  surface  of  a  metal  perpendicular  to  the  x-axis.  Perhaps  the  simplest 
model  of  the  distortion  in  the  charge  densities  of  the  cells  near  the  surface  is  one  that  ignores  any 
variation  in  the  plane  of  the  surface,  describing  the  deviation  in  the  charge  density  from  its  form 
in the bulk by a function of the single variable x, $\delta\rho(x)$. The condition that there is no net macroscopic
charge density on the surface is

$$0 = \int dx\, \delta\rho(x). \tag{18.31}$$

The charge density $\delta\rho(x)$ will give rise to an electric field $E(x)$ also normal to the surface. Deduce
directly from Gauss’s law ($\nabla \cdot \mathbf{E} = 4\pi\,\delta\rho$) that if the field vanishes on one side of the double layer
(as it does inside the metal) then it must also vanish on the other side. Deduce also that the work
that must be done to move an electron through the double layer is just

$$W_s = -4\pi e P, \tag{18.32}$$

where  P  is  the  dipole  moment  per  unit  surface  area  produced  by  the  double  layer.  (Hint:  Write 
the  work  as  an  integral  and  introduce  a  judicious  integration  by  parts.) 

(b)  Show  that  the  charge  density  that  must  be  supplied  to  a  conducting  sphere  of  radius 
1 cm to raise its potential from zero to 1 volt is of order $10^{-10}$ electrons per square angstrom.
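The arithmetic of part (b) is easily checked numerically. The Python sketch below works in Gaussian units, where the potential of a conducting sphere is V = Q/R; the function name and the rounded values of the constants are our own choices.

```python
import math

STATVOLT_PER_VOLT = 1.0 / 299.792458  # one volt expressed in statvolts
ELECTRON_CHARGE_ESU = 4.803e-10       # magnitude of the electron charge in esu

def electrons_per_square_angstrom(radius_cm, potential_volts):
    """Surface density of electrons needed to charge a sphere to a potential."""
    charge_esu = potential_volts * STATVOLT_PER_VOLT * radius_cm  # Q = V R
    n_electrons = charge_esu / ELECTRON_CHARGE_ESU
    area_square_angstrom = 4.0 * math.pi * (radius_cm * 1.0e8) ** 2
    return n_electrons / area_square_angstrom

print(electrons_per_square_angstrom(1.0, 1.0))
```

The result is a few times 10^-11 electrons per square angstrom, which is utterly negligible compared with the roughly one atom per several square angstroms found at a surface.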

2.  Electron  Surface  Levels  for  a  Weak  Periodic  Potential19 

The method of Chapter 9 can be used to investigate electronic surface levels. Consider a semi-infinite
crystal with a plane surface, perpendicular to a reciprocal lattice vector K (crystal surfaces
are parallel to lattice planes). Taking the x-axis to lie along K and the origin to be a Bravais lattice
point, to a rough approximation we can represent the potential of the semi-infinite crystal by
$V(\mathbf{r}) = U(\mathbf{r})$, $x < a$; $V(\mathbf{r}) = 0$, $x > a$. Here $U(\mathbf{r})$ is the periodic potential of the infinite crystal.
The distance a lies between zero and the interplanar distance in the family of planes parallel to
the surface, and should be chosen in any particular problem to give that $U(\mathbf{r})$ which most resembles
the actual potential at the surface.


19 See E. T. Goodwin, Proc. Camb. Phil. Soc. 35, 205 (1935).

We continue to assume that the Fourier components $U_K$ are real. However, if we wish the
lowest level outside of the crystal to have energy 0, we can no longer neglect the zeroth Fourier
component $U_0$ within the crystal, as we did in Chapter 9. The result of retaining $U_0$ is simply to
shift the formulas of Chapter 9 for the levels within the crystal by that amount. Note that $U_0$
need not be small for the method of Chapter 9 to work, in contrast to the Fourier components
$U_K$, with $\mathbf{K} \neq 0$.

We examine a Bloch level for the infinite crystal with a wave vector k that is near the Bragg
plane determined by K, but near no other Bragg plane, so that in a weak periodic potential the
wave function of the level is a linear combination of plane waves with wave vectors k and k - K.
In Chapter 9 we required k to be real, to satisfy the Born-von Karman boundary condition. In
a semi-infinite crystal, however, the component of k normal to the crystal surface need not be real,
provided that it yields a decaying wave in the negative x-direction (into the metal). Outside of
the metal, the Bloch function must be joined onto a solution of the free-space Schrödinger equation
that decays in the positive x-direction (away from the metal). Thus, outside the metal we take

$$\psi(\mathbf{r}) = e^{-px + i\mathbf{k}_\parallel \cdot \mathbf{r}}, \qquad x > a, \tag{18.33}$$

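and inside,

$$\psi(\mathbf{r}) = e^{qx + ik_0 x + i\mathbf{k}_\parallel \cdot \mathbf{r}} \left(c_{\mathbf{k}} + c_{\mathbf{k}-\mathbf{K}}\, e^{-iKx}\right), \qquad x < a, \tag{18.34}$$

where $\mathbf{k}_\parallel$ is the part of k parallel to the surface, and the coefficients in (18.34) are determined by
the secular equation (9.24) (with the energy $\mathcal{E}$ shifted by the constant $U_0$):

$$(\mathcal{E} - \mathcal{E}^0_{\mathbf{k}} - U_0)\, c_{\mathbf{k}} - U_K\, c_{\mathbf{k}-\mathbf{K}} = 0,$$

$$-U_K\, c_{\mathbf{k}} + (\mathcal{E} - \mathcal{E}^0_{\mathbf{k}-\mathbf{K}} - U_0)\, c_{\mathbf{k}-\mathbf{K}} = 0. \tag{18.35}$$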

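(a) Verify that for (18.35) to yield real energies for $q \neq 0$ it is necessary that $k_0 = K/2$.

(b) Show that when $k_0 = K/2$, the resulting energies are

$$\mathcal{E} = \frac{\hbar^2}{2m}\left(k_\parallel^2 + \frac{1}{4}K^2 - q^2\right) + U_0 \pm \sqrt{U_K^2 - \left(\frac{\hbar^2}{2m}\, qK\right)^2}. \tag{18.36}$$

(c) Show that continuity of $\psi$ and $\nabla\psi$ at the surface leads to the condition

$$p + q = \frac{1}{2} K \tan\left(\frac{1}{2} K a + \delta\right), \tag{18.37}$$

where

$$\frac{c_{\mathbf{k}}}{c_{\mathbf{k}-\mathbf{K}}} = e^{2i\delta}. \tag{18.38}$$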

(d) Taking the case a = 0, and using the fact that outside of the metal

$$\mathcal{E} = \frac{\hbar^2}{2m}\left(k_\parallel^2 - p^2\right), \tag{18.39}$$

show that (18.35) to (18.39) have a solution:

$$q = -\frac{1}{4} K\, \frac{U_K}{\mathcal{E}_0} \sin 2\delta, \tag{18.40}$$

where

$$\sec^2 \delta = -\frac{U_0 + U_K}{\mathcal{E}_0}, \qquad \mathcal{E}_0 = \frac{\hbar^2}{2m}\left(\frac{K}{2}\right)^2. \tag{18.41}$$

(Note that this solution exists only when $U_0$ and $U_K$ are negative, and $|U_0| + |U_K| > \mathcal{E}_0$.)

In Chapter 7 we discussed the classification of solids on the basis of the symmetry of
their  crystal  structures.  The  categories  described  there  are  very  important,  but  are 
based  entirely  on  a  single  aspect  of  the  solid:  its  geometrical  symmetries.  Such  a 
classification  scheme  is  blind  to  important  structural  aspects  of  a  solid  that  affect  its 
physical  (even  if  not  its  purely  geometrical)  properties.  Thus  within  each  of  the  seven 
crystal  systems  one  can  find  solids  exhibiting  the  full  range  of  electrical,  mechanical, 
and  optical  properties. 

In  this  chapter  we  describe  another,  less  rigorous,  classification  scheme,  which  is 
not  based  on  symmetry,  but  emphasizes  physical  properties.  The  scheme  is  based  on 
the  configuration  of  the  valence  electrons.1 

The  most  important  distinction  determined  by  the  valence  electrons  is  that  between 
metals  and  insulators.  As  we  have  seen  (Chapter  8)  the  difference  between  metals  and 
insulators  depends  on  whether  there  are  (metals)  or  are  not  (insulators)  any  partially 
filled  energy  bands.2  In  perfect  crystals  at  zero  temperature,  provided  that  the 
independent  electron  approximation  is  valid,  this  is  a  completely  rigorous  criterion, 
leading  to  two  unambiguous  categories.3 

The  basis  for  these  two  categories  is  the  distribution  of  electrons,  not  in  real  space, 
but  in  wave  vector  space.  Nowhere  near  as  rigorous  a  criterion  can  be  found  to 
distinguish  metals  from  insulators  based  on  the  distribution  of  electrons  in  real 
space.  One  can  only  make  the  qualitative  observation  that  the  electronic  distribution 
in  metals  is  generally  not  nearly  so  concentrated  in  the  vicinity  of  the  ion  cores  as 
it  is  in  insulators.  This  is  illustrated  in  Figure  19.1,  where  the  wave  functions  of  the 
occupied  electronic  levels  in  atomic  sodium  and  atomic  neon  are  plotted  about  two 
centers  whose  separation  is  equal  to  the  nearest-neighbor  separation  in  the  solid. 
The  electronic  density  in  sodium  remains  appreciable  even  midway  between  the 
atoms,  but  it  is  quite  small  in  neon.  If  one  tried  to  construct  from  this  an  argument 
that  solid  neon  should  be  insulating  and  solid  sodium  conducting,  the  chain  of 
thought  would  be  something  like  this:  Appreciable  overlap  of  atomic  wave  functions 
suggests — from  the  point  of  view  of  tight-binding  theory  (Chapter  10) — the  presence 
of  broad  bands,  which  in  turn  leads  to  the  possibility  of  considerable  band  overlap, 
and,  hence,  metallic  properties.  One  is  thus  led  rather  quickly  back  to  k-space,  where 
the  only  really  satisfactory  criterion  can  be  given. 


1  As  elsewhere  in  this  book,  we  take  the  view  that  solids  are  composed  of  ion  cores  (i.e.,  nuclei  and 
those  electrons  so  strongly  bound  as  to  be  negligibly  perturbed  from  their  atomic  configurations  by  their 
environment  in  the  solid)  and  valence  electrons  (i.e.,  those  electrons  whose  configuration  in  the  solid  may 
differ  significantly  from  that  in  the  isolated  atom.)  As  emphasized  earlier,  the  distinction  between  core 
and  valence  electrons  is  one  of  convenience.  In  metals — especially  in  simple  metals — it  is  frequently  enough 
to  consider  only  the  conduction  electrons  as  valence  electrons,  including  all  others  in  the  rigid  ion  core. 
In  transition  metals,  however,  it  may  be  of  considerable  importance  to  consider  the  electrons  in  the  highest 
d-shells  as  valence  rather  than  core  electrons.  In  saying  that  the  classification  scheme  is  based  on  the  valence 
electrons,  we  mean  only  that  it  depends  on  those  aspects  of  the  atomic  electronic  configuration  that  are 
significantly altered when the atoms are assembled together into the solid.

2  The  distinction  also  depends  on  the  validity  of  the  independent  electron  approximation  (or,  less 
stringently,  on  the  validity  of  the  quasiparticle  picture  (Chapter  17)). 

3  At  nonzero  temperatures  the  distinction  may  be  blurred  in  insulators  with  small  energy  gaps, 
because  of  thermal  excitation  of  electrons  into  the  conduction  band.  Such  solids  are  known  as  intrinsic 
semiconductors.  Impurities  in  an  otherwise  insulating  solid  may  also  contribute  electrons  that  are  easily 
thermally  excited  into  the  conduction  band,  leading  to  extrinsic  semiconductors.  The  characteristic 
properties  of  semiconductors  are  discussed  in  Chapter  28.  From  the  point  of  view  of  this  chapter  (which 
is concerned only with perfect crystals at T = 0), all semiconductors are insulators.

Figure 19.1

The calculated radial atomic wave functions $r\psi(r)$ for (a) neon [$1s^2 2s^2 2p^6$] and
(b) sodium [$1s^2 2s^2 2p^6 3s^1$]. The wave functions are drawn about two centers whose
separation is taken to be the observed nearest-neighbor distance in the solid (neon, 3.1 Å;
sodium, 3.7 Å). There is a very small overlap of the 2s and 2p orbitals in neon. The 2s and
2p orbitals overlap considerably less in sodium, but there is enormous overlap of the 3s
wave functions. (Curves are from calculations by D. R. Hartree and W. Hartree, Proc. Roy.
Soc. A193, 299 (1948).)

THE CLASSIFICATION OF INSULATORS

The  distinction  between  metals  and  insulators  is  based  on  the  electronic  distribution 
in k-space, which specifies which of the possible k-levels are occupied. However, it
is  of  great  utility  to  draw  further  distinctions,  within  the  family  of  insulators,  on 
the  basis  of  the  spatial  electronic  distribution.  There  are  three  broadly  recognizable 
kinds  of  insulating  solids  with  clearly  distinguishable  types  of  spatial  electronic 
distributions.4  The  categories  cannot  be  rigorously  specified,  and  we  shall  encounter 
borderline  cases,  but  the  prototypes  of  each  class  are  easily  delineated. 

1. Covalent Crystals These can be described as having spatial electronic distributions
not that dissimilar from metals, but with no partially filled bands in k-space.
Thus the electrons in covalent crystals need not be sharply localized in the neighborhood
of the ion cores. On the other hand, covalent crystals are not likely to have
the nearly uniform distribution of electronic density in the interstitial region characteristic
of simple metals, whose one-electron wave functions are nearly plane waves
between  ion  cores.  It  is  more  likely  that  the  interstitial  electronic  distribution  will 
be  localized  in  certain  preferred  directions,  leading  to  what  are  known  in  the  language 
of  chemistry  as  “bonds”.  An  example  of  a  covalent  crystal  (also  known  as  a  valence 
crystal)  is  diamond,  an  insulator  with  a  band  gap  of  5.5  eV.  There  is  an  appreciable 
interstitial  density  of  electrons  in  diamond,  which  is  highly  concentrated  in  the  vicinity 
of  the  lines  joining  each  carbon  ion  core  to  its  four  nearest  neighbors  (Figure  19.2). 
This  interstitial  charge  density  is  the  characteristic  feature  distinguishing  covalent 
crystals  from  the  other  two  insulating  types. 

2.  Molecular  Crystals  The  outstanding  examples  of  molecular  crystals6  are  the 
solid  noble  gases,  neon,  argon,  krypton,  and  xenon.7  In  the  atomic  state  they  have 


4  Because  hydrogen  is  in  several  respects  unique  among  atoms,  a  fourth  type — the  hydrogen-bonded 
solid— is  frequently  added  to  the  list.  We  shall  briefly  describe  this  type  of  solid  at  the  end  of  the  chapter. 

5 Chemists would refer to this distribution of charge as the four electronic carbon bonds. From the
point of view of Bloch theory, however, it is simply a property of the occupied electronic levels that leads
to a charge density

$$\rho(\mathbf{r}) = -e \sum_{\text{all valence band levels}} |\psi(\mathbf{r})|^2,$$
that  is  appreciable  in  certain  directions  far  from  the  ion  cores,  in  spite  of  there  being  a  substantial  energy 
gap  between  occupied  and  unoccupied  levels,  so  that  the  crystal  is  an  insulator. 

6 The name “molecular crystal” reflects the fact that the entities out of which such substances are
composed  differ  little  from  the  isolated  individual  molecules.  In  the  case  of  the  noble  gases  the  “molecules” 
happen  to  be  identical  to  the  atoms.  Nevertheless,  these  structures  are  called  molecular  crystals,  rather 
than  atomic  crystals,  to  permit  the  inclusion  of  such  additional  substances  as  solid  hydrogen,  nitrogen, 
etc.  In  these  materials  the  constituent  entities  are  H2  or  N2  molecules,  which  are  weakly  perturbed  from 
their  free  form  in  the  solid.  Although  solid  hydrogen  and  nitrogen  are  perhaps  more  deserving  of  the  name 
“molecular  crystal”  than  the  solid  noble  gases,  they  provide  considerably  less  clearcut  examples,  since 
within  each  molecule  the  electronic  distribution  is  not  localized  about  the  two  ion  cores.  Therefore  if  one 
should  focus  one's  attention  on  the  individual  ion  cores  as  the  fundamental  building  blocks,  one  would 
have  to  describe  solids  such  as  hydrogen  and  nitrogen  as  partially  molecular  and  partially  covalent. 

7 Solid helium is a somewhat pathological example of a molecular solid, because of the very light
mass of the helium atom. Even at T = 0 the liquid phase is more stable, unless considerable external
pressure is applied.

Figure 19.2

The electronic charge distribution on a plane section of the conventional cubic cell of
diamond, as suggested by X-ray diffraction data. The curves in (a) are curves of constant
electronic density. The numbers along the curves indicate electronic density in electrons
per cubic angstrom. The plane section of the cell that (a) describes is displayed in (b).
Note that the electron density is quite high (5.02 electrons per cubic angstrom, as compared
with 0.034 in the regions of lowest density) at the points where the plane intersects the
nearest neighbor bonds. This is characteristic of covalent crystals. (Based on a figure from
Y. K. Syrkin and M. E. Dyatkina, Structure of Molecules and the Chemical Bond, translated
and revised by M. A. Partridge and D. O. Jordan, Interscience, New York, 1950.)


completely  filled  electronic  shells — a  highly  stable  configuration  which  is  only  weakly 
perturbed  in  the  solid.  From  the  point  of  view  of  band  structure  the  noble  gases  are 
fine  examples  of  extreme  tight-binding  solids;  i.e.,  there  is  very  little  electronic  density 
between  ion  cores,  all  the  electrons  remaining  well  localized  in  the  neighborhood  of 
their  parent  ions.  For  many  purposes  the  whole  theory  of  band  structure  is  somewhat 
beside  the  point  for  such  solids,  since  all  electrons  can  be  considered  to  be  core 
electrons.8  A  discussion  of  molecular  crystals  must  start  with  a  consideration  of 
those  weak  perturbations  on  the  atoms  that  do  occur  when  the  solid  is  formed. 

3. Ionic Crystals Ionic crystals, such as sodium chloride, are compounds composed
of a metallic and a nonmetallic element. Like molecular crystals, ionic crystals have


8  The  conduction  band  levels  have  been  calculated  and,  as  expected,  lie  several  electron  volts  above 
the filled band that accommodates the eight atomic valence electrons.

electronic charge distributions that are highly localized in the neighborhood of the
ion  cores.  However,  in  molecular  crystals  all  electrons  remain  very  close  to  their 
parent  atoms,  while  in  ionic  crystals  some  electrons  have  strayed  so  far  from  their 
parents  as  to  be  closely  bound  to  the  constituent  of  the  opposite  type.  Indeed,  one 
might  consider  an  ionic  crystal  to  be  a  molecular  crystal  in  which  the  constituent 
molecules (which come in two varieties) are not sodium and chlorine atoms, but Na+
and Cl- ions, whose charge distribution is only weakly perturbed in the solid from
what  it  would  be  in  the  isolated  free  ions.  Because,  however,  the  localized  entities 
composing  an  ionic  crystal  are  not  neutral  atoms,  but  charged  ions,  the  immense 
electrostatic  forces  among  the  ions  play  an  overbearing  role  in  determining  the 
properties  of  ionic  crystals,  which  are  very  different  from  those  of  molecular  solids. 

The  electronic  charge  distributions  that  characterize  the  three  basic  categories  of 
insulators are summarized in Figure 19.3.

Figure 19.3

Highly schematic two-dimensional representation of the electronic charge distribution in the basic
solid types. The small circles represent the positively charged nuclei, and the shaded parts,
regions in which the electronic density is appreciable (though by no means uniform). We have
(a) molecular (represented by two-dimensional “argon”); (b) ionic (“potassium chloride”);
(c) covalent (“carbon”); (d) metallic (“potassium”).


One  frequently  finds  these  distinctions  drawn  with  emphasis  not  so  much  on  the 
spatial  configuration  of  the  electron  as  on  the  so-called  bonding.  This  is  a  point  of 
view particularly dear to chemists, for whom the question of overwhelming importance
in classifying a solid is what holds it together. The two points of view are
closely related, since the Coulomb attraction between the electrons and atomic nuclei
is the ultimate “glue” binding any solid together. Thus the nature of the binding
depends critically on the spatial arrangement of the electrons. However, from the
point of view of contemporary physics, and especially in the study of macroscopic
solids, the energy required to assemble an object is not nearly as fundamental a
property as it can be for the chemist. We have therefore chosen to emphasize spatial
electronic structure in describing the categories, rather than (as has become traditional)
bonding. To the physicist, bonding is only one of many properties strongly
affected by this spatial distribution.

One  should  nevertheless  be  aware  of  the  nomenclatural  implications  of  this 
chemical point of view: One speaks of “the metallic bond,” “the ionic bond,” “the
covalent bond,” and “the hydrogen bond” when referring to the characteristic way
in  which  the  electrostatic  forces  conspire  to  hold  solids  of  the  corresponding  types 
together.  We  shall  have  more  to  say  about  the  binding  energies  (also  known  as  cohesive 
energies)  of  the  various  solid  types  in  Chapter  20. 

In  the  remainder  of  this  chapter  we  wish  to  amplify  the  distinctions  among  the 
basic  categories  of  solids,  emphasizing  both  the  very  different  kinds  of  models  used 
in  describing  the  extreme  types,  as  well  as  the  continuity  that  prevails  between  the 
different  categories.  The  discussion  that  follows  vastly  oversimplifies  each  type.  What 
we  are  offering  is  a  series  of  models  on  the  level  of  sophistication  of  the  Drude  model 
of  a  metal,  from  which  to  begin  the  analysis  of  the  several  categories.  More  quantitative 
aspects  of  these  models  will  be  found  in  Chapter  20. 

IONIC  CRYSTALS 

The  most  naive  model  of  an  ionic  crystal  treats  all  ions  as  impenetrable  charged 
spheres. The crystal is held together by the electrostatic attraction between positively
and negatively charged spheres, and is prevented from collapsing by their
impenetrability. 
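This naive model is simple enough to write down in a few lines. The Python sketch below uses Gaussian units with e^2 taken as 14.40 eV times one angstrom. The function name is our own, and the ionic radii used in the example (0.97 and 1.81 angstroms, roughly appropriate to Na+ and Cl-) are illustrative assumptions.

```python
import math

E2_EV_ANGSTROM = 14.40  # e^2 in eV * angstrom (Gaussian units)

def pair_potential(r, z1, z2, radius1, radius2):
    """Interaction energy in eV of two charged hard spheres a distance r apart.

    r, radius1, radius2 are in angstroms; z1 and z2 are the ionic charges in
    units of e. The spheres cannot interpenetrate, so the energy is infinite
    for r less than the sum of the radii, and purely Coulombic beyond it.
    """
    if r < radius1 + radius2:
        return math.inf
    return z1 * z2 * E2_EV_ANGSTROM / r

print(pair_potential(2.82, +1, -1, 0.97, 1.81))  # oppositely charged, near contact
print(pair_potential(2.00, +1, -1, 0.97, 1.81))  # forbidden overlap
```

The attraction is strongest when oppositely charged spheres are nearly touching, where it amounts to several electron volts per pair, and the hard core is all that prevents the energy from decreasing without limit.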

The  impenetrability  is  a  consequence  of  the  Pauli  exclusion  principle  and  the 
stable  closed-shell  electronic  configurations  of  the  ions.  When  two  ions  are  brought 
so  close  together  that  their  electronic  charge  distributions  start  to  overlap,  the 
exclusion  principle  requires  that  the  excess  charge  introduced  in  the  neighborhood 
of  each  ion  by  the  other  be  accommodated  in  unoccupied  levels.  However,  the 
electronic  configuration  of  both  positive  and  negative  ions  is  of  the  stable  closed-shell 
$ns^2np^6$ variety, which means that a large energy gap exists between the lowest unoccupied
levels and the occupied ones. As a result it costs much energy to force
the  charge  distributions  to  overlap;  i.e.,  a  strongly  repulsive  force  exists  between 
ions,  whenever  they  are  so  close  together  that  their  electronic  charge  distributions 
interpenetrate. 

For  the  qualitative  points  we  wish  to  make  in  this  chapter  it  is  enough  to  consider 
the  ions  as  impenetrable  spheres — i.e.,  to  take  the  potential  representing  this  repulsive 
force  as  infinite  within  a  certain  distance,  and  zero  beyond  that  distance.  We  stress, 
though,  that  the  ions  are  not  rigorously  impenetrable.  In  detailed  calculations  on 
ionic crystals one must assume a less simplistic form for the dependence of the repulsive
potential on interionic separation. (We shall give an elementary illustration of
this  in  Chapter  20.)  Furthermore,  in  a  more  realistic  picture  the  ions  cannot  be  taken 
to  be  rigorously  spherical,  since  they  are  distorted  from  their  (rigorously  spherical) 
shape in free space by their neighbors in the crystal.

ALKALI HALIDES (I-VII IONIC CRYSTALS)

The  ideal  ionic  crystal  of  spherical,  charged  billiard  balls  is  most  nearly  realized  by 
the  alkali  halides.  These  crystals  are  all  cubic  at  normal  pressures.  The  positive  ion 
(cation) is one of the alkali metals (Li+, Na+, K+, Rb+, or Cs+) and the negative
ion (anion) one of the halogens (F-, Cl-, Br-, or I-). They all crystallize under
normal conditions in the sodium chloride structure (Figure 19.4a) except for CsCl,
CsBr, and CsI, which are most stable in the cesium chloride structure (Figure 19.4b).

To  understand  why  the  basic  entities  in  these  structures  are  ions  rather  than 
atoms,  consider,  for  example,  RbBr.  An  isolated  bromine  atom  can  actually  attract 
an additional electron to form the stable anion Br-, with the closed-shell electronic
configuration  of  krypton.  The  additional  electron  has  a  binding  energy9  of  about 


Figure 19.4

(a) The sodium chloride structure; (b) the cesium chloride structure; (c) the zincblende
(sphalerite) structure. The side of the conventional cubic cell, a, and the nearest-neighbor
distance, d, are indicated in each case. Inspection of the figure reveals that they are related
by: (a) sodium chloride: $d = a/2$; (b) cesium chloride: $d = \sqrt{3}\,a/2$; (c) zincblende:
$d = \sqrt{3}\,a/4$. For detailed descriptions of these structures see Chapter 4.


9 This is usually put by saying that bromine has an “electron affinity” of 3.5 eV. It may at first appear
surprising that a neutral atom can bind an extra electron. The binding is possible because the cloud of
atomic electrons surrounding the nucleus is not completely effective in screening its field over the sixth
and last p-orbital (the one holding the extra electron) which penetrates rather far into the cloud.

3.5 eV. However, to produce the cation Rb+ (also with the closed-shell krypton
configuration) it costs 4.2 eV to liberate the extra electron. It might therefore appear
that a rubidium atom and a bromine atom together would have an energy 0.7 eV
lower than the corresponding ions. This is indeed the case, provided that the ions
are very far apart. If, however, we bring the ions together, the energy of the pair is
lowered by their attractive electrostatic interaction. In crystalline RbBr the interionic
distance is about r = 3.4 Å. A pair of ions at this distance has an additional Coulomb
energy of $-e^2/r = -4.2$ eV, which more than compensates for the 0.7 eV favoring
the atoms over the ions at large separation.
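
The bookkeeping in this argument can be checked with a few lines of Python. This is only a rough sketch: it uses the rounded values quoted above (3.5 eV, 4.2 eV, 3.4 Å), treats the ions as point charges, and ignores both the short-range repulsion and the other ions of the crystal. The constant `E2_EV_ANGSTROM` and the function names are illustrative assumptions, not part of the text.

```python
# Energy bookkeeping for an Rb + Br pair, using the rounded values quoted in the text.
E2_EV_ANGSTROM = 14.40  # e^2 in eV*angstrom (assumed standard value, not given in the text)


def coulomb_energy_ev(r_angstrom):
    """Coulomb energy -e^2/r (in eV) of two opposite unit point charges r angstroms apart."""
    return -E2_EV_ANGSTROM / r_angstrom


def ion_pair_energy_ev(ionization_ev, affinity_ev, r_angstrom):
    """Energy of the ion pair at separation r, relative to the two neutral atoms far apart."""
    return ionization_ev - affinity_ev + coulomb_energy_ev(r_angstrom)


if __name__ == "__main__":
    cost = 4.2 - 3.5  # eV needed to turn distant Rb and Br atoms into distant ions
    print(f"cost of forming distant ions: {cost:.1f} eV")
    print(f"Coulomb energy at 3.4 angstroms: {coulomb_energy_ev(3.4):.1f} eV")
    print(f"net energy of the pair: {ion_pair_energy_ev(4.2, 3.5, 3.4):.1f} eV")
```

With these inputs the Coulomb term outweighs the cost of forming the ions by several electron volts, which is the point of the paragraph above.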

The  picture  of  an  alkali  halide  as  a  set  of  spherical  ions  packed  together  is  confirmed 
by  the  electronic  charge  distributions  inferred  from  X-ray  diffraction  data.  Figure  19.5 
shows  the  charge  distribution  these  experiments  suggest  for  sodium  chloride. 


Figure 19.5

Electronic charge density in a [100] plane of NaCl containing the ions, as inferred from X-ray
diffraction data. The numbers give the values of the density along lines of constant density, in
units of electrons per cubic angstrom. The lines perpendicular to the constant density curves are
error bars. (After G. Schoknecht, Z. Naturforschung 12, 983 (1957).)

The idea that the alkali halides are composed of slightly distorted localized ions
is also confirmed by band structure calculations. Figure 19.6 shows the energy bands
calculated for KCl as a function of (externally imposed) lattice constant, compared
with  the  corresponding  levels  of  the  free  ions.  The  band  energies  can  differ  by  as  much 
as  half  a  rydberg  from  the  energies  of  the  levels  of  the  isolated  ions  even  at  large 
separations,  because  of  the  interionic  Coulomb  interactions.  However,  the  band 
widths  at  the  observed  lattice  constant  are  all  exceedingly  narrow,  indicating  that 
there  is  little  overlap  of  the  ionic  charge  distributions. 


Figure 19.6

The four highest filled energy bands of KCl, calculated as a function
of the interionic distance d (measured in Bohr radii). The
vertical line is at the observed d. The energies of the free ions are
indicated by the arrows on the right. Note that although the
energies are considerably shifted in the crystalline state, the bands
remain very narrow. (From L. P. Howland, Phys. Rev. 109, 1927
(1958).)


Ionic  Radii 

The values of the conventional cubic cell side a given by X-ray diffraction measurements
on the 20 alkali halide crystals are consistent with an elementary model in
which the ions are regarded as impenetrable spheres of a definite radius r, known
as the ionic radius. Let d be the distance between the centers of neighboring positive
and negative ions, so that $d = a/2$ in the sodium chloride structure, and $d = \sqrt{3}\,a/2$ in
the cesium chloride structure (see Figure 19.4). Table 19.1 lists the values of d for the
alkali halide crystals.10 If we take each of the nine ions to be a sphere of definite
radius, then in most cases we can fit the nearest-neighbor distance $d_{XY}$ for the alkali
halide XY to an accuracy of around 2 percent by taking $d_{XY} = r_X + r_Y$. The only
exceptions are LiCl, LiBr, and LiI, where the radius sum falls short of d by 6, 7, and
8 percent, respectively, and NaBr and NaI, where the radius sum falls short by 3 and
4 percent.
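
The quality of the fit $d_{XY} = r_X + r_Y$ can be tabulated directly from the numbers in Table 19.1. The following sketch simply recomputes the percentage by which the radius sum falls short of the observed d for each of the 20 crystals; the dictionary layout and the function name `shortfall_percent` are illustrative choices, not anything prescribed by the text.

```python
# Hard-sphere check of Table 19.1 (all lengths in angstroms).
ALKALI_RADIUS = {"Li": 0.60, "Na": 0.95, "K": 1.33, "Rb": 1.48, "Cs": 1.69}
HALOGEN_RADIUS = {"F": 1.36, "Cl": 1.81, "Br": 1.95, "I": 2.16}

# Observed nearest-neighbor distances d, one row per halogen, columns in alkali order.
_D_ROWS = {
    "F": (2.01, 2.31, 2.67, 2.82, 3.00),
    "Cl": (2.57, 2.82, 3.15, 3.29, 3.57),
    "Br": (2.75, 2.99, 3.30, 3.43, 3.71),
    "I": (3.00, 3.24, 3.53, 3.67, 3.95),
}
D_OBSERVED = {
    (alkali, halogen): d
    for halogen, row in _D_ROWS.items()
    for alkali, d in zip(ALKALI_RADIUS, row)
}


def shortfall_percent(alkali, halogen):
    """Percent by which r_X + r_Y falls short of the observed d (negative if it overshoots)."""
    d = D_OBSERVED[(alkali, halogen)]
    radius_sum = ALKALI_RADIUS[alkali] + HALOGEN_RADIUS[halogen]
    return 100.0 * (d - radius_sum) / d


if __name__ == "__main__":
    # List the crystals from worst to best agreement.
    for alkali, halogen in sorted(D_OBSERVED, key=lambda key: -shortfall_percent(*key)):
        print(f"{alkali}{halogen:<2} {shortfall_percent(alkali, halogen):6.1f} %")
```

Sorting by shortfall puts the three heavier lithium halides at the top of the list, followed by NaI and NaBr, in line with the exceptions named above.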

Leaving  aside,  for  the  moment,  these  exceptions,  this  shows  that  the  observed 
lattice  constants  can  be  reproduced  to  a  few  percent  accuracy  by  assuming  the  ions 
to  be  hard  spheres  of  the  specified  radii,  close  packed  into  a  sodium  chloride  (or 
cesium  chloride)  structure.  However,  the  choice  of  ionic  radii  is  not  unique,  for  by 
adding a fixed amount $\Delta r$ to all the alkali radii and subtracting the same $\Delta r$ from
the halogen radii ($r_X \to r_X + \Delta r$, $r_Y \to r_Y - \Delta r$) the value of $r_X + r_Y$ will be unaffected.11
This  ambiguity  can  be  resolved  and  the  anomalous  behavior  of  the  lithium  halides 
can  be  explained  by  the  following  further  observation: 

Our  assertion  that  the  nearest-neighbor  distance  d  is  given  by  the  sum  of  the  radii 
of  the  ions  whose  centers  are  separated  by  d  assumes  that  such  ions  are  in  fact  in 
contact (Figure 19.7). This will be the case provided the radius $r^>$ of the larger ion
is not too much greater than the radius $r^<$ of the smaller. Should the disparity be
too great, however, the smaller ions may not touch the larger ones at all (Figure 19.8).
In that case d will be independent of the size of the smaller ion altogether, and be
determined entirely by the size of the larger. The relation $r^+ + r^- = d$ appropriate
to the sodium chloride structure must then be replaced by $\sqrt{2}\,r^> = d$ (Figures 19.7
and 19.8). The critical radius ratio, at which the smaller ion loses contact, occurs when
each large ion touches both the small ion centered on the nearest-neighbor site and
the large ion centered on the next nearest-neighbor site (Figure 19.9). The critical
ratio satisfies (see Figure 19.9)

$$\frac{r^>}{r^<} = \frac{1}{\sqrt{2} - 1} = \sqrt{2} + 1 = 2.41 \quad \text{(sodium chloride structure)}. \tag{19.1}$$


From the values of $r^>/r^<$ given in Table 19.1, one finds that the critical value 2.41
is exceeded only in LiCl, LiBr, and LiI. It is thus to be expected that the value observed
for d exceeds the radial sum in these lithium halides, for in these cases d should be
compared not with $r^+ + r^-$, but with $\sqrt{2}\,r^>$. This latter quantity is listed in square
brackets after the value of $r^>/r^<$ for the three lithium halides. It fits the observed d
to the same 2 percent accuracy that the values of $r^+ + r^-$ yield in the cases in which

10 By “nearest-neighbor distance” we always mean the minimum distance between ionic centers.
Thus (for example) in Figure 19.8 the nearest-neighbor distance is d, even though the large circles touch
each other, but not the little circles. The distance between the center of a big circle and the center of a
little circle is less than the distance between centers of neighboring big circles.

11 This leads to rival schemes of ionic radii being put forth, but those quoted in Table 19.1 are still
the most widely used.

Figure 19.7

A [100] plane of the sodium chloride structure containing the centers of ions. Each large ion
makes contact only with the neighboring small ions. Therefore the nearest-neighbor distance d
is equal to the sum of the ionic radii, $r^+ + r^-$. This is the normal state of affairs.

Figure 19.8

Same plane as in Figure 19.7, but now the disparity in ionic radii is so great that each large
ion makes contact only with the nearest large ion. In this case the nearest-neighbor distance d
(defined to be the shortest distance between ionic centers) is related only to the larger ionic radius
$r^>$, by $d = \sqrt{2}\,r^>$.

Figure 19.9

The state of affairs when the radius ratio has its critical value $r^>/r^< = \sqrt{2} + 1$. For smaller
ratios Figure 19.7 holds; for larger ratios, Figure 19.8 holds. The value for the critical ratio
follows from the observation that when the ratio is critical the relations $d = \sqrt{2}\,r^>$ and
$d = r^+ + r^- = r^> + r^<$ must both hold.


they are applicable. It is not significant that one of the three errant lithium halides
can be brought into line in this way, since the free variable $\Delta r$ can be chosen to bring
this about. However, the fact that the other two are also thereby made to agree lends
great credence to the picture of the ions as impenetrable spheres, with the radii given
in Table 19.1.
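
The rule just described (use the radius sum below the critical ratio, and $\sqrt{2}\,r^>$ above it) is easy to state as a small function. The sketch below assumes the sodium chloride structure and perfectly hard spheres; the function name `predicted_d_nacl` is an illustrative choice. The radii and observed distances are those of Table 19.1.

```python
import math

NACL_CRITICAL_RATIO = math.sqrt(2) + 1  # Eq. (19.1)


def predicted_d_nacl(r_cation, r_anion):
    """Hard-sphere nearest-neighbor distance in the sodium chloride structure."""
    r_big, r_small = max(r_cation, r_anion), min(r_cation, r_anion)
    if r_big / r_small > NACL_CRITICAL_RATIO:
        # The large ions touch each other; the small ion no longer sets d.
        return math.sqrt(2) * r_big
    # Normal case: unlike ions are in contact.
    return r_cation + r_anion


if __name__ == "__main__":
    r_li = 0.60
    # (halogen radius, observed d) for the lithium halides, in angstroms.
    lithium_halides = {"LiF": (1.36, 2.01), "LiCl": (1.81, 2.57),
                       "LiBr": (1.95, 2.75), "LiI": (2.16, 3.00)}
    for name, (r_halogen, d_observed) in lithium_halides.items():
        d_model = predicted_d_nacl(r_li, r_halogen)
        print(f"{name:<5} model {d_model:.2f}  observed {d_observed:.2f}")
```

For LiCl, LiBr, and LiI the function returns the bracketed values of Table 19.1, while LiF stays on the radius-sum branch.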

A similar calculation for the cesium chloride structure yields the smaller critical
value

$$\frac{r^>}{r^<} = \frac{1}{2}\left(\sqrt{3} + 1\right) = 1.37 \quad \text{(cesium chloride)}. \tag{19.2}$$


This value is not exceeded by the radius ratios of the three alkali halides with this

Table 19.1

PROPOSED IONIC RADII FOR THE ALKALI HALIDES^a

                             Li+ (0.60)     Na+ (0.95)     K+ (1.33)      Rb+ (1.48)     Cs+ (1.69)

F- (1.36)    d               2.01           2.31           2.67           2.82           3.00
             $r^- + r^+$     1.96           2.31           2.69           2.84           3.05
             $r^>/r^<$       2.27           1.43           1.02           1.09           1.24

Cl- (1.81)   d               2.57           2.82           3.15           3.29           3.57
             $r^- + r^+$     2.41           2.76           3.14           3.29           3.50
             $r^>/r^<$       3.02 [2.56]    1.91           1.36           1.22           1.07

Br- (1.95)   d               2.75           2.99           3.30           3.43           3.71
             $r^- + r^+$     2.55           2.90           3.28           3.43           3.64
             $r^>/r^<$       3.25 [2.76]    2.05           1.47           1.32           1.15

I- (2.16)    d               3.00           3.24           3.53           3.67           3.95
             $r^- + r^+$     2.76           3.11           3.49           3.64           3.85
             $r^>/r^<$       3.60 [3.05]    2.27           1.62           1.46           1.28

^a The ionic radius (in angstroms) is given in parentheses immediately following the name of each
ion. The following additional information (all in angstroms) is listed in the box corresponding
to each alkali halide:

1. The nearest-neighbor distance d.10 In the sodium chloride structure, $d = a/2$, where a is
the side of the conventional cubic cell; in the cesium chloride structure (CsCl, CsBr, and CsI),
$d = \sqrt{3}\,a/2$. (See Figure 19.4.)

2. The sum of the ionic radii, $r^- + r^+$.

3. The ratio of the ionic radii, $r^>/r^<$. In the three cases where this ratio is so large that d is
not given by the radius sum, the new theoretical value ($\sqrt{2}\,r^>$) is listed in square brackets
immediately after the radius ratio.

Source: L. Pauling, The Nature of the Chemical Bond, 3rd ed., Cornell University Press, Ithaca, New
York, 1960, p. 514.


structure,  and  the  radial  sums  are  in  good  agreement  with  the  observed  lattice 
constants.12 

II-VI  Ionic  Crystals 

Doubly  ionized  elements  from  columns  II  and  VI  of  the  periodic  table  may  also 
form  ionic  crystals.  Except  for  the  beryllium  compounds  and  MgTe,  these  also  assume 
the  sodium  chloride  structure.  It  can  be  seen  from  Table  19.2  that  d  =  a/2  again 


12 Agreement can be somewhat improved by increasing the ionic radii in the cesium chloride structure
to account for the fact that each ion has eight nearest neighbors rather than the six it is surrounded by
in the sodium chloride structure. Consequently, the Pauli principle repulsion is stronger, and the ions will
not be squeezed so closely together.

Table 19.2

PROPOSED IONIC RADII FOR THE DOUBLY IONIZED ELEMENTS FROM
COLUMNS II AND VI OF THE PERIODIC TABLE^a

                             Be++ (0.31)    Mg++ (0.65)    Ca++ (0.99)    Sr++ (1.13)    Ba++ (1.35)

O-- (1.40)   d               1.64           2.10           2.40           2.54           2.76
             $r^- + r^+$     1.71           2.05           2.39           2.53           2.75
             $r^>/r^<$       4.52           2.15           1.41           1.24           1.04

S-- (1.84)   d               2.10           2.60           2.85           3.01           3.19
             $r^- + r^+$     2.15           2.49           2.83           2.97           3.19
             $r^>/r^<$       5.94 [2.25]    2.83 [2.60]    1.86           1.63           1.36

Se-- (1.98)  d               2.20           2.72           2.96           3.11           3.30
             $r^- + r^+$     2.29           2.63           2.97           3.11           3.33
             $r^>/r^<$       6.39 [2.42]    3.05 [2.80]    2.00           1.75           1.47

Te-- (2.21)  d               2.41           2.75           3.17           3.33           3.50
             $r^- + r^+$     2.52           2.86           3.20           3.34           3.56
             $r^>/r^<$       7.13 [2.71]    3.40           2.23           1.96           1.64

^a The ionic radius (in angstroms) is given in parentheses immediately following the name of each
ion. In addition, the following information is given in the box corresponding to each compound:

1. The nearest-neighbor distance d.

2. The sum of the ionic radii, $r^+ + r^-$.

3. The ratio of the ionic radii, $r^>/r^<$.

All compounds have the sodium chloride structure except for BeS, BeSe, and BeTe (zincblende)
and BeO, MgTe (wurtzite). In the two magnesium compounds for which the radius ratio exceeds
the critical value 2.41 for the sodium chloride structure, the corrected theoretical value $d = \sqrt{2}\,r^>$
is given in brackets. In the zincblende structures, the critical ratio 4.45 is exceeded in all cases,
and the corrected value $d = \sqrt{6}\,r^>/2$ is given in brackets. These (and crystals with the wurtzite
structure) are better treated as covalent.

Source: L. Pauling, The Nature of the Chemical Bond, 3rd ed., Cornell University Press, Ithaca, N.Y.,
1960, p. 514.


agrees to within a few percent with $r^+ + r^-$ for the calcium, strontium, and barium
salts, and MgO. In MgS and MgSe the critical ratio $r^>/r^< = 2.41$ is exceeded; here
$d = a/2$ agrees with $\sqrt{2}\,r^>$ to within about 3 percent.

However, agreement with the ionic radii is not nearly as satisfactory for MgTe
and the beryllium compounds. BeS, BeSe, and BeTe crystallize in the zincblende
(sphalerite) structure (see Chapter 4 and Figure 19.4), and the other two in the
wurtzite structure.13 The critical ratio14 $r^>/r^<$ is $2 + \sqrt{6} = 4.45$, which is exceeded
in all three beryllium compounds with the zincblende structure.15 The quoted value
of d (taken to be $\sqrt{3}/4$ times the measured side a of the conventional cubic cell)
should then be compared not with $r^+ + r^-$, but with $\sqrt{6}\,r^>/2$, which is listed in
square brackets in Table 19.2 after $r^>/r^<$. The agreement is relatively poor compared
with the impressive agreement we found in the crystals with the sodium chloride
structure.
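
The critical ratios quoted for the three structures all follow from one piece of geometry: at the critical ratio the large ions touch each other and also touch the small ions. A minimal sketch of that argument is given below. It assumes hard spheres, and takes the spacing between nearest like ions to be $a/\sqrt{2}$ in the sodium chloride and zincblende structures and $a$ in the cesium chloride structure; the dictionary and function names are illustrative.

```python
import math

# k = (distance between nearest like ions) / (nearest-neighbor distance d)
LIKE_ION_SPACING_OVER_D = {
    "sodium chloride": math.sqrt(2),      # like ions a/sqrt(2) apart, d = a/2
    "cesium chloride": 2 / math.sqrt(3),  # like ions a apart, d = sqrt(3) a/2
    "zincblende": 4 / math.sqrt(6),       # like ions a/sqrt(2) apart, d = sqrt(3) a/4
}


def critical_ratio(structure):
    """Radius ratio at which large ions touch both each other and the small ions."""
    k = LIKE_ION_SPACING_OVER_D[structure]
    r_big = k / 2          # from 2 r_big = k d, in units of d
    r_small = 1 - r_big    # from r_big + r_small = d
    return r_big / r_small


def d_when_large_ions_touch(structure, r_big):
    """Nearest-neighbor distance once only the large ions are in contact."""
    return 2 * r_big / LIKE_ION_SPACING_OVER_D[structure]


if __name__ == "__main__":
    for structure in LIKE_ION_SPACING_OVER_D:
        print(f"{structure:<16} critical ratio {critical_ratio(structure):.2f}")
    # BeS in the zincblende structure, with the sulfur radius 1.84 angstroms of Table 19.2.
    print(f"BeS corrected d: {d_when_large_ions_touch('zincblende', 1.84):.2f}")
```

The same two contact conditions give $\sqrt{2} + 1$, $(\sqrt{3} + 1)/2$, and $2 + \sqrt{6}$ in turn, and the second function gives the replacements $\sqrt{2}\,r^>$ and $\sqrt{6}\,r^>/2$ for the radius sum.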

One reason for this is that beryllium (and to some extent magnesium) is substantially
more difficult to ionize than the other elements in column II. (First ionization
potentials, in electron volts: Be, 9.32; Mg, 7.64; Ca, 6.11; Sr, 5.69; Ba, 5.21.) Thus
the cost in energy to produce widely separated ions, as opposed to atoms, is quite
high in the beryllium compounds. In addition, because it is so small, the beryllium
ion cannot take full advantage of the crystal structures with high coordination number
to compensate for this by maximizing the interionic Coulomb interaction: The
anions  would  be  repelled  by  the  overlap  of  their  own  charge  distributions,  before 
they  came  close  enough  to  the  beryllium  ion  (as,  for  example,  in  Figure  19.8).  These 
considerations  suggest  that  in  the  beryllium  compounds  we  are  already  moving  away 
from  the  realm  of  purely  ionic  crystals. 

As  it  turns  out,  tetrahedrally  coordinated  structures  (such  as  the  zincblende  and 
wurtzite structures) tend to be primarily covalently bonded. The tetrahedrally
coordinated II-VI compounds are more covalent than ionic in character.16

III-V  CRYSTALS  (MIXED  IONIC  AND  COVALENT) 

The  crystals  that  pair  elements  from  columns  III  and  V  of  the  periodic  table  are  still 
less  ionic  in  character.  They  almost  all  assume  the  zincblende  structure  characteristic 
of covalent crystals. Some examples are given in Table 19.3. Most behave as
semiconductors rather than insulators; i.e., their band gaps are relatively small. This is
another  indication  that  their  ionic  nature  is  quite  weak,  and  the  electrons  not  strongly 
localized.  The  III-V  compounds  are  thus  good  examples  of  substances  that  are 
partially  ionic  and  partially  covalent.  They  are  conventionally  described  as  primarily 
covalent,  with  some  residual  concentration  of  excess  charge  about  the  ion  cores. 

COVALENT  CRYSTALS 

The  striking  difference  in  interstitial  charge  distributions  in  ionic  and  covalent  crystals 
is  seen  by  comparing  Figures  19.2  and  19.5;  the  electron  density  along  a  nearest- 

13 In the zincblende structure, a view along a [111] direction reveals that the atoms of a given kind
are stacked in the sequence ...ABCABC... while maintaining tetrahedral bonds with those of the other
kind. The underlying Bravais lattice is cubic. There is another arrangement that preserves the tetrahedral
bonding but stacks the atoms of a given kind in the sequence ...ABABAB.... This is the wurtzite
structure, whose underlying Bravais lattice is hexagonal.

14 The reader is invited to verify these numerological assertions.

15 This is also the critical ratio for the wurtzite structure, provided the c/a of the underlying hcp
structure is close to ideal, as it almost always is.

16 It is possible to specify covalent radii, for use in tetrahedrally coordinated structures, which are
almost as successful in fitting their lattice constants as are the ionic radii in ionic crystals. (See L. Pauling,
The Nature of the Chemical Bond, 3rd ed., Cornell University Press, Ithaca, New York, 1960.)

Figure 19.10

A highly schematic representation of the continuity from perfect covalent to perfect
ionic crystals. (a) Perfectly covalent germanium. Four electrons per unit cell are identically
distributed about the Ge4+ ion cores. The electronic density is large in certain directions
in the interstitial region. (b) Covalent gallium arsenide. The interstitial electron density
has diminished somewhat and there is a slight tendency for the cloud of electrons around
each As5+ ion core to have somewhat more charge than is necessary to compensate the
positive charge, while the electron cloud around each Ga3+ ion core has somewhat less.
The crystal thus has a very slight ionic character as well. (c) Ionic calcium selenide. The
Ca2+ ion is almost denuded of valence electrons and the cloud of electrons around the
Se6+ ion core is almost the full eight necessary to produce Se--. (It would be more
conventional, in fact, to represent the selenium as an Se-- ion core lacking a small
fraction of an electron.) The crystal is weakly covalent, to the extent that the Ca2+ is
slightly shielded by electrons in its immediate neighborhood, and the Se6+ does not
have quite enough electrons to fill the outer eight shells completely, making Se--.
The covalent character is also seen in the slight distortion of the charge distribution
outward along nearest-neighbor lines. (d) Perfectly ionic potassium chloride. The K+
ion is bare of excess electrons, and all eight electrons cluster around the Cl7+ to make
Cl-. (It would be more conventional to show no electrons at all, and simply draw a Cl-
ion core for the chlorine.)

neighbor line drops below 0.1 electron per cubic angstrom in NaCl, while it does
not get less than 5 electrons per cubic angstrom in the covalent crystal par excellence,
diamond.

Diamond is typical of the crystal structures formed by the elements from column
IV of the periodic table: carbon, silicon, germanium, and (grey) tin (see Table 4.3).
These  elements  all  crystallize  in  the  tetrahedrally  coordinated  diamond  structure.  In 
chemical  terminology,  each  atom  participates  in  four  covalent  bonds,  by  sharing  an 
electron  with  each  of  its  four  neighbors.  Although  the  ultimate  basis  for  the  bonding 
remains  electrostatic,  the  reason  why  the  crystal  remains  bound  is  considerably  more 
complex  than  the  simple  picture  of  oppositely  charged  billiard  balls  that  suits  the 
ionic  crystals  so  well.  We  shall  have  more  to  say  about  this  in  Chapter  20. 

Table 19.3

SOME COVALENT III-V COMPOUNDS^a

        Al      Ga      In

P       5.45    5.45    5.87
As      5.62    5.65    6.04
Sb      6.13    6.12    6.48

^a All have the zincblende structure. The side of the conventional
cubic cell (in angstroms) is given.

The  continuous  variation  in  the  nature  of  the  charge  distribution  as  one  progresses 
from the extreme ionic I-VII compounds through the progressively more ambiguous
II-VI,  and  III-V  compounds  over  to  the  extreme  covalent  elements  from  column 
IV,  is  indicated  schematically  in  Figure  19.10. 

Covalent  crystals  are  not  as  good  insulators  as  ionic  crystals;  this  is  consistent 
with  the  delocalization  of  charge  in  the  covalent  bond.  The  semiconductors  are  all 
covalent crystals, sometimes (as in the III-V compounds) with a small touch of
ionic  bonding. 

MOLECULAR  CRYSTALS 

As  one  moves  to  the  right  in  the  periodic  table  from  any  of  the  elements  in  the  upper 
part  of  column  IV,  the  solids  one  encounters  become  successively  more  insulating 
(or,  if  one  starts  lower  down  in  column  IV,  less  metallic)  and  more  weakly  bound 
(with,  for  example,  lower  and  lower  melting  points).  At  the  extreme  right  of  the  table 
are  the  column  VIII  elements,  which  afford  the  best  example  of  molecular  solids.  The 
solid noble gases (except for helium) all crystallize in monatomic fcc Bravais lattices.
The  electronic  configuration  of  each  atom  is  of  the  stable  closed-shell  type,  which 
is  little  deformed  in  the  solid.  The  solid  is  held  together  by  very  weak  forces,  the 
so-called  van  der  Waals  or  fluctuating  dipole  forces.  The  qualitative  physical  origin 
of  this  force  is  easily  explained:17 

17 Note that, again, the ultimate source of the bonding must be electrostatic. However, the way in
which the electrostatic attraction now manifests itself is so different (from, for example, the ionic crystals)
that this type of binding is given a separate name of its own. A more rigorous quantum-mechanical
derivation of the van der Waals attraction is given in Problem 1.

Consider two atoms (1 and 2) separated by a distance r. Although the average
charge distribution in a single rare gas atom is spherically symmetric, at any instant
there may be a net dipole moment (whose time-averaged value must vanish). If the
instantaneous dipole moment of atom 1 is $p_1$, then there will be an electric field
proportional to $p_1/r^3$ at a distance r from the atom.18 This will induce a dipole
moment in atom 2 proportional to the field:

$$p_2 = \alpha E \sim \frac{\alpha p_1}{r^3}, \tag{19.3}$$

where $\alpha$ is the polarizability19 of the atom. Since two dipoles have an energy of
interaction proportional to the product of their moments divided by the cube of
the distance between them, there will be a lowering of energy of order

$$\frac{p_2 p_1}{r^3} \sim \frac{\alpha p_1^2}{r^6} \tag{19.4}$$

associated with the induced moment. Since this quantity depends on $p_1^2$, its time
average does not vanish, even though the average value of $p_1$ is zero. Because this
force falls off rapidly with distance, it is exceedingly weak, whence the low melting
and boiling points of the condensed noble gases.
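
The two features of this estimate (a nonvanishing time average and a $1/r^6$ falloff) can be illustrated numerically. The sketch below is only an order-of-magnitude toy: it works in arbitrary units, sets every numerical prefactor to 1, treats the dipole moment as a signed number rather than a vector, and uses a made-up set of "snapshots" of the fluctuating moment.

```python
def induced_dipole_energy(p1, alpha, r):
    """Order-of-magnitude energy lowering -alpha * p1**2 / r**6 (prefactors set to 1)."""
    p2 = alpha * p1 / r**3   # induced moment of atom 2, of order alpha * p1 / r^3
    return -p1 * p2 / r**3   # dipole-dipole energy, of order p1 * p2 / r^3


def time_average(values):
    """Plain arithmetic mean, standing in for a time average over snapshots."""
    return sum(values) / len(values)


if __name__ == "__main__":
    # Hypothetical snapshots of the instantaneous moment p1, symmetric about zero.
    snapshots = [-2.0, -1.0, 1.0, 2.0]
    alpha = 1.0
    for r in (1.0, 2.0):
        energies = [induced_dipole_energy(p1, alpha, r) for p1 in snapshots]
        print(f"r = {r}: <p1> = {time_average(snapshots)}, <energy> = {time_average(energies)}")
```

The mean moment is zero while the mean energy is negative, and doubling r reduces the energy by a factor of 64, as expected for a $1/r^6$ interaction.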

For  a  more  precise  treatment  of  molecular  solids  one  must  also  consider  the 
fluctuating  dipolar  interaction  between  groups  of  three  or  more  atoms  that  cannot 
be  represented  as  a  sum  of  pair  interactions.  These  fall  off  more  rapidly  than  1/r6, 
but  are  also  important  in  arriving  at  an  accurate  theory  of  the  solid  state.20 

The  elements  in  groups  V,  VI,  and  VII  (with  the  exception  of  metallic  polonium 
and  the  semimetals  antimony  and  bismuth)  partake  to  varying  degrees  of  both 
molecular  and  covalent  character.  Solid  oxygen  and  nitrogen,  as  we  have  mentioned, 
are  molecular  crystals  in  which  the  weakly  perturbed  entities  are  not  free  atoms,  but 
O2 or N2 molecules. Within these molecules the binding is covalent, so that the
electronic  distribution  in  the  crystal  as  a  whole  has  a  mixed  molecular  and  covalent 
structure.  There  are  also  substances  (sulfur  and  selenium  are  examples)  that  possess 
elaborate  crystal  structures  and  for  which  no  simple  categorization  of  their  bonding 
is  possible. 

METALS 

As  one  moves  to  the  left  in  the  periodic  table  from  column  IV,  one  enters  the  family 
of metals; i.e., the covalent bond expands until there is an appreciable density of
electrons throughout the interstitial regions, and, in k-space, appreciable band overlap.
The  prime  examples  of  metallic  crystals  are  the  alkali  metals  of  column  I,  which  for 
many  purposes  can  be  accurately  described  by  the  Sommerfeld  free  electron  model,  in 


18  We  have  in  mind  a  distance  rather  farther  from  the  atom  than  its  linear  dimensions.  When  one 
gets  too  close  to  the  atom  the  dipole  approximation  is  no  longer  valid,  but,  more  importantly,  the  strong 
core-core  repulsion  begins  to  dominate  the  attractive  fluctuating  dipole  interaction. 

19  See  Chapter  27. 

20 B. M. Axilrod and E. Teller, J. Chem. Phys. 22, 1619 (1943); B. M. Axilrod, J. Chem. Phys. 29,
719, 724 (1951).

which the valence electrons are completely separated from their ion cores and form
a nearly uniform gas.

More  generally,  one  can  find  aspects  of  covalent  and  molecular  bonding  even  in 
metals,  particularly  in  the  noble  metals  in  which  the  filled  atomic  d-shells  are  not 
very  tightly  bound,  and  as  a  consequence  suffer  considerable  distortion  in  the  metal. 

It  is  instructive  to  compare  the  ionic  radii  of  the  metallic  elements  (as  calculated 
from the structure of the ionic crystals they participate in) with the nearest-neighbor
distance in the metal (Table 19.4). It is evident that the concept of ionic radius is
completely irrelevant in determining the alkali metal lattice constants. This is consistent
with the fact that such quantities as the alkali metal compressibilities are of
the order of their electron gas values; the ions are genuinely small objects, embedded
in a sea of electrons. In the noble metals, on the other hand, as noted in Chapter 15,
the closed d-shell plays a far more important role in determining metallic properties
than do the ion cores of the alkali metals. This is reflected in the fact that in Cu,
Ag, and Au, the nearest-neighbor distances in the metal are not that much larger than
the ionic radii in ionic crystals. In both the ionic crystals and (to only a slightly lesser
degree) the metal, size is determined by the d-shells.


Table 19.4

IONIC RADII COMPARED WITH HALF THE NEAREST-NEIGHBOR
DISTANCES IN METALS

          SINGLY IONIZED                  NEAREST-NEIGHBOR HALF
METAL     IONIC RADIUS, $r_{ion}$ (Å)     DISTANCE IN METAL, $r_{met}$ (Å)     $r_{met}/r_{ion}$

Li        0.60                            1.51                                 2.52
Na        0.95                            1.83                                 1.93
K         1.33                            2.26                                 1.70
Rb        1.48                            2.42                                 1.64
Cs        1.69                            2.62                                 1.55
Cu        0.96                            1.28                                 1.33
Ag        1.26                            1.45                                 1.15
Au        1.37                            1.44                                 1.05
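
The last column of Table 19.4 is just the quotient of the two preceding ones, and recomputing it makes the contrast between the alkali and noble metals explicit. The sketch below uses the tabulated values only; the names `RADII` and `metal_to_ion_ratio` are illustrative.

```python
# (ionic radius, half the nearest-neighbor distance in the metal), in angstroms, from Table 19.4.
RADII = {
    "Li": (0.60, 1.51), "Na": (0.95, 1.83), "K": (1.33, 2.26),
    "Rb": (1.48, 2.42), "Cs": (1.69, 2.62),
    "Cu": (0.96, 1.28), "Ag": (1.26, 1.45), "Au": (1.37, 1.44),
}
ALKALI_METALS = ("Li", "Na", "K", "Rb", "Cs")
NOBLE_METALS = ("Cu", "Ag", "Au")


def metal_to_ion_ratio(metal):
    """Ratio r_met / r_ion for one metal."""
    r_ion, r_met = RADII[metal]
    return r_met / r_ion


if __name__ == "__main__":
    for metal in RADII:
        print(f"{metal:<3} r_met/r_ion = {metal_to_ion_ratio(metal):.2f}")
```

Every alkali metal has a larger ratio than any of the noble metals, which is the quantitative content of the comparison made in the text.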

HYDROGEN-BONDED  CRYSTALS 

Some classifications list hydrogen-bonded crystals as a fourth category of insulator.
This is in recognition of the fact that hydrogen is unique in three important ways:

1. The ion core of a hydrogen atom is a bare proton of order $10^{-13}$ cm in radius,
a factor of $10^5$ smaller than any other ion core.

2. Hydrogen is but one electron shy of the stable helium configuration, which,
uniquely among stable configurations, has not eight but only two electrons in the
outermost shell.

3. The first ionization potential of atomic hydrogen is unusually high (H, 13.59 eV;
Li, 5.39 eV; Na, 5.14 eV; K, 4.34 eV; Rb, 4.18 eV; Cs, 3.89 eV).

As a result of these properties hydrogen can play a role unlike any other element
in crystalline structures. Because of its large ionization potential, it is much more
difficult to remove an electron completely from hydrogen, and it therefore does not
behave as an alkali metal ion (of minute radius) in forming ionic crystals. On the
other  hand,  it  cannot  behave  as  the  atoms  in  typical  covalent  crystals,  for  lacking 
only  one  electron  from  a  closed-shell  configuration  it  can,  in  chemical  terms,  form 
only  one  covalent  bond  through  electron  sharing.21  Finally,  because  the  proton  has, 
for  all  practical  purposes,  no  size  at  all,  it  can  essentially  sit  on  the  surface  of  the 
large  negative  ions,  resulting  in  a  type  of  structure  unattainable  with  any  other 
positive  ion. 

One  manifestation  of  these  peculiar  properties  is  illustrated  in  Figure  19.11  for 
the  case  of  ice.  The  electron  from  the  hydrogen  atom  is,  like  the  proton,  fairly  well 
localized  in  the  neighborhood  of  the  oxygen  ions.  The  positive  proton  resides  close 
to  a  single  oxygen  ion,  along  the  line  joining  it  to  one  of  its  neighbors,  thereby  helping 
to  bind  the  two  oxygen  ions  together.  (Note  the  lack  of  regularity  in  the  positions  of 
the  protons.  This  can  be  observed  thermodynamically  in  the  large  “residual  entropy” 
possessed  by  ice  at  low  temperatures,  corresponding  to  the  large  number  of  ways  of 
assigning  a  proton  to  either  end  of  each  bond,  consistent  with  two  protons  being 
close  to  each  oxygen  atom.) 


Figure  19.11 

The  crystal  structure  of  one  of  the  many  phases  of  ice.  The 
large  circles  are  oxygen  ions;  the  small  circles  are  protons. 
Ice  is  an  example  in  which  hydrogen  bonding  plays  a 
crucial  role.  (After  L.  Pauling,  The  Nature  of  The  Chemical 
Bond,  3rd.  ed.,  Cornell  University  Press,  Ithaca,  New 
York,  1960.) 


This  completes  our  descriptive  survey  of  some  of  the  various  solid  types.  We  now 
turn  to  some  of  the  elementary  quantitative  implications  of  the  various  structures 
on  bulk  solid  properties — particularly  the  binding,  or  cohesive,  energy. 


21 This contrasts with the four bonds that are present in tetrahedrally coordinated covalent crystals,
as the result of forming two closed shells from eight electrons.


PROBLEMS


1. Origin of the van der Waals Force

Consider two noble gas atoms a distance R apart, represented by fixed nuclei of charge Ze located
at 0 and R, each surrounded by Z electrons. The electrons bound to the nucleus at 0 have coordinates
$\mathbf{r}_i^{(1)}$ and those bound to the nucleus at R have coordinates $\mathbf{r}_i^{(2)}$, i = 1, ..., Z. We suppose
R to be so large that there is negligible overlap between the electronic charge distributions about
the two nuclei.22 Let $H_1$ and $H_2$ be the Hamiltonians for atoms 1 and 2 alone. The Hamiltonian
for the two-atom system will be $H = H_1 + H_2 + U$, where U gives the Coulomb interaction
between all pairs of charged particles, one from atom 1, and one from atom 2:

$$U = e^2\left[\frac{Z^2}{R} - \sum_{i=1}^{Z}\left(\frac{Z}{|\mathbf{R} - \mathbf{r}_i^{(1)}|} + \frac{Z}{|\mathbf{r}_i^{(2)}|}\right) + \sum_{i,j=1}^{Z}\frac{1}{|\mathbf{r}_i^{(1)} - \mathbf{r}_j^{(2)}|}\right]. \tag{19.5}$$


To  second  order  in  perturbation  theory,  the  interaction  energy  between  the  two  atoms  will  be 
given  by: 


$$\Delta E = \langle 0|U|0\rangle + \sum_{n}\frac{|\langle 0|U|n\rangle|^2}{E_0 - E_n}, \tag{19.6}$$


where  |0>  is  the  ground  state  of  the  unperturbed  two-atom  system,  and  |n>  its  excited  states. 

(a) Show that the first-order term in (19.6) is just the electrostatic interaction energy between
two distributions of charge density $\rho^{(1)}(\mathbf{r})$ and $\rho^{(2)}(\mathbf{r})$, where $\rho^{(1)}$ and $\rho^{(2)}$ are the ground-state
charge distributions of atoms 1 and 2.

(b)  Prove  that  if  the  charge  distributions  have  zero  overlap  and  are  spherically  symmetric, 
this  interaction  energy  is  identically  zero.23 

(c) The assumption that there is negligible overlap between the electronic states on the two
atoms also means that the wave functions appearing in the second order term in (19.6) are
negligibly small unless $|\mathbf{r}_i^{(1)}|$ and $|\mathbf{r}_i^{(2)} - \mathbf{R}|$ are small compared with R. Show that if (19.5) is
expanded in these quantities, the leading nonvanishing term is

$$-\frac{e^2}{R^3}\sum_{i,j}\left[3\left(\mathbf{r}_i^{(1)}\cdot\hat{\mathbf{R}}\right)\left([\mathbf{r}_j^{(2)} - \mathbf{R}]\cdot\hat{\mathbf{R}}\right) - \mathbf{r}_i^{(1)}\cdot\left(\mathbf{r}_j^{(2)} - \mathbf{R}\right)\right]. \tag{19.7}$$

(d) Show, as a result, that the leading term in (19.6) varies as $1/R^6$ and is negative.

2.  Geometrical  Relations  in  Diatomic  Crystals 

Verify that the critical ratios $r^>/r^<$ are $(\sqrt{3} + 1)/2$ for the cesium chloride structure and $2 + \sqrt{6}$
for the zincblende structure, as asserted in the text.


22  Because  of  this  we  can  ignore  the  Pauli  principle  as  it  affects  the  interchange  of  electrons  between 
atoms,  and  regard  the  electrons  on  atom  1  as  distinguishable  from  those  on  atom  2.  In  particular,  we  need 
not  antisymmetrize  the  states  appearing  in  (19.6). 

23  If  the  atoms  are  too  close  together,  overlap  cannot  be  ignored,  and  it  leads  to  a  strong  (short-range) 
repulsion.  The  very  slight  overlap  when  the  atoms  are  far  apart  yields  corrections  to  the  interaction  that 
fall off exponentially with separation.

The cohesive energy of a solid is the energy required to disassemble it into its constituent
parts, i.e., its binding energy.1 This energy depends, of course, on what the
constituent parts are considered to be. They are generally taken to be the individual
atoms of the chemical elements out of which the solid is composed, but other conventions
are sometimes used. For example, it might be convenient to define the
cohesive  energy  of  solid  nitrogen  as  that  required  to  separate  it  into  a  set  of  isolated 
nitrogen  molecules,  rather  than  atoms.  Knowing  the  binding  energy  of  an  isolated 
nitrogen  molecule,  one  can  easily  convert  from  one  definition  to  the  other.  Similarly, 
in  the  alkali  halide  crystals  we  shall  discuss  the  energy  necessary  to  separate  the  solid 
into  isolated  ions,  rather  than  atoms.  The  link  between  the  two  energies  is  provided 
by  the  first  ionization  potential  of  the  alkali  metal  atom  and  the  electron  affinity  of 
the  halogen  atom. 

In  the  early  days  of  solid  state  physics  much  effort  was  devoted  to  computing 
cohesive  energies,  and  the  subject  pervaded  the  theory  of  solids  much  more  than  it 
does  today.  Older  discussions  of  the  classification  of  solids,  for  example,  rely  heavily 
on  the  nature  of  the  cohesion,  rather  than  emphasizing  (as  in  Chapter  19)  the  (closely 
related)  spatial  electronic  arrangement.  The  importance  of  the  cohesive  energy  is 
that  it  is  the  ground-state  energy  of  the  solid;  its  sign,  for  example,  determines  whether 
the  solid  will  be  stable  at  all.  Indeed,  its  generalization  to  nonzero  temperatures,  the 
Helmholtz  free  energy,  if  known  as  a  function  of  volume  and  temperature,  contains 
all  equilibrium  thermodynamic  information  about  the  solid.  However,  solid  state 
physics  has  come  to  focus  more  and  more  on  nonequilibrium  properties  (e.g.,  transport 
properties  and  optical  properties)  and  the  study  of  cohesion  no  longer  plays  the 
dominant  role  it  once  did. 

In  this  chapter  we  shall  discuss  some  elementary  facts  about  cohesive  energies  at 
zero  temperature.  We  shall  calculate  these  energies  for  an  externally  imposed  lattice 
constant, and shall therefore be considering solids under external pressure. By calculating
the rate of change of cohesive energy with lattice constant we can find the
pressure necessary to maintain a given volume, and therefore determine the equilibrium
lattice constant as that requiring zero pressure2 for its maintenance. In the same
way we can calculate the compressibility of the solid, i.e., the change in volume
produced by a given change in pressure. This is more accessible to physical measurement
than the cohesive energy itself, since it does not require dismantling of the solid
into  its  constituents. 

1 It is frequently given in kilocalories per mole. A useful conversion factor is 23.05 kcal/mole = 1
eV/molecule.

2 More correctly, atmospheric pressure. However, the difference in size between a solid at atmospheric
pressure and one in vacuum is negligible at the level of accuracy of our analysis.

Throughout this chapter we shall treat the ion cores as classical particles, which
can be perfectly localized with zero kinetic energy at the sites of a lattice. This is
incorrect, for it violates the uncertainty principle. If an ion core is confined to a
region of linear dimensions $\Delta x$, the uncertainty in its momentum will be of order
$\hbar/\Delta x$. It will therefore have a kinetic energy of order $\hbar^2/M(\Delta x)^2$, known as the
zero-point kinetic energy, whose contribution to the energy of the solid must be taken
into account. Furthermore, since the ions are not perfectly localized (for that would
imply infinite zero-point kinetic energy), the deviations in their potential energy from
that of classical particles fixed at lattice sites must be allowed for. We shall not be
able to do this in any but the crudest way (Problem 1) until Chapter 23, where we
describe the theory of lattice vibrations. For now we only note that the lighter the
ionic mass, the larger is the zero-point kinetic energy, and the more suspect the
approximation of perfectly localized ions. We shall see some simple evidence for the
importance of zero-point motion in the lighter noble gases in our discussion below.3
In most other cases the errors introduced by the neglect of zero-point motion are
of the order of 1 percent or less.
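
The mass dependence noted above can be made semi-quantitative with a rough sketch. It evaluates $\hbar^2/M(\Delta x)^2$ for the noble gases and compares it with the depth $\epsilon$ of the pair potential quoted in Table 20.1. The choice $\Delta x = \sigma$ is an assumption made here purely for illustration (an atom in the solid is confined to a smaller region, so the absolute numbers are underestimates), and the atomic masses are standard values that the text does not give. Only the trend with mass should be read off.

```python
HBAR = 1.0546e-27   # erg s
AMU = 1.6605e-24    # g

# (atomic mass in amu, epsilon in erg, sigma in cm).
# epsilon and sigma are the Table 20.1 values; the masses are standard atomic masses.
GASES = {
    "Ne": (20.18, 0.050e-13, 2.74e-8),
    "Ar": (39.95, 0.167e-13, 3.40e-8),
    "Kr": (83.80, 0.225e-13, 3.65e-8),
    "Xe": (131.29, 0.320e-13, 3.98e-8),
}


def zero_point_ratio(mass_amu, epsilon, delta_x):
    """Ratio of the estimate hbar^2 / (M * delta_x^2) to the well depth epsilon."""
    kinetic = HBAR**2 / (mass_amu * AMU * delta_x**2)
    return kinetic / epsilon


# Assumption for illustration only: take delta_x equal to sigma.
zp = {gas: zero_point_ratio(m, eps, sig) for gas, (m, eps, sig) in GASES.items()}
for gas, value in zp.items():
    print(f"{gas}: (hbar^2 / M sigma^2) / epsilon = {value:.1e}")
```

With these inputs the ratio falls steadily from neon to xenon, in line with the statement that the classical picture is most suspect for the lightest atoms.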

Having noted this oversimplification, we turn to the other, generally more important,
factors contributing to the binding energy of the various types of solids.4 We start
with  the  molecular  solids  (the  crude  theory  of  which  is  particularly  simple)  treating 
them  as  atoms  held  together  by  the  short-range  fluctuating  dipole  interaction,  and 
prevented  from  coming  close  together  by  the  still  shorter-range  core-core  repulsion.5 
On  a  similar  level  of  sophistication,  ionic  crystals  are  somewhat  more  subtle,  because 
the  basic  building  blocks  are  now  electrically  charged  ions,  and  problems  arise 
connected  with  the  very  long  range  of  the  interionic  force.  On  the  other  hand,  the 
electrostatic  interaction  energy  of  the  ions  is  so  large  that  it  completely  dominates 
all  other  sources  of  attraction.6  In  this  respect  the  crude  theory  of  ionic  crystals  is 
the  simplest  of  all. 

However,  when  we  turn  to  covalent  crystals  and  metals  we  find  that  even  a  crude 
theory  is  difficult  to  construct.  The  basic  problem  is  that  the  arrangement  of  the 
valence  electrons,  whether  in  the  well-localized  bonds  of  good  covalent  insulators  or 
the  electron  gas  of  the  alkali  metals,  is  vastly  different  from  what  it  is  in  either  the 
isolated  atoms  or  the  isolated  ions.  Our  discussion  in  these  cases  will  be  highly 
qualitative. 

For  simplicity  we  shall  discuss  only  cubic  crystals  in  this  chapter,  and  consider 
the  energy  of  the  solid  as  a  function  of  the  cubic  cell  side,  a.  In  so  doing  we  are  ignoring 
crystals  whose  energy  may  depend  on  more  than  one  geometrical  parameter  (e.g.,  on 
c and a in hcp structures). We shall also ignore deformations of cubic crystals from
their equilibrium size and shape more general than a mere uniform compression
(which preserves their cubic symmetry). The physics of more complex deformations
is no different, but the geometrical aspects of the more general deformations can be
complicated. We shall limit our discussion of such deformations to the less fundamental
description given in the discussion of elastic constants in Chapter 22.


3  Only  in  solid  helium  do  considerations  determined  by  zero-point  motion  become  of  really  crucial 
importance.  The  mass  of  helium  is  so  light  that  quantum  effects  prevent  it  from  solidifying  at  all,  unless 
an  external  pressure  is  imposed. 

4  We  emphasize  again  that  the  only  attractive  forces  at  work  are  electrostatic,  but  the  way  in  which 
they  manifest  themselves  varies  so  dramatically  from  category  to  category  as  to  require  in  each  case 
separate  discussions,  and  even  separate  nomenclature. 

5  Recall  (page  379)  that  this  is  simply  a  crude  way  of  representing  classically  some  effects  of  the 
Pauli  exclusion  principle,  when  applied  to  filled  atomic  shells. 

6 Such as the fluctuating dipole interactions between the ions.

MOLECULAR CRYSTALS: THE NOBLE GASES


We  consider  only  the  simplest  molecular  crystals,  in  which  the  constituent  entities 
are  noble  gas  atoms.  We  omit  solid  helium,  because  of  the  crucial  role  played  by 
quantum  effects.7  As  described  in  Chapter  19,  the  atoms  in  a  solid  noble  gas  are  only 
slightly  distorted  from  the  stable  closed-shell  configuration  they  possess  in  the  free 
state.  Such  small  distortion  as  occurs  can  be  described  by  the  fluctuating  dipole 
interaction  and  represented  as  a  weak  attractive  potential  which  varies  as  the  inverse 
sixth  power  of  the  interatomic  separation.  It  is  this  weak  attraction  that  holds  the 
solid  together. 

When  the  atoms  approach  one  another  too  closely,  the  repulsion  of  the  ion  core 
comes  into  play  and  is  crucial  in  determining  the  equilibrium  size  of  the  solid.  At 
short  distances  this  repulsion  must  be  stronger  than  the  attraction,  and  it  has  become 
conventional  to  represent  it  in  the  form  of  a  power  law  as  well.  The  power  generally 
chosen is 12, and the resulting potential then has the form

$$\phi(r) = -\frac{A}{r^6} + \frac{B}{r^{12}}, \tag{20.1}$$


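where A and B are positive constants, and r is the distance between atoms. The
potential is usually written in the dimensionally more appealing form

$$\phi(r) = 4\epsilon\left[\left(\frac{\sigma}{r}\right)^{12} - \left(\frac{\sigma}{r}\right)^{6}\right], \qquad \sigma = (B/A)^{1/6}, \quad \epsilon = A^2/4B, \tag{20.2}$$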


and  is  known  as  the  Lennard-Jones  6-12  potential.  There  is  no  reason  for  choosing 
the  exponent  in  the  repulsive  term  to  be  12,  other  than  the  resulting  analytic  simplicity 
and  the  requirement  that  the  number  be  larger  than  6.  With  this  choice,  however, 
the  thermodynamic  properties  of  gaseous  neon,  argon,  krypton,  and  xenon  at  low 
densities can be well reproduced by suitable choices of the parameters $\epsilon$ and $\sigma$ for
each. The values so obtained are displayed8 in Table 20.1.
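
The equivalence of the two forms (20.1) and (20.2) is easy to confirm numerically. The sketch below converts between the parameter pairs (A, B) and ($\epsilon$, $\sigma$) and checks that the potential vanishes at $r = \sigma$ and has depth $-\epsilon$ at $r = 2^{1/6}\sigma$. The function names are illustrative, and the argon values are used only as sample input.

```python
def lj_from_AB(A, B):
    """Convert the constants of Eq. (20.1) to (epsilon, sigma) of Eq. (20.2)."""
    sigma = (B / A) ** (1.0 / 6.0)
    epsilon = A**2 / (4.0 * B)
    return epsilon, sigma


def phi_AB(r, A, B):
    """Eq. (20.1): phi(r) = -A/r^6 + B/r^12."""
    return -A / r**6 + B / r**12


def phi_lj(r, epsilon, sigma):
    """Eq. (20.2): phi(r) = 4*eps*[(sigma/r)^12 - (sigma/r)^6]."""
    x = (sigma / r) ** 6
    return 4.0 * epsilon * (x * x - x)


# Sample input: argon, epsilon in eV and sigma in angstroms (Table 20.1).
eps_ar, sig_ar = 0.0104, 3.40
A_ar = 4.0 * eps_ar * sig_ar**6      # inverse of the relations in (20.2)
B_ar = 4.0 * eps_ar * sig_ar**12
r_min = 2.0 ** (1.0 / 6.0) * sig_ar  # position of the minimum of phi

print(lj_from_AB(A_ar, B_ar))
print(phi_lj(sig_ar, eps_ar, sig_ar), phi_lj(r_min, eps_ar, sig_ar))
```

The minimum at $2^{1/6}\sigma \approx 1.12\sigma$ refers to a single pair of atoms; in the solid the sum over all neighbors shifts the equilibrium spacing.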


Table 20.1

VALUES OF THE LENNARD-JONES PARAMETERS FOR THE
NOBLE GASES^a

                               Ne        Ar        Kr        Xe

$\epsilon$ ($10^{-13}$ erg)    0.050     0.167     0.225     0.320
$\epsilon$ (eV)                0.0031    0.0104    0.0140    0.0200
$\sigma$ (Å)                   2.74      3.40      3.65      3.98

a As deduced from properties of the low-density gases (second virial coefficient).
Source: N. Bernardes, Phys. Rev. 112, 1534 (1958).


7 And also because solid helium (of either isotope) does not exist at zero pressure; 25 atmospheres
are required for $^4$He, and 33 for $^3$He.

8 These values must be used with caution in the solid state, for at high densities the interaction cannot
be represented as a sum of pair potentials (see page 390). If one nevertheless insists on fitting data in the
solid with a sum of pair potentials of the form (20.2), the best choice of $\epsilon$ and $\sigma$ need not be the same as
that determined by properties in the gaseous state.

We emphasize that the precise form of the potential (20.2) should not be taken too
seriously. It is nothing more than a simple way of taking into account the following:


1. The potential is attractive and varies as $1/r^6$ at large separations.

2. The potential is strongly repulsive at small separations.

3. The parameters $\epsilon$ and $\sigma$ measure the strength of the attraction and the radius
of the repulsive core, as determined by fitting data in the gaseous state.


Note that $\epsilon$ is only of order 0.01 eV, consistent with the very weak binding of the
solidified  noble  gases.  The  Lennard-Jones  potential  is  shown  in  Figure  20.1. 


Figure  20.1 

The Lennard-Jones
potential (Eq. (20.2)).


Let  us  try  to  fit  some  of  the  observed  properties  of  the  solid  noble  gases  using 
only  the  data  from  the  gaseous  state  contained  in  Table  20.1,  and  the  potential  (20.2). 
We  treat  the  rare  gas  solid  as  a  set  of  classical  particles,  localized  with  negligible 
kinetic  energy  at  the  points  of  the  observed  face-centered  cubic  Bravais  lattice.  To 
compute the total potential energy of the solid first note that the energy of interaction
of the atom at the origin with all the others is

$$\sum_{\mathbf{R} \neq 0} \phi(\mathbf{R}). \tag{20.3}$$

If we multiply this by N, the total number of atoms in the crystal, we get twice the
total potential energy of the crystal, because we have thereby counted the interaction
energy of each atom pair twice. Thus the energy per particle, u, is just:

$$u = \frac{1}{2}\sum_{\mathbf{R} \neq 0} \phi(\mathbf{R}), \tag{20.4}$$

where the sum is over all nonzero vectors in the fcc Bravais lattice.

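It is convenient to write the length of the Bravais lattice vector R as a dimensionless
number, $\alpha(\mathbf{R})$, times the nearest-neighbor separation, r. Equations (20.2) and (20.4)
then give

$$u = 2\epsilon\left[A_{12}\left(\frac{\sigma}{r}\right)^{12} - A_6\left(\frac{\sigma}{r}\right)^{6}\right], \tag{20.5}$$

where

$$A_n = \sum_{\mathbf{R} \neq 0}\frac{1}{\alpha(\mathbf{R})^n}. \tag{20.6}$$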

The constants $A_n$ depend only on the crystal structure (in this case, fcc) and the number
n. Evidently when n is very large, only the nearest neighbors of the origin will contribute
to the sum (20.6). Since by definition $\alpha(\mathbf{R}) = 1$, when R is a vector joining
nearest neighbors, as $n \to \infty$, $A_n$ approaches the number of nearest neighbors, which is
12 for the fcc Bravais lattice. As n diminishes, $A_n$ increases, since next-nearest neighbors
start to contribute. When n is 12, $A_n$ is given to a tenth of a percent by the contributions
from the nearest, next-nearest, and third-nearest neighbors of the origin. Values of
$A_n$ have been computed for most common crystal structures and a range of values
of n. The values of $A_n$ for the most common cubic structures are shown in Table 20.2.
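
The fcc entries of Table 20.2 can be checked by brute force. The sketch below sums (20.6) over all fcc points inside a cube centered on the origin. The cube truncation and the parameter name `shells` are assumptions of this example, not the method used to produce the table, and for n = 6 the truncated sum is still slightly short of its limit in the third decimal place.

```python
def fcc_lattice_sum(n, shells=20):
    """Direct evaluation of A_n (Eq. (20.6)) for the fcc Bravais lattice.

    The fcc points are (a/2)(i, j, k) with i + j + k even. The nearest-neighbor
    distance is a/sqrt(2), so alpha(R)^2 = (i^2 + j^2 + k^2) / 2.
    Only points with |i|, |j|, |k| <= shells are included.
    """
    total = 0.0
    indices = range(-shells, shells + 1)
    for i in indices:
        for j in indices:
            for k in indices:
                if (i + j + k) % 2 != 0:
                    continue  # not an fcc lattice point
                d2 = i * i + j * j + k * k
                if d2 == 0:
                    continue  # skip the origin itself
                total += (d2 / 2.0) ** (-n / 2.0)
    return total


print(f"A_12 = {fcc_lattice_sum(12):.2f}")
print(f"A_6  = {fcc_lattice_sum(6):.2f}")
```

For large n the same routine approaches 12, the number of nearest neighbors, as stated above.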


Table 20.2

THE LATTICE SUMS $A_n$ FOR THE THREE CUBIC BRAVAIS LATTICES^a

n            SIMPLE CUBIC           BODY-CENTERED CUBIC     FACE-CENTERED CUBIC

$\le 3$      $\infty$               $\infty$                $\infty$
4            16.53                  22.64                   25.34
5            10.38                  14.76                   16.97
6            8.40                   12.25                   14.45
7            7.47                   11.05                   13.36
8            6.95                   10.36                   12.80
9            6.63                   9.89                    12.49
10           6.43                   9.56                    12.31
11           6.29                   9.31                    12.20
12           6.20                   9.11                    12.13
13           6.14                   8.95                    12.09
14           6.10                   8.82                    12.06
15           6.07                   8.70                    12.04
16           6.05                   8.61                    12.03
$n \ge 17$   $6 + 12(1/2)^{n/2}$    $8 + 6(3/4)^{n/2}$      $12 + 6(1/2)^{n/2}$

a $A_n$ is the sum of the inverse nth powers of the distances from a given Bravais lattice point
to all others, where the unit of distance is taken to be the distance between nearest neighbors
(Eq. (20.6)). To the accuracy of the table only nearest- and next-nearest neighbors
contribute when $n \ge 17$, and the given formulas may be used.

Source: J. E. Jones and A. E. Ingham, Proc. Roy. Soc. (London) A107, 636 (1925).

Equilibrium Density of the Solid Noble Gases


To find the nearest-neighbor separation in equilibrium, $r_0$, and hence the density,
we need only minimize (20.5) with respect to r, to find that $\partial u/\partial r = 0$ at

$$r_0^{th} = \left(\frac{2A_{12}}{A_6}\right)^{1/6}\sigma = 1.09\sigma. \tag{20.7}$$

In Table 20.3 the theoretical value $r_0^{th} = 1.09\sigma$ is compared with the measured value,
$r_0^{exp}$. Agreement is quite good, although $r_0^{exp}$ becomes progressively bigger than $r_0^{th}$ as
the atomic mass becomes lighter. This can be understood as an effect of the zero-point
kinetic energy we have neglected. This energy becomes greater, the smaller the volume
into which the atoms are squeezed. It should therefore behave as an effectively repulsive
force, increasing the lattice constant over the value given by (20.7). Since the
zero-point energy becomes more important with decreasing atomic mass, we should
expect (20.7) to fall short of $r_0^{exp}$ most for the lightest masses.
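
The factor 1.09 in (20.7) and the theoretical entries for $r_0$ in Table 20.3 can be reproduced in a few lines. The sketch below uses the fcc lattice sums from Table 20.2 and the $\sigma$ values from Table 20.1; the function names are illustrative.

```python
A6, A12 = 14.45, 12.13  # fcc lattice sums, Table 20.2

SIGMA = {"Ne": 2.74, "Ar": 3.40, "Kr": 3.65, "Xe": 3.98}   # angstroms, Table 20.1
R0_EXP = {"Ne": 3.13, "Ar": 3.75, "Kr": 3.99, "Xe": 4.33}  # angstroms, Table 20.3


def energy_per_particle(r, epsilon, sigma):
    """Eq. (20.5): u(r) = 2*eps*[A12*(sigma/r)^12 - A6*(sigma/r)^6]."""
    x6 = (sigma / r) ** 6
    return 2.0 * epsilon * (A12 * x6 * x6 - A6 * x6)


def equilibrium_ratio():
    """r0/sigma from du/dr = 0, which gives (sigma/r0)^6 = A6 / (2*A12)."""
    return (2.0 * A12 / A6) ** (1.0 / 6.0)


ratio = equilibrium_ratio()
print(f"r0/sigma = {ratio:.3f}")
for gas, sigma in SIGMA.items():
    print(f"{gas}: theory {ratio * sigma:.2f} A, experiment {R0_EXP[gas]:.2f} A")
```

Comparing the two columns of output shows the gap between theory and experiment growing toward neon, as discussed above.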


Table 20.3

NEAREST-NEIGHBOR DISTANCE $r_0$, COHESIVE ENERGY $u_0$, AND
BULK MODULUS $B_0$ AT ZERO PRESSURE FOR THE SOLID NOBLE
GASES^a

                                                   Ne        Ar        Kr        Xe

$r_0$ (angstroms)                  (Experiment)    3.13      3.75      3.99      4.33
$r_0 = 1.09\sigma$                 (Theory)        2.99      3.71      3.98      4.34

$u_0$ (eV/atom)                    (Experiment)    -0.02     -0.08     -0.11     -0.17
$u_0 = -8.6\epsilon$               (Theory)        -0.027    -0.089    -0.120    -0.172

$B_0$ ($10^{10}$ dyne/cm$^2$)^b    (Experiment)    1.1       2.7       3.5       3.6
$B_0 = 75\epsilon/\sigma^3$        (Theory)        1.81      3.18      3.46      3.81

a The theoretical values are those calculated from the elementary classical theory.
b One atmosphere of pressure = $1.01 \times 10^6$ dynes/cm$^2$; 1 bar of pressure = $10^6$
dynes/cm$^2$.

Source: Data quoted by M. L. Klein, G. K. Horton, and J. L. Feldman, Phys. Rev. 184,
968 (1969); D. N. Batchelder, et al., Phys. Rev. 162, 767 (1967); E. R. Dobbs and
G. O. Jones, Rep. Prog. Phys. xx, 516 (1957).


Equilibrium  Cohesive  Energy  of  the  Solid  Noble  Gases 

If we substitute the equilibrium nearest-neighbor separation (20.7) into the energy
per particle (20.5) we find the equilibrium cohesive energy:

$$u_0^{th} = -\frac{\epsilon A_6^2}{2A_{12}} = -8.6\epsilon. \tag{20.8}$$

If $u_0^{th}$ is compared with the measured value $u_0^{exp}$ (Table 20.3), the agreement is again
found to be good, although $|u_0^{th}|$ increasingly exceeds $|u_0^{exp}|$ as the atomic mass decreases.
This  again  makes  sense  as  an  effect  of  the  neglected  zero-point  motion.  We  have 
ignored  a  positive  term  in  the  energy  (kinetic  energy  is  always  positive)  that  decreases 
the  binding,  and  becomes  more  important  with  decreasing  atomic  mass. 
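
The numerical coefficient in (20.8) and the theoretical cohesive energies of Table 20.3 follow directly from the lattice sums. The sketch below evaluates them in eV per atom, with $\epsilon$ taken from Table 20.1; the function name is illustrative.

```python
A6, A12 = 14.45, 12.13  # fcc lattice sums, Table 20.2

EPSILON_EV = {"Ne": 0.0031, "Ar": 0.0104, "Kr": 0.0140, "Xe": 0.0200}  # Table 20.1
U0_EXP = {"Ne": -0.02, "Ar": -0.08, "Kr": -0.11, "Xe": -0.17}          # Table 20.3


def cohesive_energy(epsilon):
    """Eq. (20.8): u0 = -eps * A6^2 / (2 * A12), in the units of epsilon."""
    return -epsilon * A6**2 / (2.0 * A12)


print(f"u0/epsilon = {cohesive_energy(1.0):.2f}")
for gas, eps in EPSILON_EV.items():
    print(f"{gas}: theory {cohesive_energy(eps):.3f} eV, experiment {U0_EXP[gas]:.2f} eV")
```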
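Equilibrium Bulk Modulus of the Solid Noble Gases


The bulk modulus $B = -V(\partial P/\partial V)_T$ can also be calculated in terms of $\epsilon$ and $\sigma$.
Since the pressure at $T = 0$ is given by $P = -dU/dV$, where U is the total energy,
we can write B in terms of the energy per particle $u = U/N$ and volume per particle
$v = V/N$ as

$$B = v\frac{\partial}{\partial v}\left(\frac{\partial u}{\partial v}\right). \tag{20.9}$$

The volume per particle v in a fcc lattice is $v = a^3/4$, where the side a of the conventional
cubic cell is related to the nearest-neighbor separation r by $a = \sqrt{2}\,r$. We may
therefore write

$$v = \frac{r^3}{\sqrt{2}}, \qquad \frac{\partial}{\partial v} = \frac{\sqrt{2}}{3r^2}\frac{\partial}{\partial r}, \tag{20.10}$$

and rewrite the bulk modulus as

$$B = \frac{\sqrt{2}}{9}\,r\,\frac{\partial}{\partial r}\frac{1}{r^2}\frac{\partial}{\partial r}u. \tag{20.11}$$

The equilibrium separation $r_0$ is that which minimizes the energy per particle u.
Therefore $\partial u/\partial r$ vanishes in equilibrium and (20.11) reduces to

$$B_0^{th} = \frac{\sqrt{2}}{9r_0}\left.\frac{\partial^2 u}{\partial r^2}\right|_{r = r_0} = \frac{4\epsilon}{\sigma^3}A_{12}\left(\frac{A_6}{A_{12}}\right)^{5/2} = \frac{75\epsilon}{\sigma^3}. \tag{20.12}$$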

Comparing $B_0^{th}$ with the measured $B_0^{exp}$ (Table 20.3) we find good agreement in
xenon and krypton, but the theoretical bulk modulus is about 20 percent larger than the
measured one in argon and 60 percent larger in neon. The mass dependence again suggests that
these discrepancies are due to the neglected zero-point motion.
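
The coefficient 75 in (20.12) and the theoretical bulk moduli of Table 20.3 can be recomputed as follows. The sketch works in CGS units ($\epsilon$ in erg, $\sigma$ in cm, B in dyne/cm$^2$) with inputs from Tables 20.1 and 20.2; the function names are illustrative, and small differences from the tabulated values come from rounding of the inputs.

```python
A6, A12 = 14.45, 12.13  # fcc lattice sums, Table 20.2

EPSILON_ERG = {"Ne": 0.050e-13, "Ar": 0.167e-13, "Kr": 0.225e-13, "Xe": 0.320e-13}
SIGMA_CM = {"Ne": 2.74e-8, "Ar": 3.40e-8, "Kr": 3.65e-8, "Xe": 3.98e-8}
B0_EXP = {"Ne": 1.1e10, "Ar": 2.7e10, "Kr": 3.5e10, "Xe": 3.6e10}  # dyne/cm^2, Table 20.3


def bulk_modulus_coefficient():
    """Dimensionless factor in Eq. (20.12): 4 * A12 * (A6/A12)^(5/2)."""
    return 4.0 * A12 * (A6 / A12) ** 2.5


def bulk_modulus(epsilon, sigma):
    """Eq. (20.12): B0 = coefficient * epsilon / sigma^3."""
    return bulk_modulus_coefficient() * epsilon / sigma**3


print(f"coefficient = {bulk_modulus_coefficient():.1f}")
for gas in EPSILON_ERG:
    b_th = bulk_modulus(EPSILON_ERG[gas], SIGMA_CM[gas])
    print(f"{gas}: theory {b_th / 1e10:.2f}, experiment {B0_EXP[gas] / 1e10:.1f} "
          f"(10^10 dyne/cm^2), theory/experiment = {b_th / B0_EXP[gas]:.2f}")
```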


IONIC  CRYSTALS 

The simplest theory of cohesion in the ionic crystals makes the same physical simplifications
as the theory of cohesion in molecular crystals: One assumes that the
cohesive  energy  is  entirely  given  by  the  potential  energy  of  classical  particles  localized 
at  the  equilibrium  positions.9  Because  the  particles  in  ionic  crystals  are  electrically 
charged  ions,  by  far  the  largest  term  in  the  interaction  energy  is  the  interionic  Coulomb 
interaction.  This  varies  as  the  inverse  first  power  of  the  interionic  distance,  and 
completely  overwhelms  the  inverse  sixth-power  fluctuating  dipole  interaction10  and 
can  be  taken  as  the  sole  source  of  binding  in  rough  calculations. 


9  We  shall  define  the  cohesive  energy  of  an  ionic  crystal  to  be  the  energy  required  to  disassemble 
it  into  isolated  ions,  rather  than  atoms.  If  one  wishes  the  cohesive  energy  with  respect  to  isolated  atoms, 
one  must  supplement  our  analysis  with  calculations  or  measurements  of  ionization  potentials  and  electron 
affinities. 

10 Such a term, however, is present in ionic crystals as well, and must be allowed for in more precise
calculations.

In determining the equilibrium lattice parameters one must still take into account
the strong short-range core-core repulsion due to the Pauli principle, without which
the crystal would collapse. We therefore represent the total cohesive energy per ion
pair11 in the form

$$u(r) = u^{core}(r) + u^{coul}(r), \tag{20.13}$$

where r is the nearest-neighbor distance.12

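The calculation of $u^{coul}(r)$ is not as straightforward as the calculation of the attractive
energy in molecular crystals, because of the very long range of the Coulomb
potential. Consider, for example, the sodium chloride structure (Figure 19.4a), which
we can represent as a fcc Bravais lattice of negative anions at sites R, and a second
Bravais lattice of positive cations displaced by d from the first, where d is a translation
through a/2 along a cube side. We again measure all interionic distances in terms
of the nearest-neighbor distance $r = a/2$:

$$|\mathbf{R}| = \alpha(\mathbf{R})\,r, \qquad |\mathbf{R} + \mathbf{d}| = \alpha(\mathbf{R} + \mathbf{d})\,r. \tag{20.14}$$

It is then tempting to proceed as in the earlier case, writing the total potential energy
of a single cation (or a single anion) as

$$-\frac{e^2}{r}\left\{\frac{1}{\alpha(\mathbf{d})} + \sum_{\mathbf{R} \neq 0}\left(\frac{1}{\alpha(\mathbf{R} + \mathbf{d})} - \frac{1}{\alpha(\mathbf{R})}\right)\right\}. \tag{20.15}$$

If there are N ions in the crystal, the total potential energy will be half of N times
(20.15):

$$U = -\frac{N}{2}\frac{e^2}{r}\left\{\frac{1}{\alpha(\mathbf{d})} + \sum_{\mathbf{R} \neq 0}\left(\frac{1}{\alpha(\mathbf{R} + \mathbf{d})} - \frac{1}{\alpha(\mathbf{R})}\right)\right\}. \tag{20.16}$$

The energy per ion pair is this divided by the number N/2 of ion pairs:

$$u^{coul}(r) = -\frac{e^2}{r}\left\{\frac{1}{\alpha(\mathbf{d})} + \sum_{\mathbf{R} \neq 0}\left(\frac{1}{\alpha(\mathbf{R} + \mathbf{d})} - \frac{1}{\alpha(\mathbf{R})}\right)\right\}. \tag{20.17}$$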

However,  1/r  falls  off  so  slowly  with  distance  that  Eq.  (20.17)  is  not  a  well-defined 
sum.  Mathematically  speaking,  it  is  only  a  conditionally  convergent  series,  and  can 
therefore  be  summed  to  any  value  whatsoever,  depending  on  the  order  in  which  the 
summation  is  performed! 
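
The dependence on summation order is easy to see in a simpler series. The sketch below is an illustration chosen here, not the three-dimensional sum of the text: it uses the alternating harmonic series, which is conditionally convergent in the same sense and, up to a factor of 2, is the Coulomb sum for a line of alternating charges. Summed in its natural order it gives ln 2; taking two positive terms for every negative one gives (3/2) ln 2 from exactly the same terms.

```python
import math


def alternating_harmonic(terms):
    """1 - 1/2 + 1/3 - 1/4 + ... summed in the natural order."""
    return sum((-1) ** (k + 1) / k for k in range(1, terms + 1))


def rearranged(blocks):
    """The same terms, taken as two positive terms followed by one negative term."""
    total = 0.0
    odd, even = 1, 2
    for _ in range(blocks):
        total += 1.0 / odd + 1.0 / (odd + 2) - 1.0 / even
        odd += 4
        even += 2
    return total


print(f"natural order : {alternating_harmonic(100000):.4f}  (ln 2 = {math.log(2):.4f})")
print(f"rearranged    : {rearranged(100000):.4f}  (1.5 ln 2 = {1.5 * math.log(2):.4f})")
```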

This  is  not  just  a  mathematical  nuisance.  It  reflects  the  physical  fact  that  Coulomb 
interactions  are  of  such  long  range  that  the  energy  of  a  collection  of  charged  particles 
can  depend  crucially  on  the  configuration  of  a  negligibly  small  fraction  of  them  on 
the  surface.  We  have  already  encountered  this  problem  in  Chapter  18.  In  the  present 
case  we  can  put  the  point  as  follows: 


11 It is customary to calculate the cohesive energy per ion pair, rather than per ion. If there are N
ions, then there will be N/2 ion pairs.

12 We called the nearest-neighbor distance d in Chapter 19 to avoid confusion with ionic radii.
Here we call it r because it is esthetically distressing to take derivatives with respect to d. And we are
esthetes, at heart.

If we included only a finite set of ions in the summation there would be no
ambiguity,  and  the  sum  would  give  the  electrostatic  energy  of  that  finite  crystal. 
Summing  the  infinite  series  in  a  particular  order  corresponds  to  constructing  the 
infinite  crystal  as  a  particular  limiting  form  of  successively  larger  and  larger  finite 
crystals.  If  the  interionic  interactions  were  of  short  enough  range  one  could  prove 
that  the  limiting  energy  per  ion  pair  would  not  depend  on  how  the  infinite  crystal 
was  built  up  (provided  that  the  surface  of  successive  finite  constructions  was  not 
wildly irregular). However, with the long-range Coulomb interaction one can construct
the infinite crystal in such a way that arbitrary distributions of surface charge
and/or  dipolar  layers  are  present  at  all  stages.  By  judiciously  choosing  the  form 
of  these  surface  charges  one  can  arrange  things  so  that  the  energy  per  ion  pair  u 
approaches  any  desired  value  in  the  limit  of  an  infinite  crystal.  This  is  the  physics 
underlying  the  mathematical  ambiguity  in  Eq.  (20.17). 

The  disease  being  thus  diagnosed,  the  cure  is  obvious :  The  series  must  be  summed 
in  such  a  way  that  at  all  stages  of  the  summation  there  are  no  appreciable  contributions 
to  the  energy  from  charges  at  the  surface.  There  are  many  ways  this  can  be  guaranteed. 
For  example,  one  can  break  up  the  crystal  into  electrically  neutral  cells  whose  charge 
distributions  have  the  full  cubic  symmetry  (see  Figure  20.2).  The  energy  of  a  finite 
subcrystal  composed  of  n  such  cells  will  then  be  just  n  times  the  energy  of  a  single 
cell, plus the cell-cell interaction energy. The internal energy of a cell is easily calculated
since the cell contains only a small number of charges. But the interaction
energy  between  cells  will  fall  off  as  the  inverse  fifth  power  of  the  distance  between 
the  cells,13  and  thus  the  cell-cell  interaction  energy  will  be  a  rapidly  converging 
summation,  which,  in  the  limit  of  an  infinite  crystal,  will  not  depend  on  the  order 
of  summation. 

There  are  numerically  more  powerful,  but  more  complex,  ways  of  computing  such 

Figure  20.2 

One  possible  way  of  dividing  up  the  sodium  chloride 
structure  into  cubical  cells,  whose  electrostatic  interaction 
energy  falls  off  rapidly  (as  the  inverse  fifth  power)  with 
intercellular  distance.  Each  cell  contains  four  units  of 
positive  charge,  made  up  of  a  whole  unit  at  the  center  and 
twelve  quarter  units  on  the  edges,  and  four  units  of  negative 
charge,  made  up  of  six  half  units  on  the  faces  and  eight 
eighth  units  at  the  corners.  For  computation  each  sphere 
can  be  represented  as  a  point  charge  at  its  center.  (The 
energies  of  interaction  of  the  surface  point  charges  of  two 
adjacent  cubes  must  not  be  counted.) 


13  This  is  because  the  charge  distribution  within  each  cell  has  the  full  cubic  symmetry.  See  page  355. 
Note  also  that  a  minor  problem  arises  if  some  ions  lie  on  the  boundary  between  cells.  Their  charge  must 
then  be  divided  between  cells  so  as  to  maintain  the  full  symmetry  of  each  cell.  Having  done  this,  one  must 
be  careful  not  to  include  the  self-energy  of  the  divided  ion  in  the  interaction  energy  between  the  cells 
sharing  it. 
Coulomb lattice sums, which are, however, all guided by the same physical criterion.
The  most  famous  is  due  to  Ewald.14 

The  result  of  all  such  calculations  is  that  the  electrostatic  interaction  per  ion  pair 
has  the  form: 

$$u^{\text{coul}}(r) = -\alpha \frac{e^2}{r}, \qquad (20.18)$$

where α, known as the Madelung constant, depends only on the crystal structure.
Values of α for the most important cubic structures are given in Table 20.4. Note that
α is an increasing function of coordination number; i.e., the more nearest neighbors
(of  opposite  charge),  the  lower  the  electrostatic  energy.  Since  the  Coulomb  interaction 
has  so  long  a  range  this  is  not  an  obvious  result.  Indeed,  the  amount  by  which  the 
electrostatic  energy  of  the  cesium  chloride  structure  (coordination  number  8)  is  lower 
than  that  of  a  sodium  chloride  structure  with  the  same  nearest-neighbor  distance 
r  (coordination  number  6)  is  less  than  1  percent,  although  the  nearest-neighbor 
contribution  is  lower  by  33  percent. 


Table 20.4

THE MADELUNG CONSTANT α FOR SOME CUBIC CRYSTAL STRUCTURES

CRYSTAL STRUCTURE     MADELUNG CONSTANT α
Cesium chloride       1.7627
Sodium chloride       1.7476
Zincblende            1.6381
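The sodium chloride entry in Table 20.4 can be checked numerically with a summation that keeps every partial sum electrically neutral, in the spirit of the criterion stated above. The sketch below is not the cell-by-cell sum described in the text, nor Ewald's method; it uses the closely related expanding-cube (Evjen-style) summation, in which ions on the faces, edges, and corners of the cube are weighted by 1/2, 1/4, and 1/8. The function name and the choice of cube sizes are illustrative assumptions.

```python
from math import sqrt


def madelung_nacl_evjen(n):
    """Estimate the NaCl Madelung constant from a neutral cube of half-width n.

    Distances are measured in units of the nearest-neighbor separation, and
    the reference ion sits at the origin. Ions on the cube surface are shared
    with neighboring cubes, so each coordinate lying on the boundary
    contributes a factor 1/2 to the weight (face 1/2, edge 1/4, corner 1/8).
    """
    total = 0.0
    for i in range(-n, n + 1):
        for j in range(-n, n + 1):
            for k in range(-n, n + 1):
                if i == 0 and j == 0 and k == 0:
                    continue  # skip the reference ion itself
                weight = 1.0
                for c in (i, j, k):
                    if abs(c) == n:
                        weight *= 0.5
                # Ions with i + j + k even carry the same sign as the reference ion.
                sign = 1.0 if (i + j + k) % 2 == 0 else -1.0
                total += weight * sign / sqrt(i * i + j * j + k * k)
    # The energy per ion pair is -alpha * e^2 / r, so alpha is minus the sum.
    return -total


if __name__ == "__main__":
    for n in (1, 2, 3, 4, 6):
        print(n, round(madelung_nacl_evjen(n), 4))
```

The smallest cube (n = 1) is the cell of Figure 20.2 and gives 3 − 12/(4√2) + 8/(8√3) ≈ 1.456; larger cubes approach the tabulated value quickly because each added shell is neutral.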

The dominant contribution of the Coulomb energy to the cohesive energy of
the alkali halides is demonstrated in Table 20.5, where u^coul(r) is evaluated at the
experimentally observed nearest-neighbor separations, and compared with the
experimentally determined cohesive energies. It can be seen that u^coul alone accounts
for the bulk of the observed binding, being in all cases about 10 percent lower than
the measured cohesive energy.
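As a rough numerical illustration of Eq. (20.18), the following sketch evaluates the electrostatic energy per ion pair for sodium chloride in CGS units. The nearest-neighbor separation of 2.82 Å is an assumed input (it is not quoted in this section), and the function and constant names are hypothetical.

```python
E_CHARGE_ESU = 4.80320e-10  # electron charge in esu (CGS units)
ALPHA_NACL = 1.7476         # Madelung constant from Table 20.4


def coulomb_energy_per_pair(alpha, r_cm):
    """Eq. (20.18): u_coul = -alpha * e^2 / r, in erg per ion pair."""
    return -alpha * E_CHARGE_ESU ** 2 / r_cm


# Assumed nearest-neighbor separation for NaCl (about 2.82 angstroms).
R_NACL_CM = 2.82e-8

if __name__ == "__main__":
    u = coulomb_energy_per_pair(ALPHA_NACL, R_NACL_CM)
    print(f"u_coul = {u / 1e-11:.2f} x 10^-11 erg per ion pair")
```

With this assumed separation the result should land close to the lower NaCl entry of Table 20.5.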

It  is  to  be  expected  that  the  electrostatic  energy  alone  overestimates  the  strength 
of  the  binding,  for  Eq.  (20.18)  omits  any  contribution  from  the  positive  potential 
representing  the  short-range  core-core  repulsion.  This  weakens  the  binding.  We  can 
see  that  the  resulting  correction  will  be  small  by  noting  that  the  potential  representing 
the  core-core  repulsion  is  a  very  rapidly  varying  function  of  ionic  separation.  If  we 
were  to  represent  the  core  as  infinitely  repulsive  hard  spheres,  we  should  find  a 
cohesive  energy  exactly  given  by  the  electrostatic  energy  at  minimum  separation 
(Figure  20.3).  Evidently  this  is  too  extreme.  We  acquire  more  latitude  by  letting  the 
repulsion  vary  with  an  inverse  power  law,  writing  the  total  energy  per  ion  pair  as 


$$u(r) = -\frac{\alpha e^2}{r} + \frac{C}{r^m}. \qquad (20.19)$$


14 P. P. Ewald, Ann. Physik 64, 253 (1921). A particularly nice discussion can be found in J. C. Slater,
Insulators  Semiconductors  and  Metals,  McGraw-Hill,  New  York,  1967,  pp.  215-220. 
Table 20.5

MEASURED COHESIVE ENERGY AND ELECTROSTATIC ENERGY FOR THE ALKALI HALIDES
WITH THE SODIUM CHLORIDE STRUCTURE

        Li          Na        K         Rb        Cs
F      -1.68 (a)   -1.49     -1.32     -1.26     -1.20
       -2.01 (b)   -1.75     -1.51     -1.43     -1.34
Cl     -1.38       -1.27     -1.15     -1.11
       -1.57       -1.43     -1.28     -1.23
Br     -1.32       -1.21     -1.10     -1.06
       -1.47       -1.35     -1.22     -1.18
I      -1.23       -1.13     -1.04     -1.01
       -1.34       -1.24     -1.14     -1.10

(a) The upper figure in each box is the measured cohesive energy (compared with
separated ions) in units of 10^-11 erg per ion pair. Source: M. P. Tosi, Solid State
Physics, vol. 16, F. Seitz and D. Turnbull, eds., Academic Press, New York,
1964, p. 54.

(b) The lower figure in each box is the electrostatic energy as given by Eq. (20.18),
evaluated at the observed nearest-neighbor separation r.


The  equilibrium  separation  r0  is  then  determined  by  minimizing  u.  Setting  u'(r0) 
equal  to  zero  gives 

$$r_0^{m-1} = \frac{mC}{e^2 \alpha}. \qquad (20.20)$$

In  the  noble  gases  we  used  the  corresponding  equation  to  determine  r0  (Eq.  (20.7)), 
but  now,  lacking  an  independent  measurement  of  C,  we  may  use  it  to  determine 




Figure  20.3 

Graph  of  the  pair  potential,  which  is  infinitely 
repulsive  when  r  <  r0  and  Coulombic  when 
r  >  r0.  The  dashed  curve  is  the  extension  of  the 
Coulomb  potential.  The  dotted  curve  is  how  the 
potential  would  be  affected  if  the  repulsion  were 
a  power  law,  rather  than  being  infinitely  strong. 
C in terms of the experimentally measured r0:

$$C = \frac{\alpha e^2 r_0^{m-1}}{m}. \qquad (20.21)$$


We  can  then  substitute  this  back  into  (20.19)  to  find  that  the  theoretical  cohesive 
energy  per  ion  pair  is 


$$u_0 = u(r_0) = -\frac{\alpha e^2}{r_0}\,\frac{m-1}{m}. \qquad (20.22)$$


As  expected,  this  is  only  slightly  smaller  than  (20.18)  for  large  m. 
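The steps leading from (20.19) to (20.22) can be written out explicitly. The following is a worked sketch that uses only the symbols already defined in the text (α, e, C, m, r0); it adds nothing beyond the algebra.

```latex
\begin{align*}
u(r) &= -\frac{\alpha e^2}{r} + \frac{C}{r^m}, \\
u'(r_0) &= \frac{\alpha e^2}{r_0^2} - \frac{mC}{r_0^{m+1}} = 0
  \quad\Longrightarrow\quad r_0^{m-1} = \frac{mC}{e^2\alpha}, \\
C &= \frac{\alpha e^2 r_0^{m-1}}{m}, \\
u(r_0) &= -\frac{\alpha e^2}{r_0} + \frac{\alpha e^2}{m\,r_0}
        = -\frac{\alpha e^2}{r_0}\,\frac{m-1}{m}.
\end{align*}
```

The repulsive term thus reduces the magnitude of the purely electrostatic energy by the fraction 1/m, which is why the correction is small when m is large.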

In  the  noble  gases  we  chose  m  =  12,  for  reasons  of  calculational  convenience, 
noting  that  this  led  to  reasonable  agreement  with  the  data.  The  motivation  for 
m  =  12  is  lacking  in  the  alkali  halides,15  and  if  a  power  law  is  used  to  represent 
the  repulsion,  we  might  just  as  well  determine  the  exponent  by  fitting  the  data  as 
closely  as  possible.  It  is  not  advisable  to  fix  m  by  setting  (20.22)  equal  to  the  observed 
cohesive  energy,  for  (20.22)  is  so  slowly  varying  a  function  of  m  that  small  errors  in 
the experimental measurement will cause large alterations in m. A better procedure
is  to  find  an  independent  measurement  determining  m.  We  can  then  use  that  m  in 
(20.22)  to  see  whether  the  agreement  with  the  experimental  cohesive  energies  is 
thereby  improved  over  the  10  percent  agreement  in  Table  20.5. 

Such  an  independent  determination  of  m  is  provided  by  the  experimentally  mea¬ 
sured  bulk  moduli.  If  B0  and  r0  are,  respectively,  the  equilibrium  bulk  modulus  and 
nearest-neighbor  separation,  then  (see  Problem  2)  m  has  the  value 


$$m = 1 + \frac{18 B_0 r_0^3}{|u^{\text{coul}}(r_0)|}. \qquad (20.23)$$


The values of m obtained from the measured values of B0 and r0 are listed in Table 20.6.
They vary from about 6 to 10. When the purely electrostatic contributions to the
cohesive energy are corrected by the factor (m − 1)/m, agreement with the observed
cohesive energies is considerably improved, being 3 percent or better, except for
the troublesome16 lithium halides and sodium iodide.
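The procedure just described can be sketched numerically for sodium chloride. The inputs r0 = 2.82 Å and B0 = 2.4 × 10^11 dyn/cm² are assumed values supplied for illustration (they are not quoted in this section), and the function names are hypothetical; the formulas are Eqs. (20.23) and (20.22).

```python
E_CHARGE_ESU = 4.80320e-10  # electron charge in esu (CGS units)
ALPHA_NACL = 1.7476         # Madelung constant from Table 20.4


def repulsive_exponent(b0, r0, alpha):
    """Eq. (20.23): m = 1 + 18 B0 r0^3 / |u_coul(r0)| (all CGS units)."""
    u_coul_magnitude = alpha * E_CHARGE_ESU ** 2 / r0
    return 1.0 + 18.0 * b0 * r0 ** 3 / u_coul_magnitude


def corrected_cohesive_energy(r0, alpha, m):
    """Eq. (20.22): electrostatic energy reduced by the factor (m - 1)/m."""
    return -(alpha * E_CHARGE_ESU ** 2 / r0) * (m - 1.0) / m


# Assumed inputs for NaCl; treat both as illustrative rather than authoritative.
R0_NACL_CM = 2.82e-8   # nearest-neighbor separation in cm
B0_NACL_CGS = 2.4e11   # bulk modulus in dyn/cm^2

if __name__ == "__main__":
    m = repulsive_exponent(B0_NACL_CGS, R0_NACL_CM, ALPHA_NACL)
    u0 = corrected_cohesive_energy(R0_NACL_CM, ALPHA_NACL, m)
    print(f"m = {m:.2f}")
    print(f"u0 = {u0 / 1e-11:.2f} x 10^-11 erg per ion pair")
```

With these assumed inputs m falls inside the 6 to 10 range quoted above, and the corrected energy moves from the lower NaCl entry of Table 20.5 toward the measured upper entry.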

This  is  as  much  as  (if  not  more  than)  one  can  expect  from  so  crude  a  theory.  A 
better  analysis  would  make  several  improvements: 

1.  The  core-core  repulsion  is  probably  better  represented  in  an  exponential  form 
(the  so-called  Born-Mayer  potential  being  a  popular  choice)  than  as  a  power  law. 

2.  The  inverse  sixth-power  fluctuating  dipole  force  between  ion  cores  should  be 
taken  into  account. 

3.  The  zero-point  vibrations  of  the  lattice  should  be  allowed  for. 

However,  these  improvements  will  not  alter  our  main  conclusion,  that  the  major 
part  (90  percent)  of  the  cohesive  energy  in  the  ionic  crystals  is  due  simply  to  the 
electrostatic  Coulomb  interactions  among  the  ions,  considered  as  fixed-point  charges. 


15  One  might  expect  m  to  be  rather  less  than  12  simply  because  the  halogen  ions  owing  to  their  excess 
negative charge should have a rather lower density of electrons at their surface than the corresponding
noble  gas  atoms. 

16  See  page  383. 
COHESION IN COVALENT CRYSTALS AND METALS

The  crude  theories  we  have  made  of  the  cohesive  energies  of  molecular  and  ionic 
crystals  are  as  accurate  as  they  are,  primarily  because  in  these  solids  the  valence 
electron  configuration  is  not  appreciably  distorted  from  what  it  is  in  the  isolated 
atoms  (molecular  crystals)  or  ions  (ionic  crystals).  This  ceases  to  be  so  in  covalent 
crystals  and  metals,  which  are  characterized  by  distributions  of  valence  electrons 
that  differ  substantially  from  anything  found  either  in  isolated  atoms  or  in  isolated 
ions  of  the  underlying  material.  Consequently,  to  calculate  the  cohesive  energy  of 
such  solids  one  cannot  merely  calculate  the  classical  potential  energy  of  a  set  of 
weakly  or  negligibly  deformed  atoms  or  ions,  arranged  in  the  appropriate  crystal 
structure.  Instead,  even  the  simplest  calculations  must  include  a  computation  of  the 
energy  levels  of  the  valence  electrons  in  the  presence  of  the  periodic  potential  of  the 
ion  cores. 

Thus  a  theory  of  the  cohesive  energy  of  covalent  crystals  and  metals  must  contain 
a computation of their band structure.17 For this reason there is no model of cohesion
in  such  solids  of  remotely  comparable  simplicity  to  those  we  have  described  for 
molecular  and  ionic  crystals.  The  basis  for  calculations  of  comparable  accuracy  must 
be  provided  by  the  techniques  we  have  described  in  Chapters  10,  11,  and  17.  We 
shall  limit  ourselves  here  to  a  few  qualitative  remarks  about  covalent  crystals,  and 
some  crude  and  highly  inaccurate  estimates  for  metals,  based  on  a  free  electron 
picture. 

Cohesion  in  Covalent  Crystals 

The  theory  of  cohesion  in  good  covalent  insulators  is  quite  similar  to  the  theory  of 
chemical  bonding  in  molecules,18  a  subject  beyond  the  scope  of  this  book.19  The 
way  in  which  the  electrostatic  forces  conspire  to  hold  covalent  crystals  together 
is  considerably  more  subtle  than  the  simple  electrostatic  attraction  between  point 
ions,  which  works  so  well  in  describing  ionic  crystals,  or  even  the  fluctuating  dipole 
interaction  we  used  in  describing  the  solid  noble  gases.  Consider,  for  concreteness, 
the  case  of  diamond  (carbon).  Suppose  that  a  group  of  carbon  atoms  are  placed  onto 
the  sites  of  a  diamond  lattice,  but  with  so  large  a  lattice  constant  that  the  energy 
of  the  collection  is  just  the  sum  of  the  energies  of  the  isolated  atoms  (i.e.,  the  cohesive 
energy  is  zero).  Cohesion  will  result  if  the  energy  of  the  collection  can  be  lowered 
by  reducing  the  lattice  constant  to  its  observed  value.  As  the  lattice  constant  is  reduced, 
there  will  eventually  be  some  overlap  in  the  atomic  wave  functions  centered  on 
different  sites  (cf.  the  discussion  in  Chapter  10).  If  the  outermost  atomic  shells  were 
filled  (as  they  are  in  the  noble  gas  atoms  or  in  the  ions  forming  an  ionic  crystal)  this 
overlap  would  result  in  the  short-range  core-core  repulsion,  and  would  raise  the 


17  Indeed,  cohesive-energy  calculations  provided  the  earliest  motivations  for  accurate  band  structure 
calculations.  It  was  only  later  that  it  became  generally  recognized  that  the  band  structure  itself  was  of 
fundamental  interest,  independent  of  the  problem  of  cohesion. 

18  The  basic  text  is  L.  Pauling,  The  Nature  of  the  Chemical  Bond,  3rd  ed.,  Cornell  University  Press, 
Ithaca,  New  York,  1960. 

19  However,  an  elementary  discussion  of  the  hydrogen  molecule  is  given  in  Chapter  32. 
energy over that of isolated atoms. However, the core-core repulsion of filled atomic
shells  is  a  consequence  of  the  Pauli  exclusion  principle,  together  with  the  fact  that 
the  only  available  electronic  levels,  if  the  outer  shells  are  filled,  lie  much  higher  in 
energy.  If  the  outermost  electronic  shells  are  only  partially  filled  (as  in  carbon),  then 
the  electrons  in  the  outer  shells  can  rearrange  themselves  with  considerably  more 
flexibility  when  the  wave  functions  of  neighboring  atoms  start  to  overlap,  since  other 
levels  with  comparable  energy  in  the  same  shell  are  available. 

It  turns  out  that  under  these  circumstances  the  overlap  of  the  outermost  shells 
generally  leads  to  a  lowering  of  the  total  electronic  energy,  with  the  electrons  forming 
levels  that  are  not  localized  about  a  single  ion  core.  There  is  no  simple  reason  why 
this  is  so.  The  less  localized  the  electronic  wave  function,  the  smaller  need  be  the 
maximum  electronic  momentum  required  by  the  uncertainty  principle,  and  hence 
the  lower  the  electronic  kinetic  energy.  To  this  must  be  added  estimates  of  the  change 
in  potential  energy  in  the  less  localized  levels.  The  net  result  is  generally  a  lowering 
of  the  energy.20 

Cohesion  in  Free  Electron  Metals 

Turning  to  the  other  extreme,  we  can  compare  a  solid  not  with  a  set  of  atoms  but 
with  a  free  electron  gas.  In  Chapter  2  we  noted  that  the  pressure  of  a  free  electron 
gas  at  the  density  of  the  alkali  metals  gave  their  observed  compressibilities  to  within 
a  factor  of  two  or  less.  To  go  from  this  to  a  crude  theory  of  cohesion  in  the  alkali 
metals,  we  must  add  to  the  electron  gas  kinetic  energy  the  total  electrostatic  potential 
energy.  This  contains,  among  other  things,  the  energy  of  attraction  between  the 
positively  charged  ions  and  the  negatively  charged  electron  gas,  without  which  the 
metal  would  not  be  bound  at  all. 

We  treat  the  ions  in  an  alkali  metal  as  point  charges  localized  at  the  sites  of  the 
body-centered  cubic  Bravais  lattice.  We  treat  the  electrons  as  a  uniform  compensating 
background  of  negative  charge.  The  total  electrostatic  energy  per  atom  of  such  a 
configuration  can  be  computed  by  techniques  similar  to  those  used  in  the  elementary 
theory  of  ionic  crystals.  The  result  for  a  bcc  lattice  is21 

$$u^{\text{coul}} = -\frac{24.35}{(r_s/a_0)} \ \text{eV/atom}, \qquad (20.24)$$

where rs is the radius of the Wigner-Seitz sphere (the volume per electron is 4πrs³/3)
and a0 is the Bohr radius. As expected, this term favors high densities (i.e., low rs).

The  attractive  electrostatic  energy  (20.24)  must  be  balanced  against  the  electronic 
kinetic  energy  per  atom.  Since  there  is  one  free  electron  per  atom  in  the  alkali  metals, 
we  have  (see  Chapter  2,  page  37): 

$$u^{\text{kin}} = \frac{3}{5}\varepsilon_F = \frac{30.1}{(r_s/a_0)^2} \ \text{eV/atom}. \qquad (20.25)$$


20  The  discussion  of  the  hydrogen  molecule  in  Chapter  32  provides  an  illustration  of  this  in  an 
especially  simple  case. 

21 See, for example, C. A. Sholl, Proc. Phys. Soc. 92, 434 (1967).
If we wished to be more accurate, we should have to replace (20.25) by the complete
ground-state energy per electron of a uniform electron gas22 at the density 3/4πrs³.
The computation of this is quite difficult (see Chapter 17) and, considering the
crudeness of the electron gas model, of questionable utility for an estimate of real cohesive
energies. Here we shall include only the exchange correction to (20.25) (see Eq. (17.25)):

$$u^{\text{ex}} = -\frac{0.916}{(r_s/a_0)} \ \text{Ry/atom} = -\frac{12.5}{(r_s/a_0)} \ \text{eV/atom}. \qquad (20.26)$$

Note  that  the  exchange  correction  to  the  electron  gas  energy  has  the  same  density 
dependence  as  the  average  electrostatic  energy  (20.24)  and  is  about  half  its  size.  This 
indicates  the  importance  of  electron-electron  interactions  in  metallic  cohesion  and 
the  consequent  difficulties  that  any  adequate  theory  of  cohesion  must  cope  with. 

Adding  these  three  contributions,  we  find  that 


$$u = \frac{30.1}{(r_s/a_0)^2} - \frac{36.8}{(r_s/a_0)} \ \text{eV/atom}. \qquad (20.27)$$

Minimizing this with respect to rs gives:

$$\frac{r_s}{a_0} = 1.6. \qquad (20.28)$$

The observed values of rs/a0 range from 2 to 6 in the alkali metals.23 The failure of
(20.28) even to come close is in (perhaps healthy) contrast to our earlier successes,
and indicates the difficulty in coming to terms with metallic cohesion with any simple
picture. A particularly striking qualitative failure of (20.28) is its prediction of the same
rs for all alkali metals. This result would not be affected by a more accurate
determination of the total electron gas energy, for that would still have the form E(rs),
and minimizing E(rs) − 24.35(a0/rs) would still lead to a unique equilibrium value
of rs, independent of the alkali metal.
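The minimization behind (20.28) is simple enough to check directly. The sketch below takes the coefficients 30.1 and 36.8 from (20.27), finds the stationary point analytically, and confirms it with a brute-force scan; the function names and the scan range are illustrative assumptions.

```python
KINETIC_EV = 30.1    # coefficient of 1/(rs/a0)^2 in (20.27), in eV
ATTRACTIVE_EV = 36.8  # coefficient of 1/(rs/a0) in (20.27), in eV


def energy_per_atom(x):
    """Eq. (20.27): energy in eV per atom as a function of x = rs/a0."""
    return KINETIC_EV / x ** 2 - ATTRACTIVE_EV / x


def equilibrium_rs():
    """Setting du/dx = -2A/x^3 + B/x^2 = 0 gives x = 2A/B."""
    return 2.0 * KINETIC_EV / ATTRACTIVE_EV


def grid_minimum(lo=0.5, hi=10.0, steps=95000):
    """Brute-force check: scan x on a uniform grid and return the best point."""
    best_x = lo
    best_u = energy_per_atom(lo)
    for n in range(1, steps + 1):
        x = lo + (hi - lo) * n / steps
        u = energy_per_atom(x)
        if u < best_u:
            best_x, best_u = x, u
    return best_x


if __name__ == "__main__":
    x0 = equilibrium_rs()
    print(f"rs/a0 = {x0:.2f}, u = {energy_per_atom(x0):.2f} eV/atom")
    print(f"grid check: rs/a0 = {grid_minimum():.2f}")
```

Because both terms depend only on rs/a0, the same minimum is obtained whatever the alkali metal, which is the qualitative failure noted in the text.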

Evidently  some  other  scale  of  length  must  be  introduced  to  distinguish  among  the 
alkali  metals,  and  it  is  not  hard  to  see  what  that  must  be.  Our  treatment  has  pictured 
the  ions  as  points,  even  though  the  real  ion  cores  have  nonnegligible  radii.  The 
approximation  of  point  ions  is  not  as  absurd  in  metals  as  it  would  be  in  molecular 
or  ionic  crystals,  since  the  fraction  of  the  total  volume  occupied  by  the  ions  is  con¬ 
siderably  smaller  in  metals.  However,  in  making  that  approximation  we  have  ignored 
at  least  two  important  effects.  If  the  ion  core  has  a  nonzero  radius,  then  the  conduction 
electron  gas  is  largely  prevented  from  entering  that  fraction  of  the  metallic  volume 
occupied  by  the  ion  cores.  Even  in  a  very  crude  theory,  this  means  that  the  density 
of  the  electron  gas  is  greater  than  we  have  estimated,  and  therefore  its  kinetic  energy 
is  also  greater.  Furthermore,  because  the  conduction  electrons  are  excluded  from  the 
ion  core  regions,  they  cannot  get  as  close  to  the  positively  charged  ions  as  the  picture 


22  Excluding  the  average  electrostatic  energy  of  the  electrons  and  ions,  which  is  already  accounted 
for  in  (20.24).  This  average  electrostatic  energy  is  just  the  Hartree  energy  (Chapter  17),  which  vanishes 
when  the  ions  are  treated  as  a  uniform  positive  background  of  compensating  charge,  rather  than  as  the 
localized  point  charges  on  which  the  computation  of  (20.24)  is  based. 

23  Table  1.1. 
underlying (20.24) assumes. We should therefore expect the electrostatic energy to be
less negative than we have estimated.

Both these effects should cause the equilibrium value of rs/a0 to increase with
increasing  ion  core  radius  (Problem  4).  This  is  consistent  with  the  observed  alkali 
metal  densities.  Evidently,  any  even  moderately  accurate  computation  of  this  crucial 
effect  must  be  quite  subtle,  requiring  good  estimates  of  both  the  conduction  electron 
wave  functions  and  the  crystal  potential  appearing  in  the  one-electron  Schrodinger 
equation. 


PROBLEMS 


1.  A  measure  of  the  importance  of  quantum  effects  in  the  noble  gases  is  the  de  Boer  parameter. 
We  calculated  the  energy  per  atom  u(r)  of  a  noble  gas  (Eq.  (20.5))  under  the  assumption  that  it 
was  entirely  potential  energy.  In  a  quantum  theory,  however,  there  will  be  zero-point  vibrations 
even  at  T  =  0,  leading  to  a  correction  to  Eq.  (20.5)  proportional  to  h. 

(a)  Show,  on  purely  dimensional  grounds,  that  if  the  correction  is  strictly  linear  in  h,  then 
the  correction  to  the  energy  must  have  the  form 

$$\Delta u = \epsilon \Lambda f(r/\sigma), \qquad (20.29)$$

where f depends on the particular noble gas in question only through the ratio r/σ, and

$$\Lambda = \frac{h}{\sigma \sqrt{M\epsilon}}. \qquad (20.30)$$

The quantity Λ, known as the de Boer parameter, is listed in Table 20.7. Since h/σ is the
uncertainty in the momentum of a particle localized to within a distance σ, Λ² is roughly the ratio
of the kinetic energy of zero-point motion of an atom to the magnitude of the attractive
interaction. The size of Λ is thus a measure of the importance of quantum effects (and a glance at
Table 20.7 immediately indicates why our purely classical discussion cannot hope to cope with
solid helium).


Table 20.7

THE DE BOER PARAMETER FOR THE NOBLE GASES, INCLUDING THE TWO HELIUM ISOTOPES

       3He     4He     Ne      Ar      Kr      Xe
Λ      3.1     2.6     0.59    0.19    0.10    0.064

(b) Let r^c be the equilibrium interparticle distance computed by minimizing the classical
energy (20.5) and r^c + Δr be the value obtained by minimizing the classical energy plus the
quantum correction (20.29). Show, under the assumption Δr ≪ r^c, that the ratio of the values of
Δr/r^c for any two noble gases is equal to the ratio of their de Boer parameters.

(c)  Show  that  the  result  of  (b)  also  holds  for  the  fractional  changes  in  internal  energy  and 
bulk  moduli,  due  to  quantum  corrections. 

These  conclusions  are  compared  with  the  data  for  neon  and  argon  in  Table  20.8.  (In  the  cases 
of  krypton  and  xenon  the  deviations  from  the  classical  values  are  too  small  to  be  reliably  extracted 
from  the  data;  in  the  case  of  the  helium  isotopes  the  de  Boer  parameter  is  too  large  for  this  analysis 
to be reliable.) Chapter 25 describes how the effects of zero-point vibrations can be more accurately
taken  into  account. 


Table 20.8

COMPARATIVE SIZE OF QUANTUM CORRECTIONS TO THE EQUILIBRIUM PROPERTIES OF NEON AND ARGON

X           X_Ne     X_Ar     X_Ne/X_Ar
Λ           0.59     0.19     3.1
Δr/r^c      0.047    0.011    4.3
Δu/u^c      0.26     0.10     2.6
ΔB/B^c      0.39     0.15     2.6
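The de Boer parameter of Eq. (20.30) is easy to evaluate once the Lennard-Jones parameters are known. The sketch below does this for neon and argon. Everything numerical here is an assumption for illustration: the values of ε and σ (0.0031 eV and 2.74 Å for Ne, 0.0104 eV and 3.40 Å for Ar) are not quoted in this section, and h is taken to be Planck's constant not divided by 2π, a choice made because it reproduces the entries of Table 20.7.

```python
from math import sqrt

PLANCK_H_ERG_S = 6.62607e-27  # Planck's constant h in erg s (assumed: h, not h/2pi)
AMU_G = 1.66054e-24           # atomic mass unit in grams
EV_ERG = 1.602177e-12         # one electron volt in erg
ANGSTROM_CM = 1.0e-8          # one angstrom in cm


def de_boer_parameter(mass_amu, epsilon_ev, sigma_angstrom):
    """Eq. (20.30): Lambda = h / (sigma * sqrt(M * epsilon)), dimensionless."""
    mass = mass_amu * AMU_G
    epsilon = epsilon_ev * EV_ERG
    sigma = sigma_angstrom * ANGSTROM_CM
    return PLANCK_H_ERG_S / (sigma * sqrt(mass * epsilon))


# Assumed Lennard-Jones parameters: (mass in amu, epsilon in eV, sigma in angstroms).
GASES = {
    "Ne": (20.18, 0.0031, 2.74),
    "Ar": (39.95, 0.0104, 3.40),
}

if __name__ == "__main__":
    for name, params in GASES.items():
        print(name, round(de_boer_parameter(*params), 2))
```

Under these assumptions the two values should round to the Ne and Ar entries of Table 20.7.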

2.  Show  that  the  bulk  modulus  for  an  ionic  crystal  with  the  NaCl  structure  is  given  by 


$$B_0 = \frac{1}{18 r_0} \left. \frac{d^2 u}{dr^2} \right|_{r = r_0}, \qquad (20.31)$$

where r0 is the equilibrium nearest-neighbor separation. Show that the form (20.19) for the total
energy per ion pair gives

$$B_0 = \frac{(m - 1)}{18} \frac{\alpha e^2}{r_0^4}, \qquad (20.32)$$

and hence that

$$m = 1 + \frac{18 B_0 r_0^3}{|u^{\text{coul}}(r_0)|}, \qquad (20.33)$$

where u^coul(r) is the energy per ion pair of a crystal of point charges at nearest-neighbor separation r.


3. We can use the form (20.19) of the cohesive energy per ion pair to investigate the stability of
the possible crystal structures an ionic crystal may assume. Assuming that the coupling constant
C characterizing the contribution of the short-range repulsion is proportional to the
coordination number Z, show that the equilibrium cohesive energy for different lattice types varies as
(α^m/Z)^{1/(m−1)}, and use the values of α in Table 20.4 to construct a table of relative stability according
to the value of m. (Hint: First examine the cases of large or small m.)


4.  (a)  As  a  very  crude  model  of  an  alkali  metal,  suppose  that  the  charge  of  each  valence  electron 
is  uniformly  distributed  through  a  sphere  of  radius  rs  about  each  ion.  Show  that  the  electrostatic 
energy  per  electron  is  then 


$$u^{\text{coul}} = -\frac{9 a_0}{5 r_s} \ \text{Ry/electron} = -\frac{24.49}{(r_s/a_0)} \ \text{eV/electron}. \qquad (20.34)$$

(This  is  remarkably  close  to  the  result  (20.24)  for  a  bcc  lattice  of  ions  immersed  in  a  completely 
uniform  distribution  of  compensating  negative  charge.) 

(b)  In  a  real  metal  the  valence  electrons  are  largely  excluded  from  the  ion  core.  If  we  take  this 
into  account  by  uniformly  distributing  the  charge  of  each  electron  in  the  region  between  spheres  of 
radius  rc  and  rs  about  each  ion,  and  then  replacing  the  potential  of  each  ion  by  the  pseudopotential, 

$$V(r) = \begin{cases} -\dfrac{e^2}{r}, & r > r_c, \\ 0, & r < r_c, \end{cases} \qquad (20.35)$$

show that (20.34) must be replaced to leading order in rc/rs by

$$-\frac{9 a_0}{5 r_s} + \frac{3 (r_c/a_0)^2}{(r_s/a_0)^3} \ \text{Ry/electron}. \qquad (20.36)$$

(c) Taking the energy per particle to be the sum of the kinetic (20.25), exchange (20.26),
and potential (20.36) energies, show that the equilibrium value of rs/a0 is given by

$$r_s/a_0 = 0.82 + 1.82 (r_c/a_0)\,[1 + O(a_0/r_c)^2], \qquad (20.37)$$

and  compare  this  with  the  values  given  in  Table  1.1  and  Table  19.4. 
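A numerical cross-check of the limiting form (20.37) can be set up as follows. This is a sketch under stated assumptions, not a solution of the problem: it simply adds (20.25), (20.26), and (20.36) in rydbergs, takes 1 Ry = 13.6057 eV to convert the kinetic coefficient, and solves the stationarity condition, which reduces to a quadratic in x = rs/a0. The function and constant names are hypothetical.

```python
from math import sqrt

RYDBERG_EV = 13.6057              # assumed conversion factor, 1 Ry in eV
KINETIC_RY = 30.1 / RYDBERG_EV    # coefficient of 1/x^2 from (20.25), in Ry
EXCHANGE_RY = 0.916               # coefficient of -1/x from (20.26), in Ry
COULOMB_RY = 9.0 / 5.0            # coefficient of -1/x from (20.36), in Ry


def energy_ry(x, c):
    """Total energy per electron in Ry, with x = rs/a0 and c = rc/a0."""
    return (KINETIC_RY / x ** 2
            - (EXCHANGE_RY + COULOMB_RY) / x
            + 3.0 * c ** 2 / x ** 3)


def equilibrium_rs(c):
    """Positive root of dE/dx = 0, i.e. of b*x^2 - 2*a*x - 9*c^2 = 0."""
    a = KINETIC_RY
    b = EXCHANGE_RY + COULOMB_RY
    return (2.0 * a + sqrt(4.0 * a * a + 36.0 * b * c * c)) / (2.0 * b)


if __name__ == "__main__":
    for c in (0.0, 1.0, 2.0, 3.0):
        print(f"rc/a0 = {c:.1f}: rs/a0 = {equilibrium_rs(c):.2f}, "
              f"linear form = {0.82 + 1.82 * c:.2f}")
```

For rc = 0 the model reverts to point ions, and the equilibrium rs/a0 grows roughly linearly with rc/a0 once the core radius is appreciable.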


21 

Failures  of  the 
Static  Lattice 
Model 




In  Chapter  3  we  reviewed  the  limitations  of  the  free  electron  theory  of  metals,  citing 
a  variety  of  phenomena  that  could  only  be  explained  by  the  presence  of  the  periodic 
potential  arising  from  the  lattice  of  ions.1  In  subsequent  chapters  the  periodic  array 
of  ions  has  played  a  critical  role  in  our  treatment  of  metals  and  insulators.  In  all  such 
discussions  we  have  taken  the  ions  to  constitute  a  fixed,  rigid,  immobile  periodic 
array.2 This, however, is only an approximation to the actual ionic configuration,3
for the ions are not infinitely massive, nor are they held in place by infinitely strong
forces. Consequently, in a classical theory the static lattice model can be valid only
at zero temperature. At nonzero temperatures each ion must have some thermal
energy, and therefore a certain amount of motion in the vicinity of its equilibrium
position. Furthermore, in a quantum theory even at zero temperature the static
lattice model is incorrect, because the uncertainty principle (Δx Δp ≳ h) requires
localized ions to possess some nonvanishing mean square momentum.4

The  oversimplified  model  of  immobile  ions  has  been  impressively  successful  in 
accounting  for  a  wealth  of  detailed  metallic  equilibrium  and  transport  properties 
dominated  by  the  behavior  of  the  conduction  electrons,  provided  that  one  does  not 
inquire  into  the  source  of  the  electronic  collisions.  We  have  also  had  some  success 
with  the  static  lattice  model  in  accounting  for  the  equilibrium  properties  of  ionic  and 
molecular  insulators. 

However,  it  is  necessary  to  go  beyond  the  static  lattice  model  to  fill  in  the  many 
gaps in our understanding of metals (some of which, for example the theory of the
temperature dependence of the DC conductivity, are substantial) and to achieve
anything  beyond  the  most  rudimentary  theory  of  insulators.  The  limitations  of  a 
static  lattice  theory  are  particularly  severe  in  the  theory  of  insulators,  since  the 
electronic  system  in  an  insulator  is  comparatively  passive,  all  electrons  residing  in 
filled  bands.  Except  in  phenomena  that  supply  enough  energy  to  excite  electrons 
across  the  energy  gap  Eg  between  the  top  of  the  highest  filled  band  and  the  lowest 
empty  levels,  insulators  are  electronically  quiescent.  If  we  adhered  to  the  static  lattice 
approximation  in  insulators,  we  would  have  no  degrees  of  freedom  left  to  account 
for  their  rich  and  varied  properties. 

In  this  chapter  we  shall  summarize  some  of  the  ways  in  which  the  static  lattice 
model  fails  to  cope  with  experimental  facts.  In  subsequent  chapters  we  shall  turn  to 
the  dynamical  theory  of  lattice  vibrations,  which  in  one  form  or  another  will  be  the 
primary  subject  of  Chapters  22  to  27. 

We  have  grouped  the  major  shortcomings  of  the  static  lattice  model  into  three 
broad  categories: 


1  Recall  that,  when  used  in  this  general  way,  the  word  “ion”  means  ions  in  ionic  crystals,  ion  cores 
in  metals  and  covalent  crystals,  and  atoms  in  a  rare  gas  solid. 

2 Except in Chapter 20, where we considered a uniform expansion of the array, and briefly considered
atomic zero-point motion in the solid noble gases.

3 We do not have in mind the fact that any real crystal has imperfections, i.e., static deviations from
perfect  periodicity  (see  Chapter  30).  These  can  still  be  described  in  terms  of  a  static  lattice.  Our  concern 
here  is  with  dynamic  deviations  from  periodicity  associated  with  the  vibrations  of  the  ions  about  their 
equilibrium  positions.  These  always  occur,  even  in  an  otherwise  perfect  crystal. 

4  We  have  discussed  this  consequence  of  the  uncertainty  principle  on  page  396,  and  have  seen  some 
simple evidence for it in our estimates of the cohesive energies of the rare gas solids (Problem 1, Chapter 20).
1. Failures to explain equilibrium properties.

2.  Failures  to  explain  transport  properties. 

3.  Failures  to  explain  the  interaction  of  various  types  of  radiation  with  the  solid. 

EQUILIBRIUM  PROPERTIES 

All  equilibrium  properties  are  affected  in  varying  degrees  by  the  lattice  vibrations. 
We  list  below  some  of  the  most  important. 

Specific  Heat 

The  static  lattice  model  attributes  the  specific  heat  of  a  metal  to  the  electronic  degrees 
of  freedom.  It  predicts  a  linear  temperature  dependence  at  temperatures  well  below 
the Fermi temperature, i.e., all the way to the melting point. Such linear behavior
is indeed observed (Chapter 2), but only up to temperatures of order 10 K. At higher
temperatures the specific heat increases much more rapidly (as T³) and at still higher
temperatures (typically between 10² and 10³ K) approaches a roughly constant value.
This additional (and, above 10 K, dominant) contribution to the specific heat is
entirely  due  to  the  hitherto  neglected  degrees  of  freedom  of  the  lattice  of  ions. 

Insulators  provide  further  evidence  that  the  ions  contribute  to  the  specific  heat. 
If  static  lattice  theory  were  literally  correct,  the  thermal  energy  of  an  insulator  would 
differ from that at $T = 0$ only to the extent that electrons were thermally excited
across the energy gap $E_g$. The number of electrons so excited can be shown (Chapter
28) to have a temperature dependence dominated by $e^{-E_g/2k_BT}$, at temperatures
below $E_g/k_B$ (i.e., at all temperatures of interest, if $E_g$ is as big as an electron volt).
This exponential also dominates the behavior of $c_v = du/dT$. However, the observed
low-temperature specific heat of insulators is not exponential, but varies as $T^3$.
Both in insulators and in metals this $T^3$ contribution to $c_v$ can be explained by
introducing  the  motion  of  the  lattice  into  the  theory  in  a  quantum-mechanical  way. 
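
To see how small this exponential factor is in practice, the following minimal sketch evaluates it for an assumed gap of 1 eV at a few sample temperatures. The rounded value of $k_B$ in eV/K, the function name, and the sample temperatures are illustrative choices of ours, not part of the text.

```python
import math

K_B_EV_PER_K = 8.617e-5  # Boltzmann constant in eV/K (rounded standard value; our assumption)


def excitation_factor(gap_ev, temperature_k):
    """Return exp(-E_g / (2 k_B T)), the factor that dominates the number of
    electrons thermally excited across a gap E_g at temperature T."""
    return math.exp(-gap_ev / (2.0 * K_B_EV_PER_K * temperature_k))


# Illustrative temperatures (in K) for an assumed 1 eV gap.
for temperature in (100.0, 300.0, 1000.0):
    print(temperature, excitation_factor(1.0, temperature))
```

At room temperature the factor is of order $10^{-9}$ for this gap, which is why an exponentially activated electronic contribution cannot account for a specific heat that varies as $T^3$.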

Equilibrium  Density  and  Cohesive  Energies 

We  have  mentioned  in  Chapter  20  that  zero-point  vibrations  must  be  included  in  a 
calculation  of  the  ground-state  energy  of  a  solid,  and  hence  in  a  calculation  of  its 
equilibrium density and cohesive energy. The contribution of the zero-point vibrations
of the ions is considerably smaller than the potential energy terms in most
crystals, but, as we have seen, leads to easily observable effects in neon and argon.5

Thermal  Expansion 

The  equilibrium  density  of  a  solid  depends  on  the  temperature.  In  the  static  lattice 
model  the  only  effect  of  temperature  is  the  excitation  of  electrons.  In  insulators  such 
excitations are of negligible importance at temperatures below $E_g/k_B$. The thermal


5  In  solid  helium  the  zero-point  vibrations  are  so  substantial  that  they  cannot  be  ignored  even  in  a 
first  approximation.  For  this  reason,  the  solid  forms  of  the  two  helium  isotopes  (of  mass  number  3  and  4) 
are  often  referred  to  as  quantum  solids. 
expansion of insulators (and, as it turns out, of metals, too) is critically related to
the ionic degrees of freedom. In a sense this is merely the $T \neq 0$ version of the point
made  in  the  preceding  paragraph,  but  the  lattice  vibrations  generally  provide  only  a 
small  correction  to  the  T  =  0  equilibrium  size,  whereas  they  play  a  central  role  in 
determining  the  thermal  expansion. 

Melting 

At  sufficiently  high  temperatures  solids  melt;  i.e.,  the  ions  leave  their  equilibrium 
positions  and  wander  great  distances  through  the  resulting  liquid.  Here  the  static 
lattice  hypothesis  fails  with  a  vengeance;  however,  even  below  the  melting  point, 
when  the  ions  remain  in  the  vicinity  of  their  equilibrium  positions,  it  is  clear  that  any 
adequate  theory  of  the  melting  process  (and  only  very  crude  ones  exist)  must  take 
into account the increasing amplitude of the lattice vibrations with increasing temperature.


TRANSPORT  PROPERTIES 

In  Chapters  1, 2,  12,  and  13  we  examined  those  transport  properties  of  a  metal  that 
depend  almost  entirely  on  its  electronic  structure.  However,  many  aspects  of  transport 
in  metals,  and  all  aspects  of  transport  in  insulators,  can  only  be  understood  when  the 
vibrations  of  the  lattice  are  taken  into  account. 

Temperature  Dependence  of  the  Electronic  Relaxation  Time 

In  a  perfect  periodic  potential  an  electron  would  suffer  no  collisions  and  the  electrical 
and  thermal  conductivities  of  such  a  metal  would  be  infinite.  We  have  on  occasion 
referred  to  the  fact  that  one  of  the  major  sources  of  scattering  in  a  metal  is  the  deviation 
of  the  lattice  from  perfect  periodicity  due  to  the  thermal  vibrations  of  the  ions  about 
their equilibrium sites. This is responsible for the characteristic $T^5$ term in the electrical
resistivity at low temperatures, as well as its linear growth with T at high
temperatures  (Chapter  26).  The  static  lattice  model  cannot  explain  these  facts. 

Failure  of  the  Wiedemann-Franz  Law 

The  failure  of  the  Wiedemann-Franz  law  at  intermediate  temperatures  (page  59) 
finds  a  simple  explanation  in  the  theory  of  how  electrons  are  scattered  by  the  lattice 
vibrations. 

Superconductivity 

Below  a  certain  temperature  (20  K  or  substantially  less)  the  resistivity  of  certain 
metals  (known  as  superconductors)  drops  abruptly  to  zero.  A  full  explanation  for 
this  was  not  given  until  1957.  It  is  now  understood,  and  one  of  the  crucial  parts  of 
the  explanation  is  the  influence  of  lattice  vibrations  on  the  effective  interaction 
between two electrons in a metal (Chapter 34). If the lattice were rigorously static,
there  would  be  no  superconductors.6 

Thermal  Conductivity  of  Insulators 

Most metallic transport properties have no analogues in insulators. However, electrical
insulators do conduct heat. To be sure, they do not conduct it as well as metals:
The  far  end  of  a  silver  spoon  dipped  in  coffee  becomes  hot  much  faster  than  the  handle 
of  the  ceramic  cup.  However,  from  the  point  of  view  of  a  static  lattice  model  there  is 
no mechanism for insulators to conduct appreciable amounts of heat at all; there
are simply too few electrons in partially filled bands to do the job. The thermal conductivity
of insulators is due predominantly to the lattice degrees of freedom.

Transmission  of  Sound 

Insulators transmit not only heat; they transmit sound as well, in the form of vibrational
waves in the lattice of ions. In the static lattice model electrical insulators
would  also  be  acoustic  insulators. 


INTERACTION  WITH  RADIATION 

We  have  discussed  the  interaction  of  radiation  with  solids  in  Chapter  6  (X  radiation) 
and parts of Chapters 1 and 15 (optical properties of metals). There is a wealth of
additional  data  on  the  response  of  solids  to  radiation  that  cannot  be  explained  in 
terms  of  the  response  of  electrons  in  a  fixed  array  of  ions.  Some  important  examples 
are  the  following: 

Reflectivity  of  Ionic  Crystals 

Ionic  crystals  exhibit  a  sharp  maximum  in  their  reflectivity  at  frequencies  in  the 
infrared, corresponding to values of $\hbar\omega$ far below their electronic energy gap. The
phenomenon  therefore  cannot  be  due  to  an  electronic  excitation.  It  arises  from  the 
fact  that  the  electric  field  in  the  radiation  exerts  oppositely  directed  forces  on  the 
positive  and  negative  ions,  thereby  displacing  them  with  respect  to  one  another.  The 
proper  explanation  of  this  phenomenon  requires  a  theory  of  lattice  vibrations. 

Inelastic  Scattering  of  Light 

When  laser  light  is  scattered  from  crystals  some  components  of  the  reflected  beam 
have  small  shifts  in  frequency  (Brillouin  and  Raman  scattering).  The  explanation  of 
this  phenomenon  requires  a  quantum  theory  of  lattice  vibrations. 


6  More  precisely,  there  might  still  be  superconductors,  but  they  would  be  quite  different  from  the 
ones  we  know  today.  Alternative  mechanisms  for  superconductivity  have  been  proposed  that  do  not  rely 
on  the  effect  of  lattice  vibrations  on  the  electron-electron  interaction,  but  examples  of  superconductivity 
based  on  such  mechanisms  have  yet  to  be  found. 
Scattering of X Rays

The  intensity  of  X  radiation  in  the  Bragg  peaks  predicted  by  the  static  lattice  model 
is  incorrect.  The  thermal  vibrations  of  the  ions  about  their  equilibrium  positions  (and 
even  the  zero-point  vibrations  at  T  =  0)  diminish  the  amplitude  of  the  Bragg  peaks. 
In  addition,  because  the  lattice  is  not  static,  there  is  a  background  of  scattered  X 
radiation  in  directions  that  do  not  satisfy  the  Bragg  condition. 

Scattering  of  Neutrons 

When  neutrons7  are  scattered  off  crystalline  solids,  they  are  found  to  lose  energy  only 
in  definite  discrete  amounts,  which  depend  on  the  change  of  momentum  suffered  in 
the scattering. The quantum theory of lattice vibrations gives a very simple explanation
of this phenomenon, and neutrons are one of the most valuable probes of a
solid  we  possess. 

The  foregoing  is  in  no  sense  an  exhaustive  list  of  the  ways  in  which  lattice  vibrations 
make  themselves  felt.  However,  it  does  illustrate  most  of  the  important  functions  the 
lattice  vibrations  perform: 

1.  The  ability  of  the  ions  to  vibrate  about  their  equilibrium  positions  is  essential  in 
determining  any  equilibrium  property  of  a  solid  that  is  not  dominated  by  a  very 
much  larger  contribution  from  the  electrons. 

2.  The  lattice  vibrations  provide  a  mechanism  for  transporting  energy  through  a 
solid. 

3.  The  lattice  vibrations  are  a  crucial  source  of  electronic  scattering  in  metals,  and 
can  profoundly  affect  the  interaction  between  electrons. 

4.  The  lattice  vibrations  play  a  role  in  the  response  of  a  solid  to  any  probe  that 
couples  to  the  ions,  such  as  visible  light,  X  rays,  or  neutrons. 

We  shall  examine  these  and  other  aspects  of  the  lattice  vibrations  in  Chapters  22 
to  27. 


7  From  the  quantum-mechanical  point  of  view  a  beam  of  neutrons  of  energy  E  and  momentum  p 
can be regarded as a beam of radiation of angular frequency $\omega = E/\hbar$ and wave vector $\mathbf{k} = \mathbf{p}/\hbar$.
In relaxing the artificial assumption that the ions sit without moving at the sites R
of  a  Bravais  lattice,  we  shall  rely  extensively  on  two  weaker  assumptions: 

1.  We  shall  assume  that  the  mean  equilibrium  position  of  each  ion  is  a  Bravais 
lattice  site.  We  can  then  continue  to  associate  with  each  ion  a  particular  Bravais 
lattice  site  R,  about  which  the  ion  oscillates,  but  the  site  R  is  now  only  the 
mean  position  of  the  ion,  and  not  its  fixed  instantaneous  position. 

2.  We  shall  assume  that  the  typical  excursions  of  each  ion  from  its  equilibrium 
position  are  small  compared  with  the  interionic  spacing  (in  a  sense  to  be  made 
more  precise  below). 

Assumption  1  accounts  for  the  observed  crystalline  structure  of  solids  by  asserting 
that  the  Bravais  lattice  is  still  there  in  spite  of  ionic  motion,  but  that  it  describes  the 
average  ionic  configuration  rather  than  the  instantaneous  one.  Note  that  although 
this  assumption  permits  a  wide  range  of  possible  ionic  motion,  it  does  not  allow 
for  ionic  diffusion:  The  oscillations  of  each  ion  are  assumed  to  be  forever  about  a 
particular  Bravais  lattice  site  R.  Except  when  the  possibility  of  ions  interchanging 
equilibrium  positions  becomes  important  (as  it  might,  for  example,  near  the  melting 
point)  this  assumption  is  not  terribly  restrictive. 

Assumption  2,  however,  is  not  made  out  of  any  strong  conviction  in  its  general 
validity,  but  on  grounds  of  analytical  necessity.  It  leads  to  a  simple  theory — the 
harmonic  approximation — from  which  precise  quantitative  results  can  be  extracted. 
These  results  are  often  in  excellent  agreement  with  observed  solid  properties.  Some 
properties,  however,  cannot  be  accounted  for  by  a  harmonic  theory,  and  to  explain 
them it is necessary to go to an anharmonic theory (Chapter 25). Even in such cases,
however,  the  method  of  computation  implicitly  continues  to  rely  on  assumption  2, 
although  it  is  exploited  in  a  more  sophisticated  way.  When  assumption  2  genuinely 
fails  (as  appears  to  be  the  case  in  solid  helium),  one  must  deal  from  the  start  with 
a  theory  of  formidable  analytical  complexity,  and  it  is  only  quite  recently  that  any 
progress  has  been  made  along  these  lines.1 

Because  of  assumption  1  we  may  unambiguously  label  each  ion  with  the  Bravais 
lattice  site  R  about  which  it  oscillates.2  We  denote  the  position  of  the  ion  whose 
mean  position  is  R  by  r(R)  (see  Figure  22.1).  If  the  static  lattice  approximation  were 
valid,  i.e.,  if  each  ion  were  stationary  at  its  Bravais  lattice  site,  we  would  have  r(R)  =  R. 
In the more realistic case, however, r(R) will deviate from its average value R, and
we  may  write  at  any  given  time:3 

r(R)  =  R  +  u(R),  (22.1) 


1 Under the romantic name of “the theory of quantum solids.” The nomenclature refers to the fact
that according to classical theory, assumption 2 holds in any solid at low enough T. It is only the uncertainty
principle that requires some deviations from equilibrium in the ionic positions no matter how low the
temperature.

2 In much of this chapter we shall deal explicitly only with monatomic Bravais lattices; i.e., solids
whose crystal structures consist of a single ion per primitive cell located at the sites $\mathbf{R} = n_1\mathbf{a}_1 + n_2\mathbf{a}_2 + n_3\mathbf{a}_3$
of a Bravais lattice. The generalization of the discussion to lattices with a basis of n atoms per primitive
cell, located at $\mathbf{R} + \mathbf{d}_1, \mathbf{R} + \mathbf{d}_2, \ldots, \mathbf{R} + \mathbf{d}_n$, is straightforward, but the notation can be cumbersome.

3 More generally, in describing a lattice with a basis, we would let $\mathbf{r}_j(\mathbf{R})$ describe the position of the
jth basis atom in the primitive cell about R, and write $\mathbf{r}_j(\mathbf{R}) = \mathbf{R} + \mathbf{d}_j + \mathbf{u}_j(\mathbf{R})$.


Figure 22.1

(a) The Bravais lattice of points, specified by vectors R. (b) A particular
instantaneous configuration of ions. The ion whose mean
position is R is found at r(R).

where  u(R)  is  the  deviation  from  equilibrium  of  the  ion  whose  equilibrium  site  is  R. 
(See  Figure  22.2.) 

To  make  the  discussion  concrete,  let  us  reexamine  our  treatment  of  the  cohesive 
energy  of  the  noble  gases  (Chapter  20)  within  this  broader  framework.  We  continue 
to assume that a pair of atoms separated by r contributes an amount $\phi(\mathbf{r})$ to the
potential energy of the crystal, where $\phi$ is, for example, the Lennard-Jones potential
(20.2).  If  the  static  lattice  model  were  correct,  and  each  atom  remained  fixed  at  its 
Bravais  lattice  site,  the  total  potential  energy  of  the  crystal  would  just  be  the  sum 
of  the  contributions  of  all  distinct  pairs: 

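$$U = \frac{1}{2} \sum_{\mathbf{R}\mathbf{R}'} \phi(\mathbf{R} - \mathbf{R}') = \frac{N}{2} \sum_{\mathbf{R} \neq 0} \phi(\mathbf{R}). \tag{22.2}$$

If, however, we allow for the fact that the atom whose average position is R will in
general be found at a position $\mathbf{r}(\mathbf{R}) \neq \mathbf{R}$, then we must replace (22.2) by

$$U = \frac{1}{2} \sum_{\mathbf{R}\mathbf{R}'} \phi(\mathbf{r}(\mathbf{R}) - \mathbf{r}(\mathbf{R}')) = \frac{1}{2} \sum_{\mathbf{R}\mathbf{R}'} \phi(\mathbf{R} - \mathbf{R}' + \mathbf{u}(\mathbf{R}) - \mathbf{u}(\mathbf{R}')). \tag{22.3}$$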

Thus  the  potential  energy  now  depends  on  the  dynamical  variables  u(R),  and  we 
must  come  to  grips  with  the  dynamical  (or  statistical  mechanical)  problem  governed 
by  the  Hamiltonian:4 


Figure 22.2

The relation between the Bravais lattice vector R, the instantaneous
position r(R) of the ion that oscillates about R,
and the ionic displacement, u(R) = r(R) - R.

4 In lattice dynamics one generally takes the canonical coordinates to be the u(R) rather than the
r(R); i.e., each ion is referred to a different origin.

$$H = \sum_{\mathbf{R}} \frac{P(\mathbf{R})^2}{2M} + U, \tag{22.4}$$


where  P(R)  is  the  momentum  of  the  atom  whose  equilibrium  position  is  R,  and  M 
is  the  atomic  mass. 


THE  HARMONIC  APPROXIMATION 

For a pair potential $\phi$ of the Lennard-Jones form the extraction of useful exact
information  from  this  Hamiltonian  is  a  task  of  hopeless  difficulty.  One  therefore 
resorts  to  an  approximation  based  on  the  expectation  that  the  atoms  will  not  deviate 
substantially  from  their  equilibrium  positions.  If  all  the  u(R)  are  small,5  then  we  can 
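expand the potential energy U about its equilibrium value, using the three-dimensional
form of Taylor’s theorem:

$$f(\mathbf{r} + \mathbf{a}) = f(\mathbf{r}) + \mathbf{a} \cdot \nabla f(\mathbf{r}) + \frac{1}{2} (\mathbf{a} \cdot \nabla)^2 f(\mathbf{r}) + \frac{1}{3!} (\mathbf{a} \cdot \nabla)^3 f(\mathbf{r}) + \cdots. \tag{22.5}$$

Applying this to each term of (22.3), with $\mathbf{r} = \mathbf{R} - \mathbf{R}'$ and $\mathbf{a} = \mathbf{u}(\mathbf{R}) - \mathbf{u}(\mathbf{R}')$, we find:

$$\begin{aligned}
U = {} & \frac{N}{2} \sum_{\mathbf{R} \neq 0} \phi(\mathbf{R}) + \frac{1}{2} \sum_{\mathbf{R}\mathbf{R}'} (\mathbf{u}(\mathbf{R}) - \mathbf{u}(\mathbf{R}')) \cdot \nabla \phi(\mathbf{R} - \mathbf{R}') \\
& + \frac{1}{4} \sum_{\mathbf{R}\mathbf{R}'} [(\mathbf{u}(\mathbf{R}) - \mathbf{u}(\mathbf{R}')) \cdot \nabla]^2 \phi(\mathbf{R} - \mathbf{R}') + O(u^3).
\end{aligned} \tag{22.6}$$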

The  coefficient  of  u(R)  in  the  linear  term  is  just 


$$\sum_{\mathbf{R}'} \nabla \phi(\mathbf{R} - \mathbf{R}'). \tag{22.7}$$

This,  however,  is  just  minus  the  force  exerted  on  the  atom  at  R  by  all  the  other  atoms, 
when  each  is  placed  at  its  equilibrium  position.  It  must  therefore  vanish,  since  there 
is  no  net  force  on  any  atom  in  equilibrium. 
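
As a numerical illustration, the sketch below evaluates the sum (22.7) for the atom at the origin of an fcc Bravais lattice with a Lennard-Jones pair potential. The lattice constant, the cutoff radius, the reduced units ($\epsilon = \sigma = 1$), and all function names are assumptions made only for this example. Because every lattice vector R has a partner -R and $\phi$ depends only on |r|, the terms cancel in pairs.

```python
import math


def lj_radial_derivative(r, eps=1.0, sigma=1.0):
    """d(phi)/dr for phi(r) = 4 eps [(sigma/r)^12 - (sigma/r)^6]."""
    return 4.0 * eps * (-12.0 * sigma**12 / r**13 + 6.0 * sigma**6 / r**7)


def fcc_vectors(a, n_max, cutoff):
    """Nonzero fcc Bravais lattice vectors R = n1 a1 + n2 a2 + n3 a3 with |R| <= cutoff."""
    primitive = [(0.0, a / 2, a / 2), (a / 2, 0.0, a / 2), (a / 2, a / 2, 0.0)]
    vectors = []
    for n1 in range(-n_max, n_max + 1):
        for n2 in range(-n_max, n_max + 1):
            for n3 in range(-n_max, n_max + 1):
                if (n1, n2, n3) == (0, 0, 0):
                    continue
                R = tuple(
                    n1 * primitive[0][k] + n2 * primitive[1][k] + n3 * primitive[2][k]
                    for k in range(3)
                )
                if math.sqrt(sum(c * c for c in R)) <= cutoff:
                    vectors.append(R)
    return vectors


def linear_coefficient(a=1.5, n_max=6, cutoff=4.0):
    """Sum of grad phi(R) over all neighbors of the atom at the origin, as in (22.7)."""
    total = [0.0, 0.0, 0.0]
    for R in fcc_vectors(a, n_max, cutoff):
        r = math.sqrt(sum(c * c for c in R))
        slope = lj_radial_derivative(r)
        for k in range(3):
            total[k] += slope * R[k] / r  # grad phi = phi'(r) * R / |R|
    return total


print(linear_coefficient())  # each component vanishes up to rounding error
```

In this simple setting the cancellation follows from inversion symmetry alone, so it holds for any choice of lattice constant in the sketch.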

Since  the  linear  term  in  (22.6)  vanishes,  the  first  nonvanishing  correction  to  the 
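equilibrium potential energy is given by the quadratic term. In the harmonic approximation
only this term is retained, and the potential energy is written as

$$U = U^{\mathrm{eq}} + U^{\mathrm{harm}}, \tag{22.8}$$

where $U^{\mathrm{eq}}$ is the equilibrium potential energy (22.2) and

$$U^{\mathrm{harm}} = \frac{1}{4} \sum_{\substack{\mathbf{R}\mathbf{R}' \\ \mu,\nu = x,y,z}} [u_\mu(\mathbf{R}) - u_\mu(\mathbf{R}')] \, \phi_{\mu\nu}(\mathbf{R} - \mathbf{R}') \, [u_\nu(\mathbf{R}) - u_\nu(\mathbf{R}')], \qquad \phi_{\mu\nu}(\mathbf{r}) = \frac{\partial^2 \phi(\mathbf{r})}{\partial r_\mu \, \partial r_\nu}. \tag{22.9}$$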


5 More precisely, if $\mathbf{u}(\mathbf{R}) - \mathbf{u}(\mathbf{R}')$ is small for all atom pairs with appreciable $\phi(\mathbf{R} - \mathbf{R}')$. The absolute
displacement  of  an  atom  can  be  large.  What  is  important  is  that  its  displacement  be  small  with  respect 
to  those  atoms  with  which  it  has  any  appreciable  interaction. 
Since $U^{\mathrm{eq}}$ is just a constant (i.e., independent of the u’s and P’s), it can be ignored
in many dynamical problems,6 and one frequently acts as if the total potential energy
were just $U^{\mathrm{harm}}$, dropping the superscript altogether when no ambiguity is likely to
result.

The harmonic approximation is the starting point for all theories of lattice dynamics
(except, perhaps, in solid helium). Further corrections to U, especially those
of third and fourth order in the u’s, are known as anharmonic terms, and are of
considerable  importance  in  understanding  many  physical  phenomena,  as  we  shall 
see  in  Chapter  25.  They  are  generally  treated  as  small  perturbations  on  the  dominant 
harmonic  term. 

The  harmonic  potential  energy  is  usually  written  not  in  the  form  (22.9),  but  in  the 
more  general  form 

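$$U^{\mathrm{harm}} = \frac{1}{2} \sum_{\substack{\mathbf{R}\mathbf{R}' \\ \mu\nu}} u_\mu(\mathbf{R}) \, D_{\mu\nu}(\mathbf{R} - \mathbf{R}') \, u_\nu(\mathbf{R}'). \tag{22.10}$$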

Evidently  (22.9)  has  this  structure,  with 

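$$D_{\mu\nu}(\mathbf{R} - \mathbf{R}') = \delta_{\mathbf{R},\mathbf{R}'} \sum_{\mathbf{R}''} \phi_{\mu\nu}(\mathbf{R} - \mathbf{R}'') - \phi_{\mu\nu}(\mathbf{R} - \mathbf{R}'). \tag{22.11}$$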
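
The equivalence of (22.9) and (22.10) can be checked numerically in a stripped-down setting. The sketch below is a one-dimensional toy, not the three-dimensional crystal of the text: N atoms on a ring with spacing a, a Lennard-Jones pair potential in reduced units, and interactions cut off after a few neighbors. All of these choices, and the function names, are our assumptions. Self-interaction terms (R'' = R) are excluded from the sums. In one dimension $\phi_{\mu\nu}$ reduces to the second derivative $\phi''$.

```python
import random


def lj_second_derivative(r, eps=1.0, sigma=1.0):
    """phi''(r) for phi(r) = 4 eps [(sigma/r)^12 - (sigma/r)^6]."""
    return 4.0 * eps * (156.0 * sigma**12 / r**14 - 42.0 * sigma**6 / r**8)


def neighbor_pairs(n_sites, cutoff):
    """Yield ordered pairs (i, j, m): site j is the m-th neighbor of site i on the ring.
    Requires n_sites > 2 * cutoff so that no pair is counted twice."""
    for i in range(n_sites):
        for m in range(1, cutoff + 1):
            yield i, (i + m) % n_sites, m
            yield i, (i - m) % n_sites, m


def force_constants(n_sites, a, cutoff):
    """One-dimensional analogue of (22.11)."""
    D = [[0.0] * n_sites for _ in range(n_sites)]
    for i, j, m in neighbor_pairs(n_sites, cutoff):
        k = lj_second_derivative(m * a)
        D[i][i] += k  # delta_{R,R'} * sum over R'' of phi''(R - R'')
        D[i][j] -= k  # - phi''(R - R')
    return D


def u_harm_pair_form(u, a, cutoff):
    """One-dimensional analogue of (22.9)."""
    return sum(
        0.25 * lj_second_derivative(m * a) * (u[i] - u[j]) ** 2
        for i, j, m in neighbor_pairs(len(u), cutoff)
    )


def u_harm_matrix_form(u, D):
    """One-dimensional analogue of (22.10)."""
    n = len(u)
    return 0.5 * sum(u[i] * D[i][j] * u[j] for i in range(n) for j in range(n))


random.seed(0)
n_sites, spacing, cutoff = 12, 1.12, 3  # illustrative values, not fitted to any solid
D = force_constants(n_sites, spacing, cutoff)
u = [random.uniform(-0.01, 0.01) for _ in range(n_sites)]
print(u_harm_pair_form(u, spacing, cutoff), u_harm_matrix_form(u, D))
```

The two printed numbers agree to rounding error. Each row of D also sums to zero, which reflects the fact that a uniform displacement of all the atoms leaves the pair form (22.9) unchanged.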


THE  ADIABATIC  APPROXIMATION 

Aside  from  the  fact  that  it  is  more  compact,  we  use  the  form  (22.10)  rather  than 
(22.9)  because  in  general  the  ion-ion  interaction  cannot  be  represented  as  a  simple 
sum  of  pair  interactions  of  the  form  (22.3).  Indeed,  except  in  particularly  simple  cases 
(such  as  the  noble  gases)  the  quantities  D  appearing  in  (22.10)  are  quite  difficult  to 
calculate.  In  ionic  crystals  the  difficulty  comes  from  the  long-range  nature  of  the 
Coulomb  interaction  between  ions.  In  covalent  crystals  and  metals  the  difficulty  is 
a  more  profound  one,  for  there  the  ionic  motion  is  inextricably  coupled  to  the  motion 
of  the  valence  electrons.  This  is  because  in  covalent  crystals  and  metals  the  electronic 
arrangement,  and  hence  the  contribution  of  the  valence  electrons  to  the  total  energy 
of  the  solid,  depends  in  detail  on  the  particular  arrangement  of  the  ion  cores.  Thus 
when the solid is deformed by displacing the ion cores from their equilibrium positions,
the electronic wave functions will be deformed as well, in a way that can be
quite  difficult  to  deduce  with  any  precision.7 

One  copes  with  this  problem  by  making  the  so-called  adiabatic  approximation. 
The  adiabatic  approximation  is  based  on  the  fact  that  typical  electronic  velocities 
are  much  greater  than  typical  ionic  velocities.  As  we  saw  in  Chapter  2,  the  significant 
electronic velocity is $v_F \approx 10^8$ cm/sec. On the other hand, typical ionic velocities


6  Of  course,  it  cannot  always  be  ignored.  As  we  have  seen  in  Chapter  20,  it  is  of  crucial  importance 
in  determining  the  absolute  energy  of  the  crystal,  its  equilibrium  size,  or  its  equilibrium  compressibility. 

7 This can be a problem even in ionic crystals. The outermost core electrons can be sufficiently weakly
bound for the ions to undergo significant polarization when displaced from their equilibrium positions. A
theory that takes this into account is known as the shell model (not to be confused with the shell model
of nuclear physics). See Chapter 27.
are at most of order $10^5$ cm/sec.8 One therefore assumes that because the ions move
so slowly on the scale of velocities of relevance to the electrons, at any moment the
electrons will be in their ground state for that particular instantaneous ionic configuration.
In computing the force constants appearing in (22.10), one must then
supplement  the  interaction  between  ion  cores  with  terms  representing  the  dependence 
of  the  additional  electronic  energy  on  the  instantaneous  ionic  configuration  specified 
by  the  u(R).  In  practice,  this  can  be  quite  difficult  to  do,  and  a  more  practical  approach 
is  to  consider  the  quantities  D  as  empirical  parameters,  to  be  measured  directly  by 
experiment  (Chapter  24).9 


SPECIFIC  HEAT  OF  A  CLASSICAL  CRYSTAL:  THE  LAW 
OF  DULONG  AND  PETIT 

Having abandoned the static lattice approximation, we can no longer evaluate equilibrium
properties (as we did in Chapter 20) by simply assuming that each ion sits
quietly at its Bravais lattice site R. We must now average over all possible ionic
configurations, giving each configuration or state a weight proportional to $e^{-E/k_BT}$,
where E is the energy of the configuration.10 Thus if we treat the crystal classically,
its  thermal  energy  density  is  given  by 


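$$u = \frac{1}{V} \frac{\int d\Gamma \, e^{-\beta H} H}{\int d\Gamma \, e^{-\beta H}}, \qquad \beta = \frac{1}{k_B T}, \tag{22.12}$$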


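where we have used a compact notation in which $d\Gamma$ stands for the volume element
in crystal phase space:

$$d\Gamma = \prod_{\mathbf{R}} d\mathbf{u}(\mathbf{R}) \, d\mathbf{P}(\mathbf{R}) = \prod_{\mathbf{R},\mu} du_\mu(\mathbf{R}) \, dP_\mu(\mathbf{R}). \tag{22.13}$$

We can also write (22.12) in the more useful form:

$$u = -\frac{1}{V} \frac{\partial}{\partial \beta} \ln \int d\Gamma \, e^{-\beta H}, \tag{22.14}$$

as can be verified by explicitly differentiating the logarithm in (22.14).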


8 This will emerge in the subsequent analysis. We shall find that typical ionic vibration frequencies
are at most of the order of $0.01\,\mathcal{E}_F/\hbar$. Since the amplitude of the ionic vibrations is small compared with
the size of the unit cell, $a = O(1/k_F)$, the ionic velocity is of order less than $0.01\,\mathcal{E}_F/\hbar k_F \approx 0.01\,v_F$.

9 There is, however, a highly developed theory of how to compute D for metals. See Chapter 26.

10  This  is  the  fundamental  rule  of  equilibrium  statistical  mechanics.  It  applies  whether  the  system 
under consideration is treated classically or quantum-mechanically, provided that the states under discussion
are states of the full N-particle system (i.e., not single particle levels). By a classical state we mean
a specified set of values for the 3N canonical coordinates u(R) and 3N canonical momenta P(R), i.e., a
point in phase space. By a quantum state we mean a stationary-state solution to the N-particle Schrödinger
equation: $H\Psi = E\Psi$.
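In the harmonic approximation the temperature dependence of the integral appearing
in (22.14) is easily extracted by making the change of variables:

$$\begin{aligned}
\mathbf{u}(\mathbf{R}) &= \beta^{-1/2} \bar{\mathbf{u}}(\mathbf{R}), & d\mathbf{u}(\mathbf{R}) &= \beta^{-3/2} \, d\bar{\mathbf{u}}(\mathbf{R}), \\
\mathbf{P}(\mathbf{R}) &= \beta^{-1/2} \bar{\mathbf{P}}(\mathbf{R}), & d\mathbf{P}(\mathbf{R}) &= \beta^{-3/2} \, d\bar{\mathbf{P}}(\mathbf{R}).
\end{aligned} \tag{22.15}$$

The integral in (22.14) can then be written as

$$\begin{aligned}
\int d\Gamma \, e^{-\beta H} &= \int d\Gamma \, \exp\left[-\beta\left(\sum_{\mathbf{R}} \frac{P(\mathbf{R})^2}{2M} + U^{\mathrm{eq}} + U^{\mathrm{harm}}\right)\right] \\
&= e^{-\beta U^{\mathrm{eq}}} \beta^{-3N} \left\{ \int \prod_{\mathbf{R}} d\bar{\mathbf{u}}(\mathbf{R}) \, d\bar{\mathbf{P}}(\mathbf{R}) \, \exp\left[ -\sum_{\mathbf{R}} \frac{1}{2M} \bar{P}(\mathbf{R})^2 - \frac{1}{2} \sum \bar{u}_\mu(\mathbf{R}) \, D_{\mu\nu}(\mathbf{R} - \mathbf{R}') \, \bar{u}_\nu(\mathbf{R}') \right] \right\}.
\end{aligned} \tag{22.16}$$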


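The entire integral appearing in braces in (22.16) is independent of temperature, and
therefore makes no contribution to the $\beta$-derivative when (22.16) is substituted into
(22.14). The thermal energy therefore reduces simply to:

$$u = -\frac{1}{V} \frac{\partial}{\partial \beta} \ln\left(e^{-\beta U^{\mathrm{eq}}} \beta^{-3N} \times \text{const}\right) = \frac{U^{\mathrm{eq}}}{V} + \frac{3N}{V} k_B T, \tag{22.17}$$

or

$$u = u^{\mathrm{eq}} + 3 n k_B T. \tag{22.18}$$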


Note that this reduces to the result $u = u^{\mathrm{eq}}$ of static lattice theory at T = 0 (as we
expect from classical theory, which ignores zero-point motion). At nonzero temperatures
the static result is corrected by the simple additive term $3nk_BT$. Since $k_BT$ is
only a few hundredths of an electron volt, even at room temperature, this is generally
a small correction. It is far more useful to consider the specific heat, $c_v = (\partial u/\partial T)_v$
(which is also much more easily measured than the internal energy). The static lattice
contribution to u drops out of $c_v$, which is determined entirely by the temperature-dependent
correction:12

$$c_v = \frac{\partial u}{\partial T} = 3 n k_B. \tag{22.19}$$
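
The route from (22.14) to (22.17) can be checked numerically for the simplest possible case, a single one-dimensional harmonic oscillator with one coordinate and one momentum. The sketch below is a hedged illustration in arbitrary units with $k_B = 1$. The mass, spring constant, integration range, and function names are our assumptions. It evaluates $\ln \int d\Gamma \, e^{-\beta H}$ by a midpoint rule and differentiates with respect to $\beta$ by a central difference.

```python
import math


def log_partition(beta, mass=1.5, spring=3.0, half_width=12.0, n_points=4000):
    """ln of the integral of exp(-beta H) over u and P, for H = P^2/2M + K u^2/2."""
    step = 2.0 * half_width / n_points
    z_u = 0.0
    z_p = 0.0
    for i in range(n_points):
        x = -half_width + (i + 0.5) * step  # midpoint of the i-th cell
        z_u += math.exp(-beta * 0.5 * spring * x * x) * step
        z_p += math.exp(-beta * x * x / (2.0 * mass)) * step
    # The integrand factorizes, so the double integral is a product.
    return math.log(z_u * z_p)


def thermal_energy(beta, h=1e-3):
    """-d(ln Z)/d(beta), the single-oscillator analogue of (22.14)."""
    return -(log_partition(beta + h) - log_partition(beta - h)) / (2.0 * h)


for beta in (0.5, 1.0, 2.0):
    print(beta, thermal_energy(beta), 1.0 / beta)
```

One coordinate and one momentum contribute $k_BT$ in total, independent of M and K. With 3N coordinates and 3N momenta the same counting gives the $3Nk_BT$ of (22.17).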


11 When it is necessary to distinguish the number of ions per unit volume from the number of conduction
electrons per unit volume we shall use subscripts ($n_i$ or $n_e$). In simple metals $n_e = Zn_i$, where Z
is the valence.

12 Experiments measure the specific heat at constant pressure, $c_p$, but we have calculated the specific
heat at constant volume, $c_v$. In a gas these differ substantially, but in a solid they are almost identical.
This is seen most intuitively from the thermodynamic identity: $c_p/c_v = (\partial P/\partial V)_S/(\partial P/\partial V)_T$. The two
specific heats differ to the extent that the adiabatic and isothermal compressibilities differ. Since $u^{\mathrm{eq}}$ is
the dominant term in the internal energy of a solid, thermal considerations are of little consequence in
determining the compressibility. Thus the work necessary to compress a solid by a definite amount depends
very little on whether the solid is thermally insulated (adiabatic) or in contact with a heat bath at temperature
T (isothermal) during the compression. Ordinarily the two specific heats differ by less than a
percent at room temperature, and substantially less than a percent at lower temperatures.
This result, that the specific heat due to the lattice vibrations (i.e., the entire
specific heat of an insulator) is just $3k_B$ per ion, is known as the law of Dulong and
Petit. In a monatomic solid in which there are $6.022 \times 10^{23}$ ions per mole, it is
most commonly encountered in the form:13

$$c_v^{\mathrm{molar}} = 5.96 \ \text{cal/mole-K}. \tag{22.20}$$
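
The arithmetic behind (22.20) is short enough to spell out. The sketch below multiplies $3k_B$ by the number of ions per mole, using the constants quoted in the text and in footnote 13. The conversion to joules (1 erg = $10^{-7}$ J) and the function name are our additions.

```python
IONS_PER_MOLE = 6.022e23   # as quoted in the text
K_B_ERG_PER_K = 1.38e-16   # footnote 13
ERG_PER_CAL = 4.184e7      # footnote 13
JOULE_PER_ERG = 1.0e-7     # standard conversion (our addition)


def dulong_petit_molar():
    """Return the Dulong and Petit molar specific heat in (cal/mole-K, J/mole-K)."""
    c_erg = 3.0 * IONS_PER_MOLE * K_B_ERG_PER_K
    return c_erg / ERG_PER_CAL, c_erg * JOULE_PER_ERG


c_cal, c_joule = dulong_petit_molar()
print(round(c_cal, 2), round(c_joule, 2))
```

The second number is the same quantity in joules/mole-K, the unit in which measured specific heats are often plotted.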

In  Figure  22.3  we  display  the  measured  specific  heats  of  solid  argon,  krypton,  and 
xenon.  At  temperatures  of  the  order  of  100  K  and  above,  the  measured  specific  heats 
are  quite  close  to  the  Dulong  and  Petit  value.  However: 

1.  As  the  temperature  drops,  the  specific  heat  falls  well  below  the  Dulong  and  Petit 
value,  heading  toward  zero  at  zero  temperature. 

2.  Even  as  the  temperature  gets  large,  it  seems  fairly  clear  that  the  curve  is  not 
approaching  the  precise  Dulong  and  Petit  value. 


Figure 22.3

Measured specific heats (in joules/mole-K) of argon, xenon, and krypton. The horizontal
line is the classical Dulong and Petit value. (Quoted in
M. L. Klein, G. K. Horton, and J. L. Feldman, Phys. Rev. 184,
68 (1969).)

Point  2  can  be  explained  on  purely  classical  grounds  as  a  failure  of  the  harmonic 
approximation.  According  to  classical  theory,  at  very  low  temperatures  the  thermal 
energies  are  simply  insufficient  to  permit  an  ion  to  wander  any  appreciable  distance 
from  its  equilibrium  site,  and  the  harmonic  approximation  becomes  very  good  as  the 


13 $k_B = 1.38 \times 10^{-16}$ erg/K; $4.184 \times 10^7$ ergs = 1 cal.
temperature drops.14 However, at higher temperatures the ions possess sufficient
energy to stray far enough from their equilibrium positions for the neglected anharmonic
terms (i.e., terms beyond the quadratic ones in the expansion of U in powers
of the ionic displacements u) to become important. Thus classical statistical mechanics
indicates that the law of Dulong and Petit need not be perfectly obeyed at high
temperatures, but should become more and more closely followed as the temperature
drops.
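
This classical argument can be illustrated with a toy model. The sketch below computes the classical specific heat of a single one-dimensional oscillator whose potential contains a quartic term in addition to the harmonic one, in arbitrary units with $k_B = 1$. The potential $Kx^2/2 + gx^4$, the parameter values, and the function names are hypothetical choices made only for illustration. Real crystals also have cubic terms, and the sign and size of the deviation need not resemble this toy.

```python
import math


def mean_energy(kT, spring=1.0, quartic=1.0, half_width=8.0, n_points=20000):
    """Classical thermal average of H = P^2/2M + K x^2/2 + g x^4 in one dimension."""
    beta = 1.0 / kT
    step = 2.0 * half_width / n_points
    weight_sum = 0.0
    energy_sum = 0.0
    for i in range(n_points):
        x = -half_width + (i + 0.5) * step  # midpoint rule
        v = 0.5 * spring * x * x + quartic * x**4
        w = math.exp(-beta * v)
        weight_sum += w
        energy_sum += w * v
    # The kinetic term is quadratic in P and contributes exactly kT/2.
    return 0.5 * kT + energy_sum / weight_sum


def specific_heat(kT, **kwargs):
    """d<H>/dT by a central difference (in units of k_B)."""
    d = 1e-3 * kT
    return (mean_energy(kT + d, **kwargs) - mean_energy(kT - d, **kwargs)) / (2.0 * d)


print(specific_heat(1.0, quartic=0.0))  # purely harmonic: 1 at every temperature
print(specific_heat(0.01))              # low T: close to the harmonic value
print(specific_heat(10.0))              # high T: noticeably different from 1
```

In this toy model the specific heat per oscillator approaches the harmonic value as the temperature drops and departs from it as the temperature rises, which is the classical behavior described above.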

Thus  the  low-temperature  behavior  (point  1)  is  completely  inexplicable  classically. 
The  quantum  theory  is  required  to  account  for  the  low-temperature  specific  heat  of 
the lattice and, except at rather high temperatures (of order $10^2$ K, judging from
Figure  22.3),  we  cannot  expect  to  get  very  far  in  a  theory  of  lattice  dynamics  that 
adheres  to  a  purely  classical  picture.15  We  must  therefore  turn  to  a  quantum  theory 
of  lattice  dynamics  to  explain  physical  phenomena  governed  by  the  lattice  vibrations. 

In  spite  of  this  blatant  failure  of  classical  mechanics,  however,  it  is  essential  to 
understand  the  classical  theory  of  lattice  vibrations  before  attempting  to  construct 
a  quantum  theory.  The  reason  for  this  lies  in  the  quadratic  structure  of  the  harmonic 
Hamiltonian.  Because  it  is  quadratic  in  the  displacements  u(R)  and  momenta  P(R)  it 
represents  a  special  case  of  the  general  classical  problem  of  small  oscillations,  which 
can  be  solved  exactly.16  In  the  solution  a  general  motion  of  the  N  ions  is  represented 
as a superposition (or linear combination) of 3N normal modes of vibration, each
with its own characteristic frequency $\nu$. But it is a basic result of the quantum theory
that the allowed energies of an oscillator with frequency $\nu$ are given by

$$\left(n + \tfrac{1}{2}\right) h\nu, \quad n = 0, 1, 2, \ldots \tag{22.21}$$

The generalization of this result to 3N independent oscillators is the obvious one.
The allowed energies of the 3N oscillator system are given by assigning to each
oscillator a half-integral multiple of its frequency times h, and adding up the
contributions from each oscillator. In the case of the harmonic crystal the frequencies of
the 3N normal modes provide such a set of frequencies, out of which all the energy
levels of the crystal can be constructed.17
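The rule for assembling energy levels from independent oscillators can be sketched in a few lines of Python. This is only an illustration: the function name `crystal_energy`, the three sample frequencies, and the use of the SI value of Planck's constant are assumptions made here, not part of the text.

```python
H_PLANCK = 6.62607015e-34  # Planck's constant in J s (SI value; an assumption of this sketch)

def crystal_energy(frequencies, occupations):
    """Sum of (n_i + 1/2) * h * nu_i over independent oscillators, as in (22.21)."""
    if len(frequencies) != len(occupations):
        raise ValueError("need one occupation number per oscillator")
    if any(n < 0 or n != int(n) for n in occupations):
        raise ValueError("occupation numbers must be 0, 1, 2, ...")
    return sum((n + 0.5) * H_PLANCK * nu
               for nu, n in zip(frequencies, occupations))

nus = [1.0e12, 2.0e12, 3.0e12]            # hypothetical mode frequencies in Hz
ground = crystal_energy(nus, [0, 0, 0])   # every oscillator in its lowest level
excited = crystal_energy(nus, [1, 0, 2])  # one quantum in mode 1, two in mode 3
```

The difference `excited - ground` is h times (1 x 1.0e12 + 2 x 3.0e12) Hz, so each level of the whole system is labeled by one nonnegative integer per mode.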

14 In fact, the harmonic approximation becomes asymptotically exact as $T \to 0$ in a classical theory,
for at T = 0 (infinite $\beta$) only values of the u that give an absolute minimum to the energy (i.e., u(R) = 0)
contribute to the exact integral (22.12). At sufficiently small T only u(R) in the immediate vicinity of 0
will give any appreciable contribution. Thus at sufficiently small T the exact Hamiltonian is equal to its
harmonic approximation at all values of u that contribute appreciably to the integral. On the other hand,
it is also true that at very low temperatures only very small values of the u(R) give any appreciable
contribution to the integral (22.16) in which the Hamiltonian has been replaced by its harmonic approximation.
Thus in both the exact integral (22.12) and its harmonic approximation (22.16) the integrands are
appreciable at low temperatures only where they agree.

15 A similar problem arose in connection with the electronic contribution to the specific heat of a
metal, where the classical result $(3/2)k_B$ per electron fails at temperatures below the Fermi temperature.

16 See any text on classical mechanics.

17 We shall state this more precisely at the beginning of Chapter 23. A summary of the detailed
quantum-mechanical proof is given in Appendix L.

An analysis of the classical normal modes of a lattice of ions is therefore of great
utility, even though a purely classical theory of lattice vibrations is clearly inadequate.
We must examine the classical normal modes of the crystal before we can correct the
law of Dulong and Petit and go on to describe a variety of other properties of the
dynamical lattice. The rest of this chapter is therefore devoted to a study of the
classical harmonic crystal. We approach the problem in the following stages:

1.  Normal  modes  of  a  one-dimensional  monatomic  Bravais  lattice. 

2.  Normal  modes  of  a  one-dimensional  lattice  with  a  basis. 

3.  Normal  modes  of  a  three-dimensional  monatomic  Bravais  lattice. 

4.  Normal  modes  of  a  three-dimensional  lattice  with  a  basis. 

In principle the analysis is the same in all four cases, but the purely notational
complexities of the most general case (4) tend to obscure important physical aspects,
which are revealed more clearly in the simpler cases.

We  conclude  the  chapter  by  relating  this  analysis  to  the  classical  theory  of  an 
elastic  continuous  medium. 

NORMAL  MODES  OF  A  ONE-DIMENSIONAL 
MONATOMIC  BRAVAIS  LATTICE 

Figure 22.4

At any instant, the ion whose equilibrium position is na is displaced from
equilibrium by an amount u(na).

Consider a set of ions of mass M distributed along a line at points separated by a
distance a, so that the one-dimensional Bravais lattice vectors are just R = na, for
integral n. Let u(na) be the displacement along the line, from its equilibrium position,
of the ion that oscillates about na (Figure 22.4). For simplicity we assume that only
neighboring ions interact, so we may take the harmonic potential energy (22.9) to
have the form

$$U^{\text{harm}} = \frac{1}{2} K \sum_n \left[ u(na) - u([n+1]a) \right]^2 \tag{22.22}$$

(where $K = \phi''(a)$, $\phi(x)$ being the interaction energy of two ions a distance x apart
along the line). The equations of motion are then

$$M\ddot{u}(na) = -\frac{\partial U^{\text{harm}}}{\partial u(na)} = -K\left[ 2u(na) - u([n-1]a) - u([n+1]a) \right]. \tag{22.23}$$

These are precisely the equations that would be obeyed if each ion were connected
to its two neighbors by perfect massless springs with spring constant K (and equilibrium
length a, although the equations are in fact independent of the equilibrium
length of the spring). The resulting motion is most easily visualized in terms of such
a model (Figure 22.5).

Figure 22.5

If only nearest-neighbor forces are kept, the harmonic approximation for the
one-dimensional Bravais lattice describes a model in which each ion is tied to its
neighbors by perfect springs.

If the chain has only a finite number, N, of ions, then we must specify how the ions
at the two ends are to be described. We could take them to interact only with their
neighbors on the interior side, but this would complicate the analysis without materially
altering the final results. For if N is large, and if we are not interested in end
effects, then the precise way in which the ions at the ends are treated is immaterial,
and we may choose the approach on grounds of mathematical convenience. As in
the case of the electron gas (Chapter 2) by far the most convenient choice is the
Born-von Karman periodic boundary condition. In the linear chain of ions this
boundary condition is easily specified: We simply join the two remote ends of the
chain back together by one more of the same springs that connect internal ions
(Figure 22.6). If we take the ions to occupy sites a, 2a, ..., Na, then we can use Eq.
(22.23) to describe each of the N ions (n = 1, 2, ..., N), provided that we interpret
the quantities u([N + 1]a) and u(0) that occur in the equations of motion for u(Na)
and u(a), respectively, as18

$$u([N+1]a) = u(a); \qquad u(0) = u(Na). \tag{22.24}$$


Figure  22.6 

The  Born-von  Karman  or  periodic  boundary  condition  for  the  linear  chain. 
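A short numerical check may help make the periodic boundary condition concrete. The Python sketch below compares the force on each ion obtained by differentiating the nearest-neighbor harmonic energy with the closed form $-K[2u(na) - u([n-1]a) - u([n+1]a)]$, closing the chain into a ring by taking site indices modulo N. The function names, the five sample displacements, and the value of K are illustrative assumptions; sites are numbered 0, ..., N - 1 here rather than 1, ..., N.

```python
def harmonic_energy(u, K):
    """(K/2) * sum_n [u_n - u_{n+1}]^2, with u_N identified with u_0."""
    N = len(u)
    return 0.5 * K * sum((u[n] - u[(n + 1) % N]) ** 2 for n in range(N))

def force(u, n, K):
    """Closed form -K[2u_n - u_{n-1} - u_{n+1}], indices taken modulo N."""
    N = len(u)
    return -K * (2 * u[n] - u[(n - 1) % N] - u[(n + 1) % N])

def numerical_force(u, n, K, h=1e-6):
    """Central-difference estimate of minus the derivative of the energy."""
    up = list(u)
    up[n] += h
    dn = list(u)
    dn[n] -= h
    return -(harmonic_energy(up, K) - harmonic_energy(dn, K)) / (2 * h)

K = 3.0                                 # spring constant, arbitrary units
u = [0.10, -0.05, 0.02, 0.07, -0.03]    # displacements of N = 5 ions
max_error = max(abs(force(u, n, K) - numerical_force(u, n, K))
                for n in range(len(u)))
total_force = sum(force(u, n, K) for n in range(len(u)))
```

Because the energy is quadratic, the central difference agrees with the closed form up to rounding, and the forces on the ring sum to zero.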


Figure 22.7

An alternative representation of the Born-von Karman boundary condition. The object
connecting the ion on the extreme left with the spring on the extreme right is a massless rigid rod
of length L = Na.


18 An alternative interpretation of the Born-von Karman boundary condition is to consider not
the deformation of the chain into a loop, but an explicit mechanical constraint forcing ion N to interact
with ion 1 via a spring of spring constant K (Figure 22.7). This picture is probably more helpful in
interpreting the boundary condition in three dimensions, and is especially useful to keep in mind when
considering questions involving the total momentum of a finite crystal, or the question of why a crystal
assumes the equilibrium size it does.

We seek solutions to (22.23) of the form:

$$u(na, t) \propto e^{i(kna - \omega t)}. \tag{22.25}$$

The periodic boundary condition (22.24) requires that

$$e^{ikNa} = 1, \tag{22.26}$$

which in turn requires k to have the form:

$$k = \frac{2\pi}{a} \frac{n}{N}, \quad n \text{ an integer}. \tag{22.27}$$


Note that if k is changed by $2\pi/a$, the displacement u(na) defined by (22.25) is
unaffected. Consequently, there are just N values of k consistent with (22.27) that yield
distinct solutions. We take them to be the values lying between $-\pi/a$ and $\pi/a$.19
Upon substituting (22.25) into (22.23) we find that

$$-M\omega^2 e^{i(kna - \omega t)} = -K\left[ 2 - e^{-ika} - e^{ika} \right] e^{i(kna - \omega t)} = -2K(1 - \cos ka)\, e^{i(kna - \omega t)}, \tag{22.28}$$

and thus we have a solution for a given k, provided that $\omega = \omega(k)$, where

$$\omega(k) = \sqrt{\frac{2K(1 - \cos ka)}{M}} = 2\sqrt{\frac{K}{M}}\, \left| \sin \tfrac{1}{2} ka \right|. \tag{22.29}$$
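The substitution leading to (22.29) can be checked numerically. The following Python sketch lists the N allowed wave vectors of (22.27), taken in the interval $-\pi/a < k \le \pi/a$, and evaluates the residual of the equation of motion (22.23) for the trial wave (22.25) with $\omega$ given by (22.29). The helper names and the parameter values (K, M, a, N, and the sample time t) are assumptions chosen only for illustration.

```python
import cmath
import math

def allowed_wave_vectors(N, a):
    """The N values k = 2*pi*n/(N*a) of (22.27) with -pi/a < k <= pi/a."""
    n_min = -((N - 1) // 2)
    return [2.0 * math.pi * n / (N * a) for n in range(n_min, n_min + N)]

def omega(k, K, M, a):
    """Dispersion relation (22.29)."""
    return 2.0 * math.sqrt(K / M) * abs(math.sin(0.5 * k * a))

def residual(k, n, K, M, a, t=0.3):
    """M*u'' + K[2u(na) - u([n-1]a) - u([n+1]a)] for the trial wave (22.25)."""
    w = omega(k, K, M, a)

    def u(m):
        return cmath.exp(1j * (k * m * a - w * t))

    acceleration = -w * w * u(n)   # second time derivative of exp(-i*w*t)
    return M * acceleration + K * (2.0 * u(n) - u(n - 1) - u(n + 1))

K, M, a, N = 2.0, 1.5, 1.0, 12   # illustrative values, arbitrary units
ks = allowed_wave_vectors(N, a)
worst = max(abs(residual(k, n, K, M, a)) for k in ks for n in range(1, N + 1))
periodic = all(abs(cmath.exp(1j * k * N * a) - 1.0) < 1e-9 for k in ks)
```

The residual vanishes to rounding accuracy for every allowed k and every site, and each k satisfies the periodicity condition (22.26).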


The solutions describing the actual ionic displacements are given by the real or
imaginary parts of (22.25):

$$u(na, t) \propto \left\{ \begin{array}{l} \cos(kna - \omega t) \\ \sin(kna - \omega t) \end{array} \right\}. \tag{22.30}$$

Since $\omega$ is an even function of k, it suffices to take only the positive root in (22.29),
for the solutions (22.30) determined by k and $-\omega(k)$ are identical to those determined
by -k and $\omega(k) = \omega(-k)$. We thus have N distinct values of k, each with a unique
frequency $\omega(k)$, so Eq. (22.30) yields 2N independent solutions.20 An arbitrary motion
of the chain is determined by specifying the N initial positions and N initial velocities
of the ions. Since these can always be fit with a linear combination of the 2N
independent solutions (22.30), we have found a complete solution to the problem.

The solutions (22.30) describe waves propagating along the chain with phase
velocity $c = \omega/k$, and group velocity $v = d\omega/dk$. The frequency $\omega$ is plotted against
the wave vector k in Figure 22.8. Such a curve is known as a dispersion curve. When
k is small compared with $\pi/a$ (i.e., when the wavelength is large compared with the
interparticle spacing), $\omega$ is linear in k:

$$\omega = \left( a \sqrt{\frac{K}{M}} \right) |k|. \tag{22.31}$$

This is the type of behavior we are accustomed to in the cases of light waves and
ordinary sound waves. If $\omega$ is linear in k, then the group velocity is the same as the
phase velocity, and both are independent of frequency. One of the characteristic
features of waves in discrete media, however, is that the linearity ceases to hold at
wavelengths short enough to be comparable to the interparticle spacing. In the
present case $\omega$ falls below ck as k increases, and the dispersion curve actually becomes
flat (i.e., the group velocity drops to zero) when k reaches $\pm\pi/a$.

Figure 22.8

Dispersion curve for a monatomic linear chain with only nearest-neighbor
interactions. Note that $\omega$ is linear for small k, and that $d\omega/dk$ vanishes at the boundaries
of the zone ($k = \pm\pi/a$).

19 This is just the one-dimensional version of requiring k to lie in the first Brillouin zone (Chapter 8).

20 Although there are 2N solutions, there are only N "normal modes", for the sine solution is simply
the cosine solution, shifted in time by $\pi/2\omega$.
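The two limits described above can be illustrated with a brief Python sketch. It evaluates the phase velocity $\omega/k$ and the group velocity $d\omega/dk = a\sqrt{K/M}\cos\tfrac{1}{2}ka$ (obtained by differentiating (22.29) for $0 < k \le \pi/a$) at a small k and at the zone boundary. The parameter values and variable names are assumptions for illustration only.

```python
import math

def omega(k, K, M, a):
    """Dispersion relation (22.29) for the monatomic chain."""
    return 2.0 * math.sqrt(K / M) * abs(math.sin(0.5 * k * a))

def group_velocity(k, K, M, a):
    """d(omega)/dk for 0 < k <= pi/a, from differentiating (22.29)."""
    return a * math.sqrt(K / M) * math.cos(0.5 * k * a)

K, M, a = 2.0, 1.5, 1.0                 # illustrative values, arbitrary units
c = a * math.sqrt(K / M)                # long-wavelength slope from (22.31)
k_small = 1e-3 * math.pi / a            # wavelength much longer than a
k_edge = math.pi / a                    # zone boundary

phase_small = omega(k_small, K, M, a) / k_small
group_small = group_velocity(k_small, K, M, a)
group_edge = group_velocity(k_edge, K, M, a)
```

At small k both velocities agree with c to within a few parts in a million, while at the zone boundary the group velocity vanishes and $\omega$ lies below ck.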

If we drop the assumption that only nearest neighbors interact, very little changes
in these results. The functional dependence of $\omega$ on k becomes more complex, but
we continue to find N normal modes of the form (22.25) for the N allowed values of
k. Furthermore, the angular frequency $\omega(k)$ remains linear in k for k small compared
with $\pi/a$, and satisfies $d\omega/dk = 0$ at $k = \pm\pi/a$.21


NORMAL  MODES  OF  A  ONE-DIMENSIONAL  LATTICE 
WITH  A  BASIS 

We consider next a one-dimensional Bravais lattice with two ions per primitive cell,
with equilibrium positions na and na + d. We take the two ions to be identical, but
take $d \le a/2$, so the force between neighboring ions depends on whether their
separation is d or a - d (Figure 22.9).22 For simplicity we again assume that only nearest
neighbors interact, with a force that is stronger for pairs separated by d than for pairs
separated by a - d (since a - d exceeds d). The harmonic potential energy (22.9) can
then be written:

$$U^{\text{harm}} = \frac{K}{2} \sum_n \left[ u_1(na) - u_2(na) \right]^2 + \frac{G}{2} \sum_n \left[ u_2(na) - u_1([n+1]a) \right]^2, \tag{22.32}$$

where we have written $u_1(na)$ for the displacement of the ion that oscillates about the
site na, and $u_2(na)$ for the displacement of the ion that oscillates about na + d. In
keeping with our choice $d \le a/2$, we also take $K \ge G$.

Figure 22.9

The diatomic linear chain of identical atoms, connected by springs of alternating strengths
(K-springs and G-springs).

21 See Problem 1. These conclusions are correct provided that the interaction is of finite range, i.e.,
provided that an ion interacts only with its first through mth nearest neighbors, where m is a fixed integer
(independent of N). If the interaction has infinitely long range, then it must fall off faster than the inverse
cube of the interionic distance (in one dimension) if the frequencies are to be linear in k for small k.

22 An equally instructive problem arises when the forces between all neighboring ionic pairs are
identical, but the ionic mass alternates between $M_1$ and $M_2$ along the chain. See Problem 2.

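The equations of motion are

$$M\ddot{u}_1(na) = -\frac{\partial U^{\text{harm}}}{\partial u_1(na)} = -K\left[ u_1(na) - u_2(na) \right] - G\left[ u_1(na) - u_2([n-1]a) \right],$$

$$M\ddot{u}_2(na) = -\frac{\partial U^{\text{harm}}}{\partial u_2(na)} = -K\left[ u_2(na) - u_1(na) \right] - G\left[ u_2(na) - u_1([n+1]a) \right]. \tag{22.33}$$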

We again seek a solution representing a wave with angular frequency $\omega$ and wave
vector k:

$$u_1(na) = \epsilon_1 e^{i(kna - \omega t)},$$

$$u_2(na) = \epsilon_2 e^{i(kna - \omega t)}. \tag{22.34}$$

Here $\epsilon_1$ and $\epsilon_2$ are constants to be determined, whose ratio will specify the relative
amplitude and phase of the vibration of the ions within each primitive cell. As in the
monatomic case, the Born-von Karman periodic boundary condition again leads
to the N nonequivalent values of k given by (22.27).

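If we substitute (22.34) into (22.33) and cancel a common factor of $e^{i(kna - \omega t)}$ from
both equations, we are left with two coupled equations:

$$\left[ M\omega^2 - (K + G) \right] \epsilon_1 + \left( K + G e^{-ika} \right) \epsilon_2 = 0,$$

$$\left( K + G e^{ika} \right) \epsilon_1 + \left[ M\omega^2 - (K + G) \right] \epsilon_2 = 0. \tag{22.35}$$

This pair of homogeneous equations will have a solution, provided that the
determinant of the coefficients vanishes:

$$\left[ M\omega^2 - (K + G) \right]^2 = \left| K + G e^{-ika} \right|^2 = K^2 + G^2 + 2KG \cos ka. \tag{22.36}$$

Equation (22.36) holds for two positive values of $\omega$ satisfying

$$\omega^2 = \frac{K + G}{M} \pm \frac{1}{M} \sqrt{K^2 + G^2 + 2KG \cos ka}, \tag{22.37}$$

with

$$\frac{\epsilon_2}{\epsilon_1} = \mp \frac{K + G e^{ika}}{\left| K + G e^{ika} \right|}. \tag{22.38}$$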
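As a consistency check on (22.35) through (22.38), the Python sketch below computes both roots of (22.37), forms the amplitude ratio (22.38), and substitutes them back into the coupled equations (22.35). The function names and the values of K, G, M, a and the sample wave vectors are assumptions made for this illustration.

```python
import cmath
import math

def branches(k, K, G, M, a):
    """(omega_acoustic, omega_optical) from (22.37); max() absorbs roundoff."""
    root = math.sqrt(max(K * K + G * G + 2.0 * K * G * math.cos(k * a), 0.0))
    acoustic_sq = max((K + G - root) / M, 0.0)
    return math.sqrt(acoustic_sq), math.sqrt((K + G + root) / M)

def amplitude_ratio(k, K, G, a, optical):
    """epsilon_2/epsilon_1 from (22.38): upper (minus) sign for the optical root."""
    z = K + G * cmath.exp(1j * k * a)
    sign = -1.0 if optical else 1.0
    return sign * z / abs(z)

def residuals(k, K, G, M, a, optical):
    """Magnitudes of the left-hand sides of (22.35), taking epsilon_1 = 1."""
    w = branches(k, K, G, M, a)[1 if optical else 0]
    e1, e2 = 1.0, amplitude_ratio(k, K, G, a, optical)
    diag = M * w * w - (K + G)
    r1 = diag * e1 + (K + G * cmath.exp(-1j * k * a)) * e2
    r2 = (K + G * cmath.exp(1j * k * a)) * e1 + diag * e2
    return abs(r1), abs(r2)

K, G, M, a = 3.0, 1.0, 2.0, 1.0   # illustrative values with K > G
worst = max(max(residuals(k, K, G, M, a, opt))
            for k in (0.3, 1.1, 2.5, -2.0) for opt in (False, True))
w_ac, w_op = branches(1.1, K, G, M, a)
```

Both residuals vanish to rounding accuracy on each root, which confirms that the upper sign in (22.37) goes with the upper sign in (22.38).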

For each of the N values of k there are thus two solutions, leading to a total of
2N normal modes, as is appropriate to the 2N degrees of freedom (two ions in each
of N primitive cells). The two $\omega$ vs. k curves are referred to as the two branches of
the dispersion relation, and are plotted in Figure 22.10. The lower branch has the
same structure as the single branch we found in the monatomic Bravais lattice: $\omega$
vanishes linearly in k for small k, and the curve becomes flat at the edges of the
Brillouin zone. This branch is known as the acoustic branch because its dispersion
relation is of the form $\omega = ck$ characteristic of sound waves, at small k. The second
branch starts at $\omega = \sqrt{2(K + G)/M}$ at k = 0 and decreases with increasing k down
to $\sqrt{2K/M}$ at the zone edge. This branch is known as the optical branch because
the long wavelength optical modes in ionic crystals can interact with electromagnetic
radiation, and are responsible for much of the characteristic optical behavior of such
crystals (Chapter 27).


Figure  22.10 

Dispersion relation for the diatomic linear chain. The lower branch is the acoustic
branch and has the same structure as the single branch present in the monatomic
case (Figure 22.8). In addition, there is now an optical branch (upper branch).
Vertical axis: $\omega(k)$.


We  can  gain  some  insight  into  the  nature  of  the  two  branches  by  considering  some 
special  cases  in  more  detail: 

Case 1 $k \ll \pi/a$ Here $\cos ka \approx 1 - (ka)^2/2$, and to leading order in k the roots
become:

$$\omega = \sqrt{\frac{2(K + G)}{M}} - O(ka)^2, \tag{22.39}$$

$$\omega = \sqrt{\frac{KG}{2M(K + G)}}\, (ka). \tag{22.40}$$

When k is very small, (22.38) reduces to $\epsilon_2 = \mp\epsilon_1$. The lower sign belongs to the
acoustic mode, and describes a motion in which the two ions in the cell move in
phase with one another (Figure 22.11). The upper sign belongs to the high-frequency
optical mode, and describes a motion in which the two atoms in the cell are 180°
out of phase.

Figure 22.11

The long wavelength acoustic (a) and optical (b) modes in the diatomic linear
chain. The primitive cell contains the two ions joined by the K-spring, represented
by a jagged line. In both cases the motion of every primitive cell is identical,
but in the acoustic mode the ions within a cell move together, while they move
180° out of phase in the optical mode.
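The small-k roots (22.39) and (22.40) follow from (22.37) in two lines. The intermediate steps below are a sketch of that expansion; the labels $\omega_+$ and $\omega_-$ for the optical and acoustic roots are notation introduced here for convenience.

```latex
\begin{align*}
\sqrt{K^2 + G^2 + 2KG\cos ka}
  &\approx \sqrt{(K+G)^2 - KG\,(ka)^2}
   \approx (K+G) - \frac{KG}{2(K+G)}\,(ka)^2, \\
\omega_-^2 &\approx \frac{KG}{2M(K+G)}\,(ka)^2, \qquad
\omega_+^2 \approx \frac{2(K+G)}{M} - \frac{KG}{2M(K+G)}\,(ka)^2 .
\end{align*}
```

Taking square roots gives the linear acoustic root (22.40) and an optical root that departs from $\sqrt{2(K+G)/M}$ only at order $(ka)^2$, as in (22.39).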


Case 2 $k = \pi/a$ Now the roots are

$$\omega = \sqrt{\frac{2K}{M}}, \quad \epsilon_1 = -\epsilon_2; \tag{22.41}$$

$$\omega = \sqrt{\frac{2G}{M}}, \quad \epsilon_1 = \epsilon_2. \tag{22.42}$$

When $k = \pi/a$, the motions in neighboring cells are 180° out of phase, and therefore
the two solutions must be as pictured in Figure 22.12. In each case, only one type
of spring is stretched. Note that if the two spring constants were the same, there
would be no gap between the two frequencies at $k = \pi/a$. The reason for this is clear
from Figure 22.12.
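The zone-boundary values can be confirmed directly from (22.37). In the Python sketch below, the frequencies at $k = \pi/a$ are compared with $\sqrt{2G/M}$ and $\sqrt{2K/M}$, and the gap is seen to close when the two spring constants are set equal. The function name and the numerical values are illustrative assumptions.

```python
import math

def branches(k, K, G, M, a):
    """(omega_acoustic, omega_optical) from (22.37); max() absorbs roundoff."""
    root = math.sqrt(max(K * K + G * G + 2.0 * K * G * math.cos(k * a), 0.0))
    acoustic_sq = max((K + G - root) / M, 0.0)
    return math.sqrt(acoustic_sq), math.sqrt((K + G + root) / M)

K, G, M, a = 3.0, 1.0, 2.0, 1.0      # illustrative values with K > G
k_edge = math.pi / a

w_ac, w_op = branches(k_edge, K, G, M, a)
gap = w_op - w_ac                    # nonzero because K differs from G

w_ac_eq, w_op_eq = branches(k_edge, K, K, M, a)   # equal spring constants
gap_eq = w_op_eq - w_ac_eq
```

With unequal springs the two frequencies are separated; with equal springs both collapse to the single value $\sqrt{2K/M}$.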


Figure 22.12

The acoustic (a) and optical (b) modes of the diatomic linear chain, when
$k = \pm\pi/a$, at the edges of the Brillouin zone. Now the motion changes by 180°
from cell to cell. However, as in Figure 22.11, the ions within each cell move in
phase in the acoustic mode, and 180° out of phase in the optical mode. Note that
if the K- and G-springs were identical the motion would be the same in both cases.
This is why the two branches become degenerate at the edges of the zone when
K = G.


Case 3 $K \gg G$ To leading order in G/K we have:

$$\omega = \sqrt{\frac{2K}{M}} \left[ 1 + O\!\left( \frac{G}{K} \right) \right], \quad \epsilon_1 \approx -\epsilon_2; \tag{22.43}$$

$$\omega = \sqrt{\frac{2G}{M}}\, \left| \sin \tfrac{1}{2} ka \right| \left[ 1 + O\!\left( \frac{G}{K} \right) \right], \quad \epsilon_1 \approx \epsilon_2. \tag{22.44}$$

The optical branch now has a frequency that is independent of k, to leading order
in G/K, and equal to the vibrational frequency of a single diatomic molecule composed
of two mass M ions connected by a spring K. Consistent with this picture of
independent molecular vibrations in each primitive cell, the atomic motions within each
cell are 180° out of phase (to leading order in G/K) whatever the wavelength of the
normal mode. Because G/K is not zero, these molecular vibrations are very weakly
coupled, and the result is a small spread of order G/K in the optical band frequencies,
as k varies through the Brillouin zone.23

The acoustic branch (22.44) is (to leading order in G/K) just that for a linear chain
of atoms of mass 2M coupled by the weak spring G (compare (22.44) with (22.29)).
This is consistent with the fact that $\epsilon_1 \approx \epsilon_2$; i.e., within each cell the atoms move
in phase, and the strong K-spring is hardly stretched at all.
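A quick numerical experiment shows how small the corrections in this limit are. The Python sketch below takes G/K = 0.01, scans k across half the Brillouin zone, and measures the fractional spread of the optical branch together with the largest fractional deviation of the acoustic branch from $\sqrt{2G/M}\,|\sin\tfrac{1}{2}ka|$. The ratio G/K, the grid of 50 wave vectors, and the names used are assumptions for illustration.

```python
import math

def branches(k, K, G, M, a):
    """(omega_acoustic, omega_optical) from (22.37); max() absorbs roundoff."""
    root = math.sqrt(max(K * K + G * G + 2.0 * K * G * math.cos(k * a), 0.0))
    acoustic_sq = max((K + G - root) / M, 0.0)
    return math.sqrt(acoustic_sq), math.sqrt((K + G + root) / M)

K, G, M, a = 100.0, 1.0, 1.0, 1.0                     # G/K = 0.01
ks = [math.pi * j / (50 * a) for j in range(1, 51)]   # 0 < k <= pi/a

optical = [branches(k, K, G, M, a)[1] for k in ks]
spread = (max(optical) - min(optical)) / math.sqrt(2.0 * K / M)

def weak_chain(k):
    """Chain of mass-2M units joined by G-springs: leading term of (22.44)."""
    return math.sqrt(2.0 * G / M) * abs(math.sin(0.5 * k * a))

acoustic_err = max(abs(branches(k, K, G, M, a)[0] / weak_chain(k) - 1.0)
                   for k in ks)
```

For these values both `spread` and `acoustic_err` come out at roughly half of G/K, consistent with corrections of order G/K.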

This  case  suggests  the  following  characterization  of  the  difference  between  optical 
and  acoustic  branches:24  An  acoustic  mode  is  one  in  which  all  ions  within  a  primitive 
cell  move  essentially  in  phase,  as  a  unit,  and  the  dynamics  are  dominated  by  the 
interaction  between  cells;  an  optical  mode,  on  the  other  hand,  is  one  in  which  the 
ions  within  each  primitive  cell  are  executing  what  is  essentially  a  molecular  vibratory 
mode,  which  is  broadened  out  into  a  band  of  frequencies  by  virtue  of  the  intercellular 
interactions. 

Case 4 K = G In this case we are really dealing with a monatomic Bravais lattice
of lattice constant a/2, and the analysis of the preceding section is applicable. It is
nevertheless instructive to see how that analysis emerges in the limit $K \to G$. This
is the subject of Problem 3.
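Without working Problem 3 analytically, one can at least check numerically that the two branches for K = G reproduce the single monatomic branch (22.29) with lattice constant a/2. In the Python sketch below the acoustic branch is compared with the monatomic result at the same k, and the optical branch with the monatomic result at $k - 2\pi/a$, i.e., folded back from the larger zone of the finer lattice. The function names and parameter values are illustrative assumptions.

```python
import math

def branches(k, K, G, M, a):
    """(omega_acoustic, omega_optical) from (22.37); max() absorbs roundoff."""
    root = math.sqrt(max(K * K + G * G + 2.0 * K * G * math.cos(k * a), 0.0))
    acoustic_sq = max((K + G - root) / M, 0.0)
    return math.sqrt(acoustic_sq), math.sqrt((K + G + root) / M)

def monatomic(k, K, M, spacing):
    """Dispersion relation (22.29) for a chain with the given lattice constant."""
    return 2.0 * math.sqrt(K / M) * abs(math.sin(0.5 * k * spacing))

K, M, a = 2.0, 1.0, 1.0
ks = [math.pi * j / (20 * a) for j in range(0, 21)]   # 0 <= k <= pi/a

err_acoustic = max(abs(branches(k, K, K, M, a)[0] - monatomic(k, K, M, a / 2))
                   for k in ks)
err_optical = max(abs(branches(k, K, K, M, a)[1]
                      - monatomic(k - 2.0 * math.pi / a, K, M, a / 2))
                  for k in ks)
```

Both differences vanish to rounding accuracy, so the optical branch is simply the outer part of the monatomic curve translated by a reciprocal lattice vector of the lattice with constant a.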

NORMAL  MODES  OF  A  MONATOMIC  THREE-DIMENSIONAL 
BRAVAIS  LATTICE 

We now consider a general three-dimensional harmonic potential (Eq. (22.10)). To
keep from being blinded by indices it is often convenient to adopt a matrix notation,
writing a quantity like

$$\sum_{\mu\nu} u_\mu(\mathbf{R})\, D_{\mu\nu}(\mathbf{R} - \mathbf{R}')\, u_\nu(\mathbf{R}') \tag{22.45}$$

as the scalar product of u(R) with the vector obtained by operating on the vector
u(R') with the matrix D(R - R'). With this convention the harmonic potential (22.10)
can be written:

$$U^{\text{harm}} = \frac{1}{2} \sum_{\mathbf{R}\mathbf{R}'} \mathbf{u}(\mathbf{R})\, \mathbf{D}(\mathbf{R} - \mathbf{R}')\, \mathbf{u}(\mathbf{R}'). \tag{22.46}$$

In discussing the normal modes of the crystal it is helpful to exploit some general
symmetries that must be obeyed by the matrices D(R - R'), regardless of the
particular forms of the interionic forces.


23  Note  the  similarity  of  this  case  to  the  tight-binding  theory  of  electronic  energy  levels  (Chapter  10), 
in  which  weakly  coupled  atomic  energy  levels  broaden  into  a  narrow  band.  In  the  present  case  weakly 
coupled  molecular  vibrational  levels  broaden  into  a  narrow  band. 

24 This simple physical interpretation does not hold in the general case.

Symmetry 1

$$D_{\mu\nu}(\mathbf{R} - \mathbf{R}') = D_{\nu\mu}(\mathbf{R}' - \mathbf{R}). \tag{22.47}$$

Since the D's are coefficients in the quadratic form (22.10), they can always be
chosen to have this symmetry. Alternatively, it follows from the general definition
of $D_{\mu\nu}(\mathbf{R} - \mathbf{R}')$ as a second derivative of the exact interaction potential,

$$D_{\mu\nu}(\mathbf{R} - \mathbf{R}') = \left. \frac{\partial^2 U}{\partial u_\mu(\mathbf{R})\, \partial u_\nu(\mathbf{R}')} \right|_{\mathbf{u} \equiv 0}, \tag{22.48}$$

because of the independence of order of differentiation.


Symmetry 2

$$D_{\mu\nu}(\mathbf{R} - \mathbf{R}') = D_{\mu\nu}(\mathbf{R}' - \mathbf{R}) \quad \text{or} \quad \mathbf{D}(\mathbf{R}) = \mathbf{D}(-\mathbf{R}), \tag{22.49}$$

or, by virtue of (22.47),

$$D_{\mu\nu}(\mathbf{R} - \mathbf{R}') = D_{\nu\mu}(\mathbf{R} - \mathbf{R}'). \tag{22.50}$$

This symmetry follows from the fact that every Bravais lattice has inversion symmetry.
This implies that the energy of a configuration in which the ion associated with site R
has a displacement u(R) must be the same as the energy of the configuration in which
the ion associated with site R has a displacement -u(-R).25 Equation (22.49) is just
the condition that (22.45) be unchanged by this replacement ($\mathbf{u}(\mathbf{R}) \to -\mathbf{u}(-\mathbf{R})$) for
arbitrary values of the u(R).


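Symmetry 3

$$\sum_{\mathbf{R}} D_{\mu\nu}(\mathbf{R}) = 0 \quad \text{or} \quad \sum_{\mathbf{R}} \mathbf{D}(\mathbf{R}) = 0. \tag{22.51}$$

This follows from the fact that if every ion is given the same displacement d from
equilibrium (i.e., u(R) = d), then the entire crystal will simply be displaced without
internal distortion, and $U^{\text{harm}}$ will have the same value as it has when all the u(R)
vanish, namely zero:

$$0 = \sum_{\mathbf{R}\mathbf{R}'} \sum_{\mu\nu} d_\mu D_{\mu\nu}(\mathbf{R} - \mathbf{R}')\, d_\nu = \sum_{\mu\nu} N d_\mu d_\nu \left( \sum_{\mathbf{R}} D_{\mu\nu}(\mathbf{R}) \right). \tag{22.52}$$

The relation (22.51) is simply the condition that (22.52) vanish for arbitrary choices
of the vector d.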
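These symmetries can be seen at work in the simplest case. For the nearest-neighbor chain, comparing (22.22) with the one-dimensional form of (22.46) gives D(0) = 2K and D(a) = D(-a) = -K. The Python sketch below, in which the function names, the ring of six sites, and the sample displacements are illustrative assumptions, checks the one-dimensional analogues of Symmetries 2 and 3 and confirms that a rigid displacement costs no energy.

```python
def force_constants(K):
    """D(R) for the nearest-neighbor chain (22.22), keyed by m where R = m*a."""
    return {0: 2.0 * K, 1: -K, -1: -K}

def quadratic_form(u, D):
    """(1/2) * sum over R, R' of u(R) D(R - R') u(R') on a ring of len(u) sites."""
    N = len(u)
    return 0.5 * sum(u[n] * D_m * u[(n - m) % N]
                     for n in range(N) for m, D_m in D.items())

def spring_energy(u, K):
    """(K/2) * sum_n [u_n - u_{n+1}]^2 with periodic boundary conditions."""
    N = len(u)
    return 0.5 * K * sum((u[n] - u[(n + 1) % N]) ** 2 for n in range(N))

K = 3.0
D = force_constants(K)
inversion_ok = all(D[m] == D[-m] for m in D)    # analogue of (22.49)
sum_rule = sum(D.values())                      # analogue of (22.51)
uniform = quadratic_form([0.37] * 6, D)         # every ion displaced equally
u = [0.10, -0.05, 0.02, 0.07, -0.03, 0.04]
mismatch = abs(quadratic_form(u, D) - spring_energy(u, K))
```

The quadratic form built from D(R) reproduces the spring energy for arbitrary displacements and vanishes for a uniform shift, exactly as the sum rule requires.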


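25 That is, $\mathbf{r}(\mathbf{R}) \to -\mathbf{r}(-\mathbf{R})$.

Armed with these symmetries, we may proceed as follows:

We have 3N equations of motion (one for each of the three components of the
displacements of the N ions):

$$M\ddot{u}_\mu(\mathbf{R}) = -\frac{\partial U^{\text{harm}}}{\partial u_\mu(\mathbf{R})} = -\sum_{\mathbf{R}'\nu} D_{\mu\nu}(\mathbf{R} - \mathbf{R}')\, u_\nu(\mathbf{R}'), \tag{22.53}$$

or, in matrix notation,

$$M\ddot{\mathbf{u}}(\mathbf{R}) = -\sum_{\mathbf{R}'} \mathbf{D}(\mathbf{R} - \mathbf{R}')\, \mathbf{u}(\mathbf{R}'). \tag{22.54}$$

As in the one-dimensional cases we seek solutions to the equations of motion in
the form of simple plane waves:

$$\mathbf{u}(\mathbf{R}, t) = \boldsymbol{\epsilon}\, e^{i(\mathbf{k} \cdot \mathbf{R} - \omega t)}. \tag{22.55}$$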

Here $\boldsymbol{\epsilon}$ is a vector, to be determined, that describes the direction in which the ions
move. It is known as the polarization vector of the normal mode.

We continue to use the Born-von Karman periodic boundary condition,
requiring that $\mathbf{u}(\mathbf{R} + N_i \mathbf{a}_i) = \mathbf{u}(\mathbf{R})$ for each of the three primitive vectors $\mathbf{a}_i$, where the
$N_i$ are large integers satisfying $N = N_1 N_2 N_3$. This restricts the allowed wave vectors
k to those of the form:26

$$\mathbf{k} = \frac{n_1}{N_1} \mathbf{b}_1 + \frac{n_2}{N_2} \mathbf{b}_2 + \frac{n_3}{N_3} \mathbf{b}_3, \quad n_i \text{ integral}, \tag{22.56}$$

where the $\mathbf{b}_i$ are the reciprocal lattice vectors satisfying $\mathbf{b}_i \cdot \mathbf{a}_j = 2\pi\delta_{ij}$. As in our
discussion of the one-dimensional case, only k within a single primitive cell of the
reciprocal lattice will yield distinct solutions; i.e., if one adds a reciprocal lattice
vector K to the k appearing in (22.55) the displacements of all ions are left completely
unaltered, because of the basic property $e^{i\mathbf{K} \cdot \mathbf{R}} = 1$ of the reciprocal lattice vectors.
As a result there will be just N nonequivalent values of k of the form (22.56) which
can be chosen to lie in any primitive cell of the reciprocal lattice. It is generally
convenient to take that cell to be the first Brillouin zone.

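If we substitute (22.55) into (22.54) we find a solution whenever $\boldsymbol{\epsilon}$ is an eigenvector
of the three-dimensional eigenvalue problem:

$$M\omega^2 \boldsymbol{\epsilon} = \mathbf{D}(\mathbf{k})\, \boldsymbol{\epsilon}. \tag{22.57}$$

Here D(k), known as the dynamical matrix, is given by

$$\mathbf{D}(\mathbf{k}) = \sum_{\mathbf{R}} \mathbf{D}(\mathbf{R})\, e^{-i\mathbf{k} \cdot \mathbf{R}}. \tag{22.58}$$

The three solutions to (22.57) for each of the N allowed values of k give us 3N
normal modes. In discussing these solutions it is useful to translate the symmetries
of D(R) into corresponding symmetries of D(k). It follows from (22.49) and (22.51)
that D(k) can be written in the form:

$$\mathbf{D}(\mathbf{k}) = \frac{1}{2} \sum_{\mathbf{R}} \mathbf{D}(\mathbf{R}) \left[ e^{-i\mathbf{k} \cdot \mathbf{R}} + e^{i\mathbf{k} \cdot \mathbf{R}} - 2 \right] = \sum_{\mathbf{R}} \mathbf{D}(\mathbf{R}) \left[ \cos(\mathbf{k} \cdot \mathbf{R}) - 1 \right] = -2 \sum_{\mathbf{R}} \mathbf{D}(\mathbf{R}) \sin^2\!\left( \tfrac{1}{2} \mathbf{k} \cdot \mathbf{R} \right). \tag{22.59}$$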
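In one dimension the dynamical matrix is a single number, which makes (22.58) and (22.59) easy to verify by hand or by machine. The Python sketch below uses the nearest-neighbor force constants D(0) = 2K, D(a) = D(-a) = -K implied by (22.22), evaluates the Fourier sum (22.58) and the sine-squared form (22.59), and recovers the chain frequency (22.29) from $M\omega^2 = D(k)$. The names, the parameter values, and the sample wave vector are assumptions for illustration.

```python
import cmath
import math

K, M, a = 2.0, 1.5, 1.0              # illustrative values, arbitrary units
D = {0: 2.0 * K, 1: -K, -1: -K}      # D(R) with R = m*a, read off from (22.22)

def dynamical_matrix(k):
    """One-dimensional version of (22.58): sum over R of D(R) exp(-i k R)."""
    return sum(D_m * cmath.exp(-1j * k * m * a) for m, D_m in D.items())

def sine_form(k):
    """One-dimensional version of the last line of (22.59)."""
    return -2.0 * sum(D_m * math.sin(0.5 * k * m * a) ** 2
                      for m, D_m in D.items())

k = 0.7
Dk = dynamical_matrix(k)
omega_from_D = math.sqrt(Dk.real / M)                          # from (22.57)
omega_chain = 2.0 * math.sqrt(K / M) * abs(math.sin(0.5 * k * a))   # (22.29)
```

The Fourier sum is real, agrees with the sine-squared form, and yields the same frequency as the direct solution of the chain.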


26 Compare the discussion on page 136, where identical restrictions were imposed on the allowed
wave vectors for an electronic wave function in a periodic potential.

Equation (22.59) explicitly demonstrates that D(k) is an even function of k, and a
real matrix. In addition, Eq. (22.50) implies that D(k) is a symmetric matrix. It is a
theorem of matrix algebra that every real symmetric three-dimensional matrix has
three real eigenvectors, $\boldsymbol{\epsilon}_1$, $\boldsymbol{\epsilon}_2$, $\boldsymbol{\epsilon}_3$, which satisfy

$$\mathbf{D}(\mathbf{k})\, \boldsymbol{\epsilon}_s(\mathbf{k}) = \lambda_s(\mathbf{k})\, \boldsymbol{\epsilon}_s(\mathbf{k}), \tag{22.60}$$

and can be normalized so that

$$\boldsymbol{\epsilon}_s(\mathbf{k}) \cdot \boldsymbol{\epsilon}_{s'}(\mathbf{k}) = \delta_{ss'}, \quad s, s' = 1, 2, 3. \tag{22.61}$$

Evidently the three normal modes with wave vector k will have polarization vectors
$\boldsymbol{\epsilon}_s(\mathbf{k})$ and frequencies $\omega_s(\mathbf{k})$, given by27

$$\omega_s(\mathbf{k}) = \sqrt{\frac{\lambda_s(\mathbf{k})}{M}}. \tag{22.62}$$
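In the one-dimensional monatomic Bravais lattice we found that $\omega(k)$ vanished
linearly with k at small k. In the three-dimensional monatomic Bravais lattice this
remains the case for each of the three branches. This follows from Eq. (22.59), for
when $\mathbf{k} \cdot \mathbf{R}$ is small for all R connecting sites whose ions have any appreciable
interaction, then we can approximate the sine by28

$$\sin^2\!\left( \tfrac{1}{2} \mathbf{k} \cdot \mathbf{R} \right) \approx \left( \tfrac{1}{2} \mathbf{k} \cdot \mathbf{R} \right)^2 \tag{22.63}$$

and therefore

$$\mathbf{D}(\mathbf{k}) \approx -\frac{k^2}{2} \sum_{\mathbf{R}} (\hat{\mathbf{k}} \cdot \mathbf{R})^2\, \mathbf{D}(\mathbf{R}), \quad \hat{\mathbf{k}} = \frac{\mathbf{k}}{k}. \tag{22.64}$$

Consequently, in the long-wavelength (small k) limit we can write

$$\omega_s(\mathbf{k}) = c_s(\hat{\mathbf{k}})\, k, \tag{22.65}$$

where the $c_s(\hat{\mathbf{k}})$ are the square roots of the eigenvalues of the matrix

$$-\frac{1}{2M} \sum_{\mathbf{R}} (\hat{\mathbf{k}} \cdot \mathbf{R})^2\, \mathbf{D}(\mathbf{R}). \tag{22.66}$$

Note that, in general, the $c_s$ will depend on the direction $\hat{\mathbf{k}}$ of propagation of the
wave as well as on the branch index s.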
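As a one-dimensional sanity check on (22.66), the matrix reduces to the number $-(1/2M)\sum_R R^2 D(R)$, and for the nearest-neighbor chain its square root should equal the slope $a\sqrt{K/M}$ found in (22.31). The Python sketch below performs this comparison; the force constants are those implied by (22.22), and the names and numerical values are assumptions for illustration.

```python
import math

K, M, a = 2.0, 1.5, 1.0              # illustrative values, arbitrary units
D = {0: 2.0 * K, 1: -K, -1: -K}      # D(R) with R = m*a, read off from (22.22)

def sound_speed_squared(D, M, a):
    """One-dimensional version of (22.66): -(1/(2M)) * sum over R of R^2 D(R)."""
    return -sum((m * a) ** 2 * D_m for m, D_m in D.items()) / (2.0 * M)

c = math.sqrt(sound_speed_squared(D, M, a))
c_expected = a * math.sqrt(K / M)    # slope of (22.31)
```

Only the two neighbor terms contribute to the sum, and they give $Ka^2/M$, in agreement with the long-wavelength limit of the chain.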

Typical  dispersion  curves  for  a  three-dimensional  monatomic  Bravais  lattice  are 
shown  in  Figure  22.13. 


27 It can be shown that if D(k) has any negative eigenvalue, then there will be an ionic configuration
for which $U^{\text{harm}}$ is negative, contradicting the assumption that $U^{\text{eq}}$ is the minimum energy. Consequently,
the frequencies $\omega_s(\mathbf{k})$ are real. As in the one-dimensional case, it suffices to take only the positive square
root in (22.62).

28 If the interaction does not decrease rapidly enough with distance, this procedure may be
impermissible. A sufficient condition for its validity is that $\sum_{\mathbf{R}} R^2\, \mathbf{D}(\mathbf{R})$ converge, which is
guaranteed provided that D(R) decreases faster than $1/R^5$ in three dimensions (cf. footnote 21).

Figure 22.13

(a) Typical dispersion curves for the normal-mode frequencies in a monatomic Bravais lattice.
The curves are for lead (face-centered cubic) and are plotted in a repeated-zone scheme along the
edges of the shaded triangle shown in (b). Note that the two transverse branches are degenerate
in the [100] direction. Vertical axis: frequency ($10^{12}$ Hz). (After Brockhouse et al., Phys. Rev. 128, 1099 (1962).)

In the three-dimensional case it is important to consider not only the behavior of
the frequencies $\omega_s(\mathbf{k})$, but also the relations between the directions of the polarization
vectors $\boldsymbol{\epsilon}_s(\mathbf{k})$ and the direction of propagation k. In an isotropic medium, one can
always choose the three solutions for a given k so that one branch (the longitudinal
branch) is polarized along the direction of propagation ($\boldsymbol{\epsilon} \parallel \mathbf{k}$), and the other two
(the transverse branches) are polarized perpendicular to the direction of propagation
($\boldsymbol{\epsilon} \perp \mathbf{k}$).

In an anisotropic crystal the polarization vectors need not be so simply related
to the direction of propagation unless k is invariant under appropriate symmetry
operations of the crystal. If, for example, k lies along a 3-, 4-, or 6-fold rotation axis,
then one mode will be polarized along k and the other two will be polarized
perpendicular to k (and degenerate in frequency).29 One can then continue to use the
nomenclature of the isotropic medium, referring to longitudinal and transverse
branches. In crystals of high symmetry (e.g., cubic) such symmetry directions are
quite common, and since the polarization vectors are continuous functions of k, the
branch that is longitudinal when k is along a symmetry direction tends to have a
polarization vector fairly close to being along k, even when k is not along a symmetry
direction. Similarly, the branches that are transverse when k is along a symmetry
direction have polarizations not too far out of the plane perpendicular to k, even
when k lies in a general direction. One therefore continues to speak of longitudinal
and transverse branches, even though they are strictly longitudinal or transverse
only for special directions of k.

NORMAL  MODES  OF  A  THREE-DIMENSIONAL  LATTICE 
WITH  A  BASIS 

The  calculation  for  a  three-dimensional  lattice  with  a  basis  is  not  sufficiently  different 
from  the  case  we  have  just  described  to  bear  repeating.  Just  as  in  one  dimension, 
the  chief  effect  of  introducing  a  polyatomic  basis  is  to  produce  optical  branches. 
The  description  of  these  is  made  notationally  more  complicated  by  the  introduction 
of  an  index  to  specify  which  of  the  ions  in  the  basis  is  being  referred  to.  The  main 
results  of  the  analysis  are  by  and  large  the  obvious  extrapolations  of  the  cases  we 
have  already  considered: 

For each value of k there are 3p normal modes, where p is the number of ions
in the basis. The frequencies $\omega_s(\mathbf{k})$ ($s = 1, \ldots, 3p$) are all functions of k, with the
periodicity of the reciprocal lattice, corresponding to the fact that plane waves whose
wave vectors k differ by reciprocal lattice vectors K describe identical lattice waves.

Three of the 3p branches are acoustic; i.e., they describe vibrations with frequencies
that vanish linearly with k in the long-wavelength limit. The other $3(p - 1)$ branches
are optical; i.e., their frequencies do not vanish in the long-wavelength limit. One
can think of these modes as the generalizations to the crystalline case of the three
translational and $3(p - 1)$ vibrational degrees of freedom of a p-atomic molecule.
Typical dispersion curves, for the case p = 2, are shown in Figure 22.14.

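The polarization vectors of the normal modes are no longer related by orthogonality
relations as simple as (22.61). If in normal mode s the displacement of ion i in
the cell about R is given by

$$\mathbf{u}_s^i(\mathbf{R}, t) = \mathrm{Re}\left[\boldsymbol{\epsilon}_s^i(\mathbf{k})\, e^{i(\mathbf{k}\cdot\mathbf{R} - \omega_s(\mathbf{k}) t)}\right], \tag{22.67}$$

then it can be shown that the polarization vectors can be chosen to satisfy the 3p
generalized orthogonality relations:

$$\sum_{i=1}^{p} \boldsymbol{\epsilon}_s^{i*}(\mathbf{k}) \cdot \boldsymbol{\epsilon}_{s'}^{i}(\mathbf{k})\, M_i = \delta_{ss'}, \tag{22.68}$$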


29  See  Problem  4.  Note,  however,  that  the  three  polarization  vectors  are  orthogonal  for  general 
directions  of  k  (Eq.  22.61). 


Figure 22.14

Typical dispersion curves along a general direction in k-space for a lattice with a
two-ion basis. The three lower curves (acoustic branches) are linear in k for small k. The
three upper curves (optical branches) will be quite flat if the intracellular interactions
are much stronger than those between cells. Note that the direction of k is not
one of high symmetry, since there is no degeneracy.

where $M_i$ is the mass of the ith type of basis ion. In general the polarization vectors
need not be real,30 nor does the orthogonality relation (22.68) lend itself to a simple
general geometrical interpretation.

CONNECTION  WITH  THE  THEORY  OF  ELASTICITY 

The  classical  theory  of  elasticity  ignores  the  microscopic  atomic  structure  of  a  solid, 
and  treats  it  as  a  continuum.  A  general  deformation  of  the  solid  is  described  in  terms 
of  a  continuous  displacement  field  u(r),  specifying  the  vector  displacement  of  the  part 
of  the  solid  that  in  equilibrium  occupies  position  r.  The  fundamental  assumption  of 
the  theory  is  that  the  contribution  to  the  energy  density  of  the  solid  at  point  r  depends 
only  on  the  value  of  u(r)  in  the  immediate  vicinity  of  r,  or,  more  precisely,  only  on  the 
first  derivatives  of  u(r)  at  the  point  r. 

We can derive the continuum theory of elasticity from the theory of lattice vibrations,
by considering only lattice deformations that vary slowly on a scale determined
by  the  range  of  the  interionic  forces.  We  must  also  assume  that  one  can  specify  the 
deformation  of  the  basis  ions  within  each  primitive  cell  entirely  in  terms  of  the  vector 
field  u(r),  specifying  the  displacement  of  the  entire  cell.  For  simplicity  we  restrict  our 
discussion  to  monatomic  Bravais  lattices,  where  this  assumption  is  trivially  valid. 

To derive the classical theory of elasticity we first note that the symmetries (22.49)
and (22.51) permit us to write the harmonic potential energy (22.10) in the form:

$$U^{\mathrm{harm}} = -\frac{1}{4} \sum_{\mathbf{R}\mathbf{R}'} \{\mathbf{u}(\mathbf{R}') - \mathbf{u}(\mathbf{R})\}\, \mathbf{D}(\mathbf{R} - \mathbf{R}')\, \{\mathbf{u}(\mathbf{R}') - \mathbf{u}(\mathbf{R})\}. \tag{22.69}$$

We only consider displacements u(R) that have a very slight variation from cell to
cell. We can then consider a smooth continuous function u(r), which is equal to u(R)
when r is a Bravais lattice site. If u(r) varies little over the range of $\mathbf{D}(\mathbf{R} - \mathbf{R}')$, then to
an excellent approximation (which becomes exact in the limit of very long-wavelength
disturbances) we can make the replacement

$$\mathbf{u}(\mathbf{R}') = \mathbf{u}(\mathbf{R}) + (\mathbf{R}' - \mathbf{R}) \cdot \nabla \mathbf{u}(\mathbf{r})\big|_{\mathbf{r} = \mathbf{R}} \tag{22.70}$$


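30 That is, perpendicular components of the displacement in the normal mode will not be in phase,
and the mode will have an elliptical polarization.

in (22.69), to find that

$$U^{\mathrm{harm}} = \frac{1}{2} \sum_{\mathbf{R},\, \mu\nu\sigma\tau} \left(\frac{\partial}{\partial x_\sigma} u_\mu(\mathbf{R})\right) \left(\frac{\partial}{\partial x_\tau} u_\nu(\mathbf{R})\right) E_{\sigma\mu\tau\nu}. \tag{22.71}$$

The quantities $E_{\sigma\mu\tau\nu}$, which constitute a tensor of the fourth rank, are given in terms
of D by31

$$E_{\sigma\mu\tau\nu} = -\frac{1}{2} \sum_{\mathbf{R}} R_\sigma D_{\mu\nu}(\mathbf{R}) R_\tau. \tag{22.72}$$

Since the u(r) are slowly varying, we can equally well write (22.71) as an integral,

$$U^{\mathrm{harm}} = \frac{1}{2} \sum_{\mu\nu\sigma\tau} \int d\mathbf{r} \left(\frac{\partial}{\partial x_\sigma} u_\mu(\mathbf{r})\right) \left(\frac{\partial}{\partial x_\tau} u_\nu(\mathbf{r})\right) \bar{E}_{\sigma\mu\tau\nu}, \tag{22.73}$$

where

$$\bar{E}_{\sigma\mu\tau\nu} = \frac{1}{v} E_{\sigma\mu\tau\nu}, \tag{22.74}$$

and $v$ is the volume of the primitive cell.

Equation (22.73) is the starting point in the analysis of classical elasticity theory.
We pursue the subject further, to extract the symmetries of the tensor $E_{\sigma\mu\tau\nu}$ that the
theory exploits.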

First note that it follows directly from (22.72) and (22.50) that $E_{\sigma\mu\tau\nu}$ is unaltered by
the interchange ($\mu \leftrightarrow \nu$) or the interchange ($\sigma \leftrightarrow \tau$). Thus it is enough to specify the
value of $E_{\sigma\mu\tau\nu}$ for the six values:

xx, yy, zz, yz, zx, xy (22.75)

of the pair $\mu\nu$, and the same six values of the pair $\sigma\tau$. This indicates that 6 × 6 = 36
independent numbers are required to specify the energy for a given deformation. A
further general argument reduces their number to 21; it can be reduced still further
by exploiting the symmetry of the particular crystal at hand.


Further  Reduction  in  the  Number  of  Independent  Elastic  Constants 

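The energy of a crystal is unaffected by a rigid rotation. However, under a rotation
through the infinitesimal angle $\delta\omega$ about an axis $\hat{\mathbf{n}}$ passing through the origin, each
Bravais lattice vector will be shifted by

$$\mathbf{u}(\mathbf{R}) = \delta\boldsymbol{\omega} \times \mathbf{R}, \qquad \delta\boldsymbol{\omega} = \delta\omega\, \hat{\mathbf{n}}. \tag{22.76}$$

If we substitute (22.76) into (22.71), we must find that $U^{\mathrm{harm}} = 0$ for arbitrary $\delta\boldsymbol{\omega}$. It
is not difficult to show that this implies that $U^{\mathrm{harm}}$ can depend on the derivatives
$(\partial/\partial x_\sigma) u_\mu$ only in the symmetrical combination (the strain tensor):

$$\epsilon_{\sigma\mu} = \frac{1}{2} \left(\frac{\partial}{\partial x_\sigma} u_\mu + \frac{\partial}{\partial x_\mu} u_\sigma\right). \tag{22.77}$$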


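31 Evidently our theory will make sense only if D(R) vanishes rapidly enough at large R for the sum
in (22.72) to converge. This will be satisfied trivially if D(R) vanishes for R greater than some $R_0$, and also
for D(R) of infinitely long range, provided it vanishes faster than $1/R^5$.

Consequently, we can rewrite (22.73) as

$$U^{\mathrm{harm}} = \frac{1}{2} \int d\mathbf{r} \left[\sum_{\sigma\mu\tau\nu} \epsilon_{\sigma\mu}\, c_{\sigma\mu\tau\nu}\, \epsilon_{\tau\nu}\right], \tag{22.78}$$

where

$$c_{\sigma\mu\tau\nu} = -\frac{1}{8v} \sum_{\mathbf{R}} \left[R_\sigma D_{\mu\nu} R_\tau + R_\mu D_{\sigma\nu} R_\tau + R_\sigma D_{\mu\tau} R_\nu + R_\mu D_{\sigma\tau} R_\nu\right]. \tag{22.79}$$

It is clear from (22.79) and the symmetry (22.50) of D that $c_{\sigma\mu\tau\nu}$ is invariant under the
transposition $\sigma\mu \leftrightarrow \tau\nu$. In addition, it follows directly from (22.79) that $c_{\sigma\mu\tau\nu}$ is
invariant under the transpositions $\sigma \leftrightarrow \mu$ or $\tau \leftrightarrow \nu$. As a result, the number of
independent components of $c_{\sigma\mu\tau\nu}$ is reduced to 21.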
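The counting argument above can be checked by brute force. The following is a minimal sketch, not part of the original derivation: it treats the 81 index combinations of a fourth-rank tensor in three dimensions as points, merges those related by the stated index permutations, and counts the resulting classes. The function name `count_independent` and the permutation constants are illustrative choices.

```python
from itertools import product

def count_independent(symmetries):
    """Count classes of index tuples (sigma, mu, tau, nu) related by the given
    index permutations; each class corresponds to one independent component."""
    parent = {idx: idx for idx in product(range(3), repeat=4)}

    def find(x):
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for idx in list(parent):
        for perm in symmetries:
            image = tuple(idx[p] for p in perm)
            root_a, root_b = find(idx), find(image)
            if root_a != root_b:
                parent[root_a] = root_b
    return len({find(idx) for idx in parent})

SWAP_FIRST_PAIR = (1, 0, 2, 3)    # sigma <-> mu
SWAP_SECOND_PAIR = (0, 1, 3, 2)   # tau <-> nu
SWAP_PAIRS = (2, 3, 0, 1)         # (sigma mu) <-> (tau nu)

n_no_symmetry = count_independent([])
n_within_pairs = count_independent([SWAP_FIRST_PAIR, SWAP_SECOND_PAIR])
n_all = count_independent([SWAP_FIRST_PAIR, SWAP_SECOND_PAIR, SWAP_PAIRS])
print(n_no_symmetry, n_within_pairs, n_all)
```

Symmetry within each index pair alone leaves 6 × 6 = 36 classes; adding the exchange of the two pairs leaves 21, in agreement with the text.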

Crystal Symmetries

Depending on the crystal system, one can further reduce the number of independent
elastic constants.32 The maximum number required for each of the seven crystal
systems is shown in Table 22.1. For example, in the cubic case the only three
independent components are

$$c_{11} = c_{xxxx} = c_{yyyy} = c_{zzzz},$$
$$c_{12} = c_{xxyy} = c_{yyzz} = c_{zzxx},$$
$$c_{44} = c_{xyxy} = c_{yzyz} = c_{zxzx}.$$

Table 22.1

CRYSTAL SYSTEM    POINT GROUPS                        ELASTIC CONSTANTS
Triclinic         all                                 21
Monoclinic        all                                 13
Orthorhombic      all                                 9
Tetragonal        $C_4$, $C_{4h}$, $S_4$                    7
                  $C_{4v}$, $D_4$, $D_{4h}$, $D_{2d}$          6
Rhombohedral      $C_3$, $S_6$                            7
                  $C_{3v}$, $D_3$, $D_{3d}$                  6
Hexagonal         all                                 5
Cubic             all                                 3

All the other components (in which x, y, or z must appear as an index an odd number
of times) vanish because the energy of a cubic crystal cannot change when the sign of
a single component of the displacement field along any of the cubic axes is reversed.

Unfortunately, the language in which elasticity theory is conventionally couched
fails to take full advantage of the simple tensor notation. In particular, the displacement
field is usually described not by (22.77), but by the strain components

$$e_{\mu\nu} = \begin{cases} \epsilon_{\mu\nu}, & \mu = \nu, \\ 2\epsilon_{\mu\nu}, & \mu \neq \nu, \end{cases} \tag{22.80}$$

which are, in turn, simplified to $e_\alpha$, $\alpha = 1, \ldots, 6$ according to the rule:

$$xx \to 1, \quad yy \to 2, \quad zz \to 3, \quad yz \to 4, \quad zx \to 5, \quad xy \to 6. \tag{22.81}$$

32 See, for example, A. E. H. Love, A Treatise on the Mathematical Theory of Elasticity, Dover,
New York.

Instead of Eq. (22.78), one writes:

$$U = \frac{1}{2} \int d\mathbf{r} \sum_{\alpha\beta} e_\alpha C_{\alpha\beta} e_\beta, \tag{22.82}$$

where the elements of the 6 × 6 matrix of C's are related to the components of the
tensor $c_{\sigma\mu\tau\nu}$ by

$$C_{\alpha\beta} = c_{\sigma\mu\tau\nu}, \quad \text{where } \alpha \leftrightarrow \sigma\mu \text{ and } \beta \leftrightarrow \tau\nu, \text{ as specified in (22.81).} \tag{22.83}$$

The quantities $C_{\alpha\beta}$ are called the elastic stiffness constants (or the elastic moduli).
The elements of the 6 × 6 matrix S that is inverse to C are called the elastic compliance
constants (or simply the elastic constants).
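As an illustration of the relation between stiffness and compliance, the sketch below builds the 6 × 6 stiffness matrix of a cubic crystal from its three independent constants and inverts it numerically. It assumes NumPy is available; the helper name `cubic_stiffness` is hypothetical, and the numerical values are the copper entries of Table 22.2 (in units of $10^{12}$ dynes-cm$^{-2}$).

```python
import numpy as np

def cubic_stiffness(c11, c12, c44):
    """6x6 stiffness matrix C_{alpha beta} of a cubic crystal, with the index
    ordering xx, yy, zz, yz, zx, xy of rule (22.81)."""
    C = np.zeros((6, 6))
    C[:3, :3] = c12                 # off-diagonal entries of the upper-left block
    for i in range(3):
        C[i, i] = c11               # xx-xx, yy-yy, zz-zz
        C[i + 3, i + 3] = c44       # yz-yz, zx-zx, xy-xy
    return C

# Copper, from Table 22.2 (units of 10^12 dynes/cm^2).
c11, c12, c44 = 1.68, 1.21, 0.75
C = cubic_stiffness(c11, c12, c44)
S = np.linalg.inv(C)                # elastic compliance matrix

# Closed-form cubic compliances, for comparison with the numerical inverse.
denom = (c11 - c12) * (c11 + 2 * c12)
s11_closed = (c11 + c12) / denom
s12_closed = -c12 / denom
s44_closed = 1.0 / c44
print(S[0, 0], S[0, 1], S[3, 3])
```

The numerical inverse reproduces the familiar closed forms for a cubic crystal, which is a convenient sanity check on the index bookkeeping.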

Given the potential-energy density (22.78), the macroscopic theory of elasticity
goes on to construct a wave equation for u(r, t). The neatest way to do this is to note
that the kinetic energy associated with a given deformation field u(r) can be written
in the form

$$T = \frac{\rho}{2} \int d\mathbf{r}\, \dot{\mathbf{u}}(\mathbf{r}, t)^2, \tag{22.84}$$

where $\rho$ is the mass density of the lattice: $\rho = MN/V$. We can then write a Lagrangian
for the medium in the form:

$$L = T - U = \frac{1}{2} \int d\mathbf{r} \left[\rho\, \dot{\mathbf{u}}(\mathbf{r})^2 - \frac{1}{4} \sum_{\sigma\mu\tau\nu} c_{\sigma\mu\tau\nu} \left(\frac{\partial}{\partial x_\sigma} u_\mu(\mathbf{r}) + \frac{\partial}{\partial x_\mu} u_\sigma(\mathbf{r})\right) \left(\frac{\partial}{\partial x_\tau} u_\nu(\mathbf{r}) + \frac{\partial}{\partial x_\nu} u_\tau(\mathbf{r})\right)\right]. \tag{22.85}$$

Hamilton's principle,

$$\delta \int dt\, L = 0,$$

then implies the equations of motion:33

$$\rho\, \ddot{u}_\mu = \sum_{\sigma\nu\tau} c_{\mu\sigma\nu\tau} \frac{\partial^2 u_\tau}{\partial x_\sigma\, \partial x_\nu}. \tag{22.86}$$

If a solution is sought of the form

$$\mathbf{u}(\mathbf{r}, t) = \boldsymbol{\epsilon}\, e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}, \tag{22.87}$$

33 These can also, of course, be derived in a more elementary and picturesque way by considering
the forces acting on small volume elements. The superiority of the Lagrangian derivation is only evident
when tensor notation is used.

then $\omega$ will have to be related to k through the eigenvalue equation

$$\rho\, \omega^2 \epsilon_\mu = \sum_\tau \left(\sum_{\sigma\nu} c_{\mu\sigma\nu\tau} k_\sigma k_\nu\right) \epsilon_\tau. \tag{22.88}$$

This has the same structure as (and through (22.79) can be shown to be identical to)
the results (22.65) and (22.66) derived in the long wavelength limit of the general
harmonic theory. Thus in the limit of long wavelengths the normal modes of the
discrete crystal reduce to the sound waves of the elastic continuum. Conversely, by
measuring the velocities of sound in the solid, one can extract information about the
force constants via (22.88) and the microscopic definition of the $c_{\sigma\mu\tau\nu}$ in (22.79).
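To make the link between elastic constants and sound velocities concrete, here is a minimal numerical sketch of the eigenvalue problem (22.88) for a cubic crystal. It assumes NumPy; the function names are illustrative, the elastic constants are the copper entries of Table 22.2, and the mass density of copper (taken here as 8.96 g/cm³) is an outside assumption not given in the text.

```python
import numpy as np

def cubic_tensor(c11, c12, c44):
    """Fourth-rank tensor c[sigma, mu, tau, nu] of a cubic crystal."""
    c = np.zeros((3, 3, 3, 3))
    for i in range(3):
        for j in range(3):
            if i == j:
                c[i, i, i, i] = c11
            else:
                c[i, i, j, j] = c12
                c[i, j, i, j] = c44
                c[i, j, j, i] = c44
    return c

def sound_velocities(c, rho, direction):
    """Solve rho * v^2 * eps = M eps, with M[mu, tau] = sum c[mu, sigma, nu, tau] n_sigma n_nu,
    which is (22.88) divided by k^2 for propagation along the unit vector n."""
    n = np.array(direction, dtype=float)
    n /= np.linalg.norm(n)
    m = np.einsum("msnt,s,n->mt", c, n, n)
    eigvals, eigvecs = np.linalg.eigh(m)      # ascending eigenvalues, orthonormal polarizations
    return np.sqrt(eigvals / rho), eigvecs

# Copper: constants from Table 22.2 converted to dynes/cm^2; density is an assumed value.
RHO_CU = 8.96                                  # g/cm^3 (assumption)
c_cu = cubic_tensor(1.68e12, 1.21e12, 0.75e12)

v100, pol100 = sound_velocities(c_cu, RHO_CU, [1, 0, 0])
v110, pol110 = sound_velocities(c_cu, RHO_CU, [1, 1, 0])
print("[100] velocities (cm/s):", v100)
print("[110] velocities (cm/s):", v110)
```

Along [100] the two transverse modes are degenerate with $\rho v^2 = c_{44}$ and the longitudinal mode has $\rho v^2 = c_{11}$; along [110] the three values of $\rho v^2$ are $c_{44}$, $(c_{11} - c_{12})/2$, and $(c_{11} + c_{12} + 2c_{44})/2$. Run in reverse, the same relations are how measured sound velocities determine the elastic constants.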

In  Table  22.2  we  list  the  elastic  constants  for  some  representative  cubic  solids. 


Table 22.2

ELASTIC CONSTANTS FOR SOME CUBIC CRYSTALS (a)

SUBSTANCE    c11      c12      c44      REFERENCE (b)
Li (78 K)    0.148    0.125    0.108    1
Na           0.070    0.061    0.045    2
Cu           1.68     1.21     0.75     3
Ag           1.24     0.93     0.46     3
Au           1.86     1.57     0.42     3
Al           1.07     0.61     0.28     4
Pb           0.46     0.39     0.144    5
Ge           1.29     0.48     0.67     1
Si           1.66     0.64     0.80     3
V            2.29     1.19     0.43     6
Ta           2.67     1.61     0.82     6
Nb           2.47     1.35     0.287    6
Fe           2.34     1.36     1.18     7
Ni           2.45     1.40     1.25     8
LiCl         0.494    0.228    0.246    9
NaCl         0.487    0.124    0.126    9
KF           0.656    0.146    0.125    9
RbCl         0.361    0.062    0.047    10

(a) Elastic constants in $10^{12}$ dynes-cm$^{-2}$ at 300 K.
5. P. Ho and A. L. Ruoff, J. Appl. Phys. 40, 51 (1969).

PROBLEMS


1. Linear Chain with mth Nearest-Neighbor Interactions

Reexamine the theory of the linear chain, without making the assumption that only nearest
neighbors interact, replacing Eq. (22.22) by

$$U^{\mathrm{harm}} = \sum_n \sum_{m > 0} \frac{1}{2} K_m \left[u(na) - u([n + m]a)\right]^2. \tag{22.89}$$

(a) Show that the dispersion relation (22.29) must be generalized to

$$\omega = 2 \sqrt{\sum_{m > 0} K_m \frac{\sin^2 \frac{1}{2} m k a}{M}}. \tag{22.90}$$

(b) Show that the long-wavelength limit of the dispersion relation, (22.31), must be generalized
to:

$$\omega = a \left(\sum_{m > 0} m^2 K_m / M\right)^{1/2} |k|, \tag{22.91}$$

provided that $\sum m^2 K_m$ converges.

(c) Show that if $K_m = 1/m^p$ ($1 < p < 3$), so that the sum does not converge, then in the
long-wavelength limit

$$\omega \propto k^{(p-1)/2}. \tag{22.92}$$

(Hint: It is no longer permissible to use the small-k expansion of the sine in (22.90), but one can
replace the sum by an integral in the limit of small k.)

(d) Show that in the special case p = 3,

$$\omega \sim k \sqrt{|\ln k|}. \tag{22.93}$$

2. Diatomic Linear Chain

Consider a linear chain in which alternate ions have mass $M_1$ and $M_2$, and only nearest neighbors
interact.

(a) Show that the dispersion relation for the normal modes is

$$\omega^2 = \frac{K}{M_1 M_2} \left(M_1 + M_2 \pm \sqrt{M_1^2 + M_2^2 + 2 M_1 M_2 \cos ka}\right). \tag{22.94}$$

(b) Discuss the form of the dispersion relation and the nature of the normal modes when
$M_1 \gg M_2$.

(c) Compare the dispersion relation with that of the monatomic linear chain (22.29) when
$M_1 \approx M_2$.

3. Lattice with a Basis Viewed as a Weakly Perturbed Monatomic Bravais Lattice

It is instructive to examine the dispersion relation (22.37) for the one-dimensional lattice with a
basis, in the limit in which the coupling constants K and G become very close:

$$K = K_0 + \Delta, \quad G = K_0 - \Delta, \quad \Delta \ll K_0. \tag{22.95}$$

(a) Show that when $\Delta = 0$, the dispersion relation (22.37) reduces to that for a monatomic
linear chain with nearest-neighbor coupling. (Warning: If the length of the unit cell in the diatomic
chain is a, then when K = G it will reduce to a monatomic chain with lattice constant a/2;
furthermore, the Brillouin zone ($-\pi/a < k < \pi/a$) for the diatomic chain will be only half the
Brillouin zone ($-\pi/(a/2) < k < \pi/(a/2)$) of the monatomic chain. You must therefore explain
how two branches (acoustic and optical) in half the zone reduce back to one branch in the full
zone. To demonstrate the reduction convincingly you should examine the behavior of the amplitude
ratio, Eq. (22.38), when $\Delta = 0$.)

(b) Show that when $\Delta \neq 0$, but $\Delta \ll K_0$, then the dispersion relation differs from that of the
monatomic chain only by terms of order $(\Delta/K_0)^2$, except when $|\pi - ka|$ is of order $\Delta/K_0$. Show
that when this happens the distortion of the dispersion relation for the monatomic chain is linear
in $\Delta/K_0$.34

4.  Polarization  of  the  Normal  Modes  of  a  Monatomic  Bravais  Lattice 

(a)  Show  that  if  k  lies  along  a  3-,  4-,  or  6-fold  axis,  then  one  normal  mode  is  polarized  along 
k,  and  the  other  two  are  degenerate  and  polarized  perpendicular  to  k. 

(b) Show that if k lies in a plane of mirror symmetry, then one normal mode has a polarization
perpendicular to k, and the other two have polarization vectors lying in the mirror plane.

(c)  Show  that  if  the  point  k  lies  in  a  Bragg  plane  that  is  parallel  to  a  plane  of  mirror  symmetry, 
then  one  normal  mode  is  polarized  perpendicular  to  the  Bragg  plane,  while  the  other  two  have 
polarizations  lying  in  the  plane.  (Note  that  in  this  case  the  modes  cannot  be  strictly  longitudinal  and 
transverse  unless  k  is  perpendicular  to  the  Bragg  plane.) 

To  answer  these  questions,  one  must  note  that  any  operation  that  leaves  both  k  and  the 
crystal  invariant  must  transform  one  normal  mode  with  wave  vector  k  into  another.  In  particular, 
the  set  of  three  (orthogonal)  polarization  vectors  must  be  invariant  under  such  operations.  In 
applying  this  fact  one  must  remember  that  if  two  normal  modes  are  degenerate,  then  any  vector 
in  the  plane  spanned  by  their  polarization  vectors  is  also  a  possible  polarization  vector. 

34 Note the analogy to the nearly free electron model of Chapter 9: The free electron gas corresponds
to the monatomic linear chain; the weak periodic potential corresponds to the small change in coupling
between alternate nearest-neighbor pairs.

5. Normal Modes of a Three-Dimensional Crystal

Consider a face-centered cubic monatomic Bravais lattice in which each ion interacts only with
its (twelve) nearest neighbors. Assume that the interaction between a pair of neighboring ions is
described by a pair potential $\phi$ that depends only on the distance r between the pair of ions.

(a) Show that the frequencies of the three normal modes with wave vector k are given by

$$\omega = \sqrt{\lambda / M}, \tag{22.96}$$

where the $\lambda$ are the eigenvalues of the 3 × 3 matrix:

$$\mathbf{D} = \sum_{\mathbf{R}} \sin^2\left(\tfrac{1}{2} \mathbf{k} \cdot \mathbf{R}\right) \left[A \mathbf{1} + B \hat{\mathbf{R}} \hat{\mathbf{R}}\right]. \tag{22.97}$$

Here the sum is over the twelve nearest neighbors of R = 0:

$$\frac{a}{2}(\pm\hat{\mathbf{x}} \pm \hat{\mathbf{y}}), \quad \frac{a}{2}(\pm\hat{\mathbf{y}} \pm \hat{\mathbf{z}}), \quad \frac{a}{2}(\pm\hat{\mathbf{z}} \pm \hat{\mathbf{x}}); \tag{22.98}$$

$\mathbf{1}$ is the unit matrix ($(\mathbf{1})_{\mu\nu} = \delta_{\mu\nu}$), and $\hat{\mathbf{R}}\hat{\mathbf{R}}$ is the diadic formed from the unit vectors $\hat{\mathbf{R}} = \mathbf{R}/R$
(i.e., $(\hat{\mathbf{R}}\hat{\mathbf{R}})_{\mu\nu} = \hat{R}_\mu \hat{R}_\nu$). The constants A and B are: $A = 2\phi'(d)/d$, $B = 2[\phi''(d) - \phi'(d)/d]$, where
d is the equilibrium nearest-neighbor distance. (This follows from Equations (22.59) and (22.11).)

(b) Show that when k is in the (100) direction (k = (k, 0, 0) in rectangular coordinates),
then one normal mode is strictly longitudinal, with frequency

$$\omega_L = \sqrt{\frac{8A + 4B}{M}}\, \sin \tfrac{1}{4} ka, \tag{22.99}$$

and the other two are strictly transverse and degenerate, with frequency

$$\omega_T = \sqrt{\frac{8A + 2B}{M}}\, \sin \tfrac{1}{4} ka. \tag{22.100}$$

(c) What are the frequencies and polarizations of the normal modes when k is along a [111]
direction ($\mathbf{k} = (k, k, k)/\sqrt{3}$)?

(d) Show that when k is along the [110] direction ($\mathbf{k} = (k, k, 0)/\sqrt{2}$), then one mode is
strictly longitudinal, with frequency

$$\omega_L = \sqrt{\frac{8A + 2B}{M} \sin^2\left(\frac{1}{4} \frac{ka}{\sqrt{2}}\right) + \frac{2A + 2B}{M} \sin^2\left(\frac{1}{2} \frac{ka}{\sqrt{2}}\right)}, \tag{22.101}$$

one is strictly transverse and polarized along the z-axis ($\boldsymbol{\epsilon} = (0, 0, 1)$), with frequency

$$\omega_T^{(1)} = \sqrt{\frac{8A + 4B}{M} \sin^2\left(\frac{1}{4} \frac{ka}{\sqrt{2}}\right) + \frac{2A}{M} \sin^2\left(\frac{1}{2} \frac{ka}{\sqrt{2}}\right)}, \tag{22.102}$$

and the third is strictly transverse and perpendicular to the z-axis, with frequency

$$\omega_T^{(2)} = \sqrt{\frac{8A + 2B}{M} \sin^2\left(\frac{1}{4} \frac{ka}{\sqrt{2}}\right) + \frac{2A}{M} \sin^2\left(\frac{1}{2} \frac{ka}{\sqrt{2}}\right)}. \tag{22.103}$$

(e) Sketch the dispersion curves along the lines $\Gamma X$ and $\Gamma K X$ (Figure 22.13), assuming that
A = 0. (Note: The length of $\Gamma X$ is $2\pi/a$.)
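The results quoted in Problem 5 can be checked numerically before attempting the algebra. The sketch below, which assumes NumPy, builds the matrix (22.97) for the twelve nearest neighbors (22.98) and diagonalizes it; the function names and the sample values of A, B, M, a, and k are arbitrary illustrative choices, not values from the text.

```python
import numpy as np
from itertools import product

def fcc_neighbors(a=1.0):
    """The twelve nearest-neighbor vectors (22.98) of an fcc Bravais lattice."""
    vecs = []
    for i, j in [(0, 1), (1, 2), (2, 0)]:
        for si, sj in product((1, -1), repeat=2):
            r = np.zeros(3)
            r[i], r[j] = si, sj
            vecs.append(0.5 * a * r)
    return np.array(vecs)

def dynamical_matrix(k, A, B, a=1.0):
    """The 3x3 matrix D of Eq. (22.97)."""
    D = np.zeros((3, 3))
    for R in fcc_neighbors(a):
        r_hat = R / np.linalg.norm(R)
        D += np.sin(0.5 * np.dot(k, R)) ** 2 * (A * np.eye(3) + B * np.outer(r_hat, r_hat))
    return D

def frequencies(k, A, B, M=1.0, a=1.0):
    """Normal-mode frequencies omega = sqrt(lambda / M), Eq. (22.96), in ascending order."""
    lam = np.linalg.eigvalsh(dynamical_matrix(np.asarray(k, dtype=float), A, B, a))
    return np.sqrt(np.clip(lam, 0.0, None) / M)

# Arbitrary sample parameters (assumptions for illustration only).
A, B, M, a, k = 0.1, 1.0, 1.0, 1.0, 1.3

w100 = frequencies([k, 0.0, 0.0], A, B, M, a)
w_long = np.sqrt((8 * A + 4 * B) / M) * abs(np.sin(k * a / 4))
w_trans = np.sqrt((8 * A + 2 * B) / M) * abs(np.sin(k * a / 4))
print(w100, w_long, w_trans)

w110 = frequencies(np.array([k, k, 0.0]) / np.sqrt(2), A, B, M, a)
print(w110)
```

For k along [100] the numerical eigenvalues reproduce (22.99) and (22.100), with the transverse value appearing twice; the same routine evaluated along [110] or [111] gives a quick check on parts (c) and (d).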


23 

Quantum  Theory  of 
the  Harmonic 
Crystal 


Normal  Modes  and  Phonons 

High-Temperature  Specific  Heat 

Low-Temperature  Specific  Heat 

Models  of  Debye  and  Einstein 

Comparison  of  Lattice  and 
Electronic  Specific  Heats 

Density  of  Normal  Modes  (Phonon  Level  Density) 
Analogy  with  Theory  of  Blackbody  Radiation 




In Chapter 22 we found that the contribution of the lattice vibrations to the specific
heat of a classical harmonic crystal was independent of the temperature (the law of
Dulong and Petit). However, as the temperature drops below room temperature, the
specific heat of all solids starts to decline below the classical value, and eventually is
observed to vanish as $T^3$ (in insulators) or $AT + BT^3$ (in metals). The explanation
of this behavior was one of the earliest triumphs of the quantum theory of solids.

In a quantum theory of the specific heat of a harmonic crystal, the classical expression
(22.12) for the thermal energy density u must be replaced by the general
quantum-mechanical result

$$u = \frac{1}{V} \frac{\sum_i E_i e^{-\beta E_i}}{\sum_i e^{-\beta E_i}}, \qquad \beta = 1/k_B T, \tag{23.1}$$

where $E_i$ is the energy of the ith stationary state of the crystal, and the sum is over all
stationary states.

The energies of these stationary states are given by the eigenvalues of the harmonic
Hamiltonian:1

$$H^{\mathrm{harm}} = \sum_{\mathbf{R}} \frac{1}{2M} \mathbf{P}(\mathbf{R})^2 + \frac{1}{2} \sum_{\mathbf{R}\mathbf{R}'} u_\mu(\mathbf{R})\, D_{\mu\nu}(\mathbf{R} - \mathbf{R}')\, u_\nu(\mathbf{R}'). \tag{23.2}$$

The  detailed  procedure  by  which  these  eigenvalues  are  extracted  is  summarized  in 
Appendix  L.  The  result  of  this  calculation  is  so  simple  and  intuitively  plausible  that 
we  simply  state  it  here,  unencumbered  by  its  straightforward,  but  rather  lengthy, 
derivation. 

To specify the energy levels of an N-ion harmonic crystal, one regards it as 3N
independent oscillators, whose frequencies are those of the 3N classical normal modes
described in Chapter 22. The contribution to the total energy of a particular normal
mode with angular frequency $\omega_s(\mathbf{k})$ can have only the discrete set of values

$$\left(n_{\mathbf{k}s} + \tfrac{1}{2}\right) \hbar \omega_s(\mathbf{k}), \tag{23.3}$$

where $n_{\mathbf{k}s}$, the excitation number of the normal mode, is restricted to the values 0, 1,
2, .... A state of the entire crystal is specified by giving the excitation numbers for
each of the 3N normal modes. The total energy is just the sum of the energies of the
individual normal modes:

$$E = \sum_{\mathbf{k}s} \left(n_{\mathbf{k}s} + \tfrac{1}{2}\right) \hbar \omega_s(\mathbf{k}). \tag{23.4}$$

The  thermal  energy  (23.1)  can  be  evaluated  from  (23.4)  straightforwardly.  Before 
doing  so,  however,  we  digress  to  describe  the  language  in  which  the  excited  states  of 
the  harmonic  crystal  are  customarily  discussed. 


1 See Eqs. (22.8) and (22.10). We display only the form appropriate to a monatomic Bravais lattice,
but the discussion that follows is quite general. We have added the kinetic energy, which no longer
disappears from the problem at an early stage (as it does in classical statistical mechanics). We drop, for
the moment, the additive constant $U^{\mathrm{eq}}$. This has the effect of subtracting $U^{\mathrm{eq}}/V$ from the energy density
(23.1). Since $U^{\mathrm{eq}}$ does not depend on temperature, this has no effect on the specific heat. Should we require,
however, the volume dependence of the internal energy, it would be necessary to retain $U^{\mathrm{eq}}$.

NORMAL MODES vs. PHONONS

We have described the result (23.4) in terms of the excitation number $n_{\mathbf{k}s}$ of the normal
mode with wave vector k in branch s. This nomenclature can be quite clumsy, especially
when describing processes in which energy is exchanged among the normal
modes, or between the normal modes and other systems, such as electrons, incident
neutrons, or incident X rays. Usually the language of normal modes is replaced by
an equivalent corpuscular description, which is analogous to the terminology used
in the quantum theory of the electromagnetic field. In that theory the allowed energies
of a normal mode of the radiation field in a cavity are given by $(n + \tfrac{1}{2})\hbar\omega$,
where $\omega$ is the angular frequency of the mode. It is the universal practice, however,
to speak not of the quantum number of excitation of the mode, n, but of the number,
n, of photons of that type that are present. In precisely the same way, instead of saying
that the normal mode of branch s with wave vector k is in its $n_{\mathbf{k}s}$th excited state, one
says that there are $n_{\mathbf{k}s}$ phonons of type s with wave vector k present in the crystal.

The term “phonon” stresses this analogy with photons. The latter are the quanta
of the radiation field that (in the appropriate frequency range) describes classical
light; the former are the quanta of the ionic displacement field that (in the appropriate
frequency range) describes classical sound. Although the language of phonons is
more convenient than that of normal modes, the two nomenclatures are completely
equivalent.

To calculate the contribution of the lattice vibrations to the internal energy, we
substitute the explicit form (23.4) of the energy levels into the general formula (23.1).
To expedite the calculation we introduce the quantity

$$f = \frac{1}{V} \ln \left(\sum_i e^{-\beta E_i}\right). \tag{23.5}$$

The identity

$$u = -\frac{\partial f}{\partial \beta} \tag{23.6}$$

can be verified by explicit differentiation of (23.5). To evaluate f, note that $e^{-\beta E}$ occurs
exactly once for every energy E of the form (23.4) in the expansion of the product

$$\prod_{\mathbf{k}s} \left(e^{-\beta\hbar\omega_s(\mathbf{k})/2} + e^{-3\beta\hbar\omega_s(\mathbf{k})/2} + e^{-5\beta\hbar\omega_s(\mathbf{k})/2} + \cdots\right). \tag{23.7}$$

The individual terms in this product are just convergent geometric series, that can
be explicitly summed, to give

$$f = \frac{1}{V} \ln \prod_{\mathbf{k}s} \frac{e^{-\beta\hbar\omega_s(\mathbf{k})/2}}{1 - e^{-\beta\hbar\omega_s(\mathbf{k})}}. \tag{23.8}$$

Differentiating f, as required by (23.6), we find that the internal energy density is given
by

$$u = \frac{1}{V} \sum_{\mathbf{k}s} \hbar\omega_s(\mathbf{k}) \left[n_s(\mathbf{k}) + \tfrac{1}{2}\right], \tag{23.9}$$

where

$$n_s(\mathbf{k}) = \frac{1}{e^{\beta\hbar\omega_s(\mathbf{k})} - 1}. \tag{23.10}$$

Comparing Eq. (23.9), for the mean thermal energy density of the crystal at temperature
T, with Eq. (23.4), for the energy in a particular stationary state, leads one
to conclude that $n_s(\mathbf{k})$ is simply the mean excitation number of the normal mode ks
at temperature T. In the phonon language, $n_s(\mathbf{k})$ is the mean number of phonons of
type ks present in thermal equilibrium2 at temperature T.
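The identification of (23.10) with a mean excitation number can be confirmed numerically for a single mode. The sketch below is illustrative only: it averages n over the levels $(n + \tfrac{1}{2})\hbar\omega$ with Boltzmann weights and compares the result with the closed form. The function names and the truncation parameter `n_max` are assumptions of this example.

```python
import math

def mean_excitation_direct(x, n_max=2000):
    """Thermal average of n over the levels (n + 1/2) * hbar * omega, where
    x = hbar * omega / (k_B T). The common factor exp(-x / 2) cancels in the ratio."""
    weights = [math.exp(-x * n) for n in range(n_max)]
    partition = sum(weights)
    return sum(n * w for n, w in enumerate(weights)) / partition

def mean_excitation_closed(x):
    """Closed form (23.10): 1 / (exp(x) - 1)."""
    return 1.0 / math.expm1(x)

for x in (0.1, 1.0, 3.0):
    print(x, mean_excitation_direct(x), mean_excitation_closed(x))
```

For small x (high temperature) the mean excitation number is large, close to $k_B T/\hbar\omega$; for large x it falls off exponentially, which is the origin of the freezing out of high-frequency modes at low temperature.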

Thus the simple classical expression for the energy density of a harmonic crystal
at temperature T, Eq. (22.18), must be generalized to3

$$u = u^{\mathrm{eq}} + \frac{1}{V} \sum_{\mathbf{k}s} \tfrac{1}{2} \hbar\omega_s(\mathbf{k}) + \frac{1}{V} \sum_{\mathbf{k}s} \frac{\hbar\omega_s(\mathbf{k})}{e^{\beta\hbar\omega_s(\mathbf{k})} - 1}. \tag{23.11}$$

As $T \to 0$ the third term vanishes, but, in contrast with the classical result (22.18),
there remains not only the energy $u^{\mathrm{eq}}$ of the equilibrium configuration, but also a
second term, giving the energy of the zero-point vibrations of the normal modes. All
the temperature dependence of u (and hence the entire contribution to the specific
heat) comes from the third term, whose variation with temperature is far more complex
than the simple linear form of the classical result. In the quantum theory of the
harmonic solid the specific heat is no longer constant, but is given by

$$c_v = \frac{1}{V} \sum_{\mathbf{k}s} \frac{\partial}{\partial T} \frac{\hbar\omega_s(\mathbf{k})}{e^{\beta\hbar\omega_s(\mathbf{k})} - 1}, \tag{23.12}$$

which now depends in a detailed way on the frequency spectrum of the normal modes.

Certain general features of the specific heat (23.12) emerge in limiting cases, which
we now examine.


2 Those familiar with the ideal Bose gas will recognize Eq. (23.10) as a special case of the Bose-Einstein
distribution function, giving the number of bosons with energy $\hbar\omega_s(\mathbf{k})$ in thermal equilibrium at temperature
T, when the chemical potential $\mu$ is taken to be zero. The lack of freedom in selecting $\mu$ comes from the
fact that the total number of bosons in thermal equilibrium is not an independent variable at our disposal
in the case of phonons (as it is, for example, in the case of $^4$He atoms) but is entirely determined by the
temperature.

3 For comparison with Eq. (22.18) we reintroduce the constant $u^{\mathrm{eq}}$ giving the potential energy of the
static equilibrium distribution.

HIGH-TEMPERATURE SPECIFIC HEAT

When $k_B T/\hbar$ is large compared with all the phonon frequencies (i.e., when every
normal mode is in a highly excited state), then the argument of the exponential will
be small in each term of (23.12) and we can expand:

$$\frac{1}{e^x - 1} = \frac{1}{x + \frac{1}{2}x^2 + \frac{1}{6}x^3 + \cdots} = \frac{1}{x} \left[1 - \frac{x}{2} + \frac{x^2}{12} + O(x^3)\right], \qquad x = \frac{\hbar\omega}{k_B T} \ll 1. \tag{23.13}$$

If we keep only the leading term in this expansion, then the quantity differentiated in
each summand of (23.12) reduces to $k_B T$, so that each summand reduces to the constant
$k_B$, and the specific heat reduces to $k_B$ times the density of
normal modes, 3N/V. This is just the classical law of Dulong and Petit (22.19).

Additional terms in the expansion (23.13) yield the high-temperature quantum
corrections to the Dulong and Petit law. The linear term in x (in the square brackets)
gives a temperature-independent term in the thermal energy (which is precisely equal
to minus the zero-point energy) and therefore does not affect the specific heat. The
leading correction is thus given by the term in square brackets quadratic in x. When
this is substituted into (23.12) it gives a correction to the Dulong and Petit specific
heat $c_v^0$ of the form:

$$c_v = c_v^0 + \Delta c_v, \qquad \frac{\Delta c_v}{c_v^0} = -\frac{\hbar^2}{12 (k_B T)^2} \frac{1}{3N} \sum_{\mathbf{k}s} \omega_s(\mathbf{k})^2. \tag{23.14}$$

At temperatures high enough for this expansion to be valid, the anharmonic
corrections to the classical specific heat, which are not included in the Dulong and
Petit value,4 are likely to be of significance, and will tend to mask the quantum
correction (23.14).5


LOW-TEMPERATURE  SPECIFIC  HEAT 

To  discuss  the  specific  heat  more  generally,  we  first  observe  that  in  the  limit  of  a 
large  crystal  the  set  of  discrete  wave  vectors  summed  over  in  Eq.  (23.12)  becomes 
dense on the scale over which the summand has appreciable variation. We can therefore
replace the sum by an integral according to the general prescription (2.29) for any
set of wave vectors satisfying the Born-von Karman boundary conditions, rewriting
(23.12) as

$$c_v = \frac{\partial}{\partial T}\sum_s \int \frac{d\mathbf{k}}{(2\pi)^3}\,\frac{\hbar\omega_s(\mathbf{k})}{e^{\hbar\omega_s(\mathbf{k})/k_BT} - 1}, \tag{23.15}$$

where  the  integral  may  be  taken  over  the  first  Brillouin  zone. 

At very low temperatures, modes with $\hbar\omega_s(\mathbf{k}) \gg k_BT$ will contribute negligibly to
(23.15), since the integrand will vanish exponentially. However, because $\omega_s(\mathbf{k}) \to 0$ as
$k \to 0$ in the three acoustic branches, this condition will fail to be satisfied by acoustic
modes of sufficiently long wavelength, no matter how low the temperature. These
modes (and only these) will continue to contribute appreciably to the specific heat.
Bearing this in mind, we can make the following simplifications in (23.15), all of which
result in a vanishingly small fractional error in the zero-temperature limit:


1. Even if the crystal has a polyatomic basis, we may ignore the optical modes in
the sum over s, since their frequencies are bounded below.6

2. We may replace the dispersion relation $\omega = \omega_s(\mathbf{k})$ for the three acoustic branches
by its long-wavelength form (22.65), $\omega = c_s(\hat{\mathbf{k}})k$. This will be valid provided that
$k_BT/\hbar$ is substantially less than those frequencies at which the acoustic dispersion
curves begin to differ appreciably from their long-wavelength linear forms.

3. We may replace the k-space integration over the first Brillouin zone by an integration
over all of k-space. This is because the integrand is negligibly small unless
$\hbar c_s(\hat{\mathbf{k}})k$ is of order $k_BT$, which happens only in the immediate vicinity of $k = 0$
at low temperatures.

4 See point 2, page 428, and the discussion following it.

5 Indeed, at such high temperatures real crystals are likely to have melted, a rather extreme form
of anharmonic behavior.

These  three  simplifications  are  illustrated  in  Figure  23.1. 


Figure 23.1
The simplifications that can be made in evaluating the low-temperature specific heat of a harmonic
crystal. (a) Typical normal-mode dispersion relations for a diatomic crystal along a particular
direction in k-space (taken to be a direction of high enough symmetry for two of the acoustic and
two of the optical branches to be degenerate). (b) The spectrum that replaces (a) in evaluating the
integral (23.15). The acoustic branches are replaced by linear branches, extending over all k (i.e.,
the integral is extended from the first zone to all of k-space) and the optical branches are ignored.
This is justified because frequencies large compared with $k_BT/\hbar$ (those parts of the dispersion
curves in (a) and (b) above the horizontal dashed line) make negligible contributions to (23.15),
and because the parts of the dispersion curves describing modes that do contribute (the parts
below the horizontal dashed line) are identical in (a) and in (b).


Thus at very low temperatures (23.15) may be simplified to

$$c_v = \frac{\partial}{\partial T}\sum_s \int \frac{d\mathbf{k}}{(2\pi)^3}\,\frac{\hbar c_s(\hat{\mathbf{k}})k}{e^{\hbar c_s(\hat{\mathbf{k}})k/k_BT} - 1}, \tag{23.16}$$

where the integral is over all of k-space. We evaluate the integral in spherical coordinates,
writing $d\mathbf{k} = k^2\,dk\,d\Omega$. If we make the change of variables $\beta\hbar c_s(\hat{\mathbf{k}})k = x$ in
the k integration, then (23.16) becomes

$$c_v = \frac{\partial}{\partial T}\,\frac{(k_BT)^4}{(\hbar c)^3}\,\frac{3}{2\pi^2}\int_0^\infty \frac{x^3\,dx}{e^x - 1}, \tag{23.17}$$

where $1/c^3$ is the average of the inverse third power of the long-wavelength phase
velocities of the three acoustic modes:

$$\frac{1}{c^3} = \frac{1}{3}\sum_s \int \frac{d\Omega}{4\pi}\,\frac{1}{c_s(\hat{\mathbf{k}})^3}. \tag{23.18}$$

The definite integral in (23.17) can be evaluated by writing7

$$\int_0^\infty \frac{x^3\,dx}{e^x - 1} = \sum_{n=1}^\infty \int_0^\infty x^3 e^{-nx}\,dx = 6\sum_{n=1}^\infty \frac{1}{n^4} = \frac{\pi^4}{15}. \tag{23.19}$$

Therefore, at very low temperatures8

$$c_v \approx \frac{\partial}{\partial T}\,\frac{\pi^2}{10}\,\frac{(k_BT)^4}{(\hbar c)^3} = \frac{2\pi^2}{5}\,k_B\left(\frac{k_BT}{\hbar c}\right)^3. \tag{23.20}$$

6 Under certain special conditions (usually associated with an imminent change in crystal structure)
an optical branch may dip down almost to zero frequency (acquiring what is known as a “soft mode”).
When this happens, there will be an additional contribution to the low-temperature specific heat from
that optical branch.


This  relation  can  be  checked  by  comparing  the  measured  low-temperature  specific 
heat  with  the  measured  elastic  constants,  which  are  directly  related  to  the  phase 
velocities  appearing  in  the  definition  (23.18)  of  c.  In  the  alkali  halides,  for  example, 
the  disagreement  has  been  found  to  be  less  than  the  experimental  errors  in  the 
measurements  (typically  about  a  percent).9 
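The value $\pi^4/15$ in (23.19) and the coefficient $2\pi^2/5$ in (23.20) are easy to confirm numerically. The sketch below sums the series $6\sum 1/n^4$ and, independently, integrates $x^3/(e^x - 1)$ with a simple midpoint rule; the cutoff at $x = 60$, the step count, and the function names are arbitrary choices made for this illustration.

```python
import math

def bose_integral_series(terms=2000):
    """Truncated series 6 * sum_{n>=1} 1/n^4 from (23.19)."""
    return 6.0 * sum(1.0 / n**4 for n in range(1, terms + 1))

def bose_integral_quadrature(upper=60.0, steps=60000):
    """Midpoint-rule estimate of the integral of x^3/(e^x - 1) from 0 to `upper`."""
    h = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += x**3 / math.expm1(x)
    return total * h

exact = math.pi**4 / 15.0
print(bose_integral_series(), bose_integral_quadrature(), exact)

# Differentiating (k_B T)^4 in (23.17) brings down a factor 4 k_B, so the
# coefficient of k_B (k_B T / hbar c)^3 is 4 * (3 / 2 pi^2) * (pi^4 / 15).
coefficient = 4.0 * (3.0 / (2.0 * math.pi**2)) * exact
print(coefficient, 2.0 * math.pi**2 / 5.0)
```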

Since (23.20) remains valid only as long as $k_BT/\hbar$ is small compared with all phonon
frequencies that are not in the linear part of the spectrum, we might expect that this
would require $k_BT/\hbar$ to be a small fraction of the frequencies near the zone edge. This
requires T to be well below room temperature. Since the Dulong and Petit law starts
to fail as the temperature drops below room temperature, there is a considerable
temperature range over which neither the high- nor low-temperature evaluations of
the specific heat are valid, and one must work with the general form (23.15). However,
it is quite common to use an approximate interpolation scheme for this intermediate
temperature range.


INTERMEDIATE-TEMPERATURE  SPECIFIC  HEAT: 

THE  MODELS  OF  DEBYE  AND  EINSTEIN 

The earliest quantum theories of lattice specific heats, due to Einstein and Debye, did
not use phonon spectra of the general form we have been considering, but assumed
normal-mode dispersion relations of an especially simple structure. Their results,
based on crude approximations to the normal-mode dispersion relations, are still of
use as interpolation formulas. Debye's theory has had a considerable impact on the
nomenclature of the subject, and even on the way in which data are presented.

7 See also Appendix C, Eqs. (C.11) to (C.13).

8 We reemphasize that this result becomes asymptotically exact (within the harmonic approximation)
as $T \to 0$; i.e., it can be written as an equality:

$$\lim_{T\to 0}\frac{c_v}{T^3} = \frac{2\pi^2}{5}\,\frac{k_B^4}{\hbar^3c^3}.$$

9 J. T. Lewis et al., Phys. Rev. 161, 877 (1967).


The  Debye  Interpolation  Scheme 

The Debye model replaces all branches of the vibrational spectrum with three
branches, each with the same linear dispersion relation:10

$$\omega = ck. \tag{23.21}$$

In addition, the integral in (23.15) over the first Brillouin zone is replaced by an
integral over a sphere of radius $k_D$, chosen to contain precisely N allowed wave
vectors, where N is the number of ions in the crystal. Since the volume of k-space
per wave vector is $(2\pi)^3/V$ (see page 37), this requires $(2\pi)^3N/V$ to equal $4\pi k_D^3/3$,
so that $k_D$ is determined by the relation11

$$n = \frac{k_D^3}{6\pi^2}. \tag{23.22}$$

As a result of these simplifications Eq. (23.15) reduces to

$$c_v = \frac{\partial}{\partial T}\,\frac{3\hbar c}{2\pi^2}\int_0^{k_D}\frac{k^3\,dk}{e^{\beta\hbar ck} - 1}. \tag{23.23}$$

In evaluating the integral (23.23) it is convenient to define a Debye frequency by

$$\omega_D = k_Dc \tag{23.24}$$

and a Debye temperature by

$$k_B\Theta_D = \hbar\omega_D = \hbar ck_D. \tag{23.25}$$


Evidently $k_D$ is a measure of the inverse interparticle spacing, $\omega_D$ is a measure of the
maximum phonon frequency, and $\Theta_D$ is a measure of the temperature above which
all modes begin to be excited, and below which modes begin to be “frozen out.”12
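To get a feel for the magnitudes involved, the definitions (23.22), (23.24), and (23.25) can be evaluated directly. The sketch below does so for a sound velocity and ionic density that are hypothetical round numbers chosen only for illustration (they are not data from the text); the function names are likewise illustrative.

```python
import math

HBAR = 1.054571817e-34   # J s
K_B = 1.380649e-23       # J / K

def debye_wave_vector(n):
    """k_D from n = k_D^3 / (6 pi^2), Eq. (23.22); n in m^-3."""
    return (6.0 * math.pi**2 * n) ** (1.0 / 3.0)

def debye_temperature(c, n):
    """Theta_D from k_B Theta_D = hbar c k_D, Eq. (23.25); c in m/s."""
    return HBAR * c * debye_wave_vector(n) / K_B

# Hypothetical inputs, chosen only for illustration (not data from the text):
c_sound = 5.0e3      # effective sound velocity, m/s
n_ions = 5.0e28      # ionic density, m^-3

k_d = debye_wave_vector(n_ions)
theta_d = debye_temperature(c_sound, n_ions)
print(f"k_D     = {k_d:.3e} m^-1")
print(f"omega_D = {c_sound * k_d:.3e} rad/s")
print(f"Theta_D = {theta_d:.0f} K")
```

With these inputs the Debye temperature comes out at a few hundred kelvin, the same order as the tabulated values for typical solids.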


If we make the change of variables $\hbar ck/k_BT = x$, then (23.23) can be written in
terms of the Debye temperature:

$$c_v = 9nk_B\left(\frac{T}{\Theta_D}\right)^3\int_0^{\Theta_D/T}\frac{x^4e^x\,dx}{(e^x - 1)^2}. \tag{23.26}$$

10 In the case of a lattice with a polyatomic basis, the representation of the 3p branches of the phonon
spectrum by only three is compensated by the volume of the Debye sphere being p times the volume of
the first Brillouin zone. This point is elaborated in the discussion of the Einstein model.

11 When, in applications to metals, there is a danger of confusing the density of ions with the density
of conduction electrons, we shall denote the former by $n_i$ and the latter by $n_e$. The two are related by
$n_e = Zn_i$, where Z is the nominal valence. Since the free electron Fermi wave vector $k_F$ satisfies
$k_F^3/3\pi^2 = n_e$, $k_D$ is related to $k_F$ in a metal by $k_D = (2/Z)^{1/3}k_F$.

12 One can also regard $\Theta_D$ and $\omega_D$ as measures of the “stiffness” of the crystal.


This formula expresses the specific heat at all temperatures in terms of a single
empirical parameter, $\Theta_D$. One reasonable way to pick $\Theta_D$ (though it is by no means
the only way used) is by making (23.26) agree with the observed specific heat at low
temperatures. This will be assured (at least in the harmonic approximation) if the
speed c in (23.21) or (23.25) is related to the exact phonon spectrum through (23.18).
The resulting form for the low-temperature specific heat is13

$$c_v = \frac{12\pi^4}{5}\,nk_B\left(\frac{T}{\Theta_D}\right)^3 = 234\left(\frac{T}{\Theta_D}\right)^3 nk_B. \tag{23.27}$$

Values of $\Theta_D$ for some of the alkali halides, determined by fitting the $T^3$ term in
their low-temperature specific heats, are quoted in Table 23.1.


Table 23.1
DEBYE TEMPERATURES FOR THE ALKALI HALIDE CRYSTALS (a)

          F      Cl     Br     I
Li       730    422     -      -
Na       492    321    224    164
K        336    231    173    131
Rb        -     165    131    103

(a) Given in degrees kelvin. All values were obtained by comparing the
constant in the $T^3$ fit to the low-temperature specific heat to Eq.
(23.27), except for NaF, KF, and NaBr, where $\Theta_D$ was deduced from
the measured elastic constants through (23.18) and (23.25). (In cases
where values were obtained by both methods, they agree to within a
percent or two, which is about the size of the experimental uncertainty
in the numbers.)

Source: J. T. Lewis et al., Phys. Rev. 161, 877 (1967).


Unfortunately $\Theta_D$ is not always chosen by this convention. Partly because the
Debye result (23.26) was viewed by some as far more general than a rough interpolation
formula, the practice arose of fitting observed heat capacities with (23.26)
by allowing $\Theta_D$ to depend on temperature. There is no good reason for doing this,
but the practice has persisted down to the present day, to the extent that results
of heat-capacity measurements are sometimes reported in terms of $\Theta_D(T)$ rather than
the data themselves.14 It is helpful, in converting such information back to specific
heats, to have a graph of the Debye $c_v$ as a function of $\Theta_D/T$. This is shown in Figure
23.3, and some numerical values of the function are given in Table 23.2. In Table 23.3
we give some Debye temperatures for selected elements that were determined by
fitting the observed heat capacity with the Debye formula (23.26) at the point where
the heat capacity is about half the Dulong and Petit value.

Figure 23.2
Debye temperature as a function of temperature for argon and krypton. This is a widely
used way of presenting specific heat data. (Horizontal axis: $T/\Theta_D(0)$, from 0 to 0.20.)
(L. Finegold and N. Phillips, Phys. Rev. 177, 1383 (1969).)

Figure 23.3
Specific heat in the Debye approximation (in cal/mole-K) vs. $T/\Theta_D$. (From J. de Launay,
op. cit.; see Table 23.2.)

13 This can be derived directly from (23.26) by noting that for $T \ll \Theta_D$ the upper limit of the integral
can be extended to infinity with exponentially small error. It is also equivalent to the exact result (23.20),
provided c is eliminated in favor of $\Theta_D$ and the ionic density via (23.22) and (23.25).

Note that at temperatures well above $\Theta_D$ we can replace the integrand in (23.26)
by its form for small x, and the Dulong and Petit result emerges. (This is to be expected,
since it was built into the formula through the definition of $k_D$.) Thus the Debye
temperature plays the same role in the theory of lattice vibrations as the Fermi
temperature plays in the theory of electrons in metals: Both are a measure of the
temperature separating the low-temperature region where quantum statistics must
be used from the high-temperature region where classical statistical mechanics is
valid. However, in the electronic case actual temperatures are always well below $T_F$,
whereas $\Theta_D$ (see Table 23.3) is typically of the order of $10^2$ K, so both classical and
quantum regimes can be encountered.


14 See, for example, Figure 23.2.

Table 23.2
TEMPERATURE DEPENDENCE OF THE DEBYE SPECIFIC HEAT (a)

$T/\Theta_D$   $c_v/3nk_B$     $T/\Theta_D$   $c_v/3nk_B$     $T/\Theta_D$   $c_v/3nk_B$
0.00           0               0.35           0.687           0.70           0.905
0.05           0.00974         0.40           0.746           0.75           0.917
0.10           0.0758          0.45           0.791           0.80           0.926
0.15           0.213           0.50           0.825           0.85           0.934
0.20           0.369           0.55           0.852           0.90           0.941
0.25           0.503           0.60           0.874           0.95           0.947
0.30           0.608           0.65           0.891           1.00           0.952

(a) The table entries are the ratios of the Debye to the Dulong-Petit specific heats,
that is, $c_v/3nk_B$, with $c_v$ given by (23.26).

Source: J. de Launay, Solid State Physics, vol. 2, F. Seitz and D. Turnbull, eds., Academic
Press, New York, 1956.
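The entries of Table 23.2 can be regenerated by evaluating the integral in (23.26) numerically. The following is a minimal sketch, assuming a plain midpoint rule is accurate enough for three significant figures; the function name `debye_ratio` and the step count are choices made here, not part of the source.

```python
import math

def debye_ratio(t, steps=20000):
    """c_v / (3 n k_B) from (23.26) at reduced temperature t = T / Theta_D."""
    if t <= 0.0:
        return 0.0
    upper = 1.0 / t                      # upper limit Theta_D / T
    h = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h                # midpoints avoid the 0/0 form at x = 0
        em = math.exp(-x)
        denom = -math.expm1(-x)          # 1 - e^{-x}, accurate for small x
        # x^4 e^x / (e^x - 1)^2 rewritten as x^4 e^{-x} / (1 - e^{-x})^2
        total += x**4 * em / (denom * denom)
    return 3.0 * t**3 * total * h

for t in (0.05, 0.10, 0.25, 0.50, 1.00):
    print(f"T/Theta_D = {t:4.2f}   c_v/3nk_B = {debye_ratio(t):.4f}")
```

Writing the integrand in terms of $e^{-x}$ keeps the evaluation free of overflow when $\Theta_D/T$ is large, which is the regime where the $T^3$ law (23.27) takes over.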


Table 23.3
DEBYE TEMPERATURES FOR SELECTED ELEMENTS (a)

ELEMENT        $\Theta_D$ (K)     ELEMENT    $\Theta_D$ (K)
Li              400               A            85
Na              150               Ne           63
K               100               Cu          315
Be             1000               Ag          215
Mg              318               Au          170
Ca              230               Zn          234
B              1250               Cd          120
Al              394               Hg          100
Ga              240               Cr          460
In              129               Mo          380
Tl               96               W           310
C (diamond)    1860               Mn          400
Si              625               Fe          420
Ge              360               Co          385
Sn (grey)       260               Ni          375
Sn (white)      170               Pd          275
Pb               88               Pt          230
As              285               La          132
Sb              200               Gd          152
Bi              120               Pr           74

(a) The temperatures were determined by fitting the observed specific
heats $c_v$ to the Debye formula (23.26) at the point where $c_v = 3nk_B/2$.
Source: J. de Launay, Solid State Physics, vol. 2, F. Seitz and D. Turnbull,
eds., Academic Press, New York, 1956.


The Einstein Model

In the Debye model of a crystal with a polyatomic basis, the optical branches of the
spectrum are represented by the high k values of the same linear expression (23.21)
whose low k values give the acoustic branch (Figure 23.4a). An alternative scheme
is to apply the Debye model only to the three acoustic branches of the spectrum.
The optical branches are represented by the “Einstein approximation,” which replaces
the frequency of each optical branch by a frequency $\omega_E$ that does not depend on k
(see Figure 23.4b). The density n in (23.22), (23.26), and (23.27) must then be taken
as the number of primitive cells per unit volume of crystal, and (23.26) will give only
the contribution of the acoustic branches to the specific heat.15 Each optical branch
will contribute

$$\frac{n\hbar\omega_E}{e^{\hbar\omega_E/k_BT} - 1} \tag{23.28}$$

to the thermal energy density in the Einstein approximation, so if there are p such
branches there will be an additional term

$$c_v^{\text{optical}} = pnk_B\,\frac{(\hbar\omega_E/k_BT)^2\,e^{\hbar\omega_E/k_BT}}{\left(e^{\hbar\omega_E/k_BT} - 1\right)^2} \tag{23.29}$$

in the specific heat.16

Figure 23.4
Two different ways of approximating the acoustic and optical branches of a diatomic crystal
(illustrated in two dimensions along a line of symmetry). (a) The Debye approximation. The first
two zones of the square lattice are replaced by a circle with the same total area, and the entire
spectrum is replaced by a linear one within the circle. (b) Debye approximation for the acoustic
branch and Einstein approximation for the optical branch. The first zone is replaced by a circle with
the same area, the acoustic branch is replaced by a linear branch within the circle, and the optical
branch is replaced by a constant branch within the circle.

The characteristic features of the Einstein term (23.29) are that (a) well above the
Einstein temperature $\Theta_E = \hbar\omega_E/k_B$ each optical mode simply contributes the constant
$k_B/V$ to the specific heat, as required by the classical law of Dulong and Petit, and
(b) at temperatures well below the Einstein temperature the contribution of the optical
modes to the specific heat drops exponentially, reflecting the difficulty in thermally
exciting any optical modes at low temperatures.
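Both limits of the Einstein term can be seen numerically. The sketch below evaluates the dimensionless factor in (23.29), that is, the specific heat per optical mode in units of $k_B$, as a function of $T/\Theta_E$; the function name is illustrative.

```python
import math

def einstein_ratio(t):
    """Specific heat per optical mode in units of k_B, from (23.29), at t = T / Theta_E."""
    x = 1.0 / t                          # hbar * omega_E / (k_B T)
    em = math.exp(-x)
    denom = -math.expm1(-x)              # 1 - e^{-x}
    # x^2 e^x / (e^x - 1)^2 rewritten with e^{-x} to avoid overflow at low T
    return x * x * em / (denom * denom)

for t in (10.0, 1.0, 0.1, 0.05):
    print(f"T/Theta_E = {t:5.2f}   c per mode / k_B = {einstein_ratio(t):.3e}")
```

Well above $\Theta_E$ the factor approaches 1 (the Dulong and Petit value), while well below $\Theta_E$ it falls off as $(\Theta_E/T)^2 e^{-\Theta_E/T}$.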


Figure 23.5
A comparison of the Debye and Einstein approximations to the specific heat of an insulating
crystal. $\Theta$ is either the Debye or the Einstein temperature, depending on which curve is being
examined. Both curves are normalized to approach the Dulong and Petit value of 5.96
cal/mole-K at high temperatures. In fitting to a solid with an m-ion basis, the Einstein curve
should be given m - 1 times the weight of the Debye one. (From J. de Launay, op. cit.; see
Table 23.2.)


COMPARISON  OF  LATTICE  AND  ELECTRONIC  SPECIFIC  HEATS 

It is useful to have a measure of the temperature at which the specific heat of a
metal ceases to be dominated by the electronic contribution (linear in T) rather than
the contribution from the lattice vibrations (cubic in T). If we divide the electronic
contribution to the specific heat (Eq. (2.81)) by the low-temperature form (23.27) of
the phonon contribution, and note that the electronic density is Z times the density
of ions, where Z is the nominal valence, we find

$$\frac{c_v^{\text{el}}}{c_v^{\text{ph}}} = \frac{5}{24\pi^2}\,Z\,\frac{\Theta_D^3}{T^2T_F}. \tag{23.30}$$

Thus the phonon contribution begins to exceed the electronic contribution at a
temperature $T_0$, given by

$$T_0 = 0.145\left(\frac{Z\Theta_D}{T_F}\right)^{1/2}\Theta_D. \tag{23.31}$$

15 Note that in Eq. (23.27) for the low-temperature specific heat this redefinition of n is precisely
compensated by the redefinition of $\Theta_D$, so that the coefficient of $T^3$ is unchanged. This reflects the fact
that the optical branches do not contribute to the low-temperature specific heat, the form of which must
therefore be independent of how they are treated.

16 The first application of quantum mechanics to the theory of the specific heats of solids was made
by Einstein, who proposed a total specific heat of the form (23.29). Although this form did produce the
observed decline from the high-temperature form of Dulong and Petit, it fell far too rapidly to zero at
very low temperatures (see Figure 23.5). Debye subsequently observed that because a solid could support
elastic waves of very long wavelength, and hence very low frequency, the picture of a solid as a set of
identical oscillators, on which Einstein's formula was based, could not be correct. Nevertheless, the Einstein
model does apply fairly well to the contribution of a relatively narrow optical branch to the specific heat,
and in that role the model is still used.


Since Debye temperatures are of the order of room temperature, while Fermi temperatures
are several tens of thousands of degrees Kelvin, the temperature $T_0$ is
typically a few percent of the Debye temperature, i.e., a few degrees Kelvin. This
explains why the linear term in the heat capacity of metals is only observed at low
temperatures.
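As a worked instance of (23.31), the sketch below estimates the crossover temperature for copper. The Debye temperature is taken from Table 23.3; the Fermi temperature is an assumed free-electron value of roughly $8.2 \times 10^4$ K that does not appear in this section, so the result should be read as an order-of-magnitude illustration only.

```python
import math

def crossover_temperature(theta_d, t_fermi, z):
    """T_0 from (23.30)-(23.31): sqrt(5 Z Theta_D^3 / (24 pi^2 T_F)), in kelvin."""
    return math.sqrt(5.0 * z * theta_d**3 / (24.0 * math.pi**2 * t_fermi))

# Numerical prefactor quoted in (23.31); evaluates to about 0.145.
prefactor = math.sqrt(5.0 / (24.0 * math.pi**2))
print(f"prefactor = {prefactor:.4f}")

# Copper: Theta_D = 315 K from Table 23.3. The Fermi temperature is an
# assumed free-electron estimate, not a number taken from this section.
t0 = crossover_temperature(theta_d=315.0, t_fermi=8.2e4, z=1)
print(f"T_0 is roughly {t0:.1f} K")
```

The estimate is a few kelvin, consistent with the statement that the linear electronic term is visible only at low temperatures.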


DENSITY  OF  NORMAL  MODES  (PHONON-LEVEL  DENSITY) 

One frequently encounters lattice properties that, like the specific heat (23.15), are
of the form

$$\frac{1}{V}\sum_{\mathbf{k},s}Q(\omega_s(\mathbf{k})) = \sum_s\int\frac{d\mathbf{k}}{(2\pi)^3}\,Q(\omega_s(\mathbf{k})). \tag{23.32}$$

It is often convenient to reduce such quantities to frequency integrals, by introducing
a density of normal modes per unit volume,17 $g(\omega)$, defined so that $g(\omega)\,d\omega$ is the
total number of modes with frequencies in the infinitesimal range between $\omega$ and
$\omega + d\omega$, divided by the total volume of the crystal. In terms of g, the sum or integral
in (23.32) takes on the form

$$\int d\omega\,g(\omega)\,Q(\omega). \tag{23.33}$$

By comparing (23.33) with (23.32) it is clear that the density of normal modes can
be represented in the form

$$g(\omega) = \sum_s\int\frac{d\mathbf{k}}{(2\pi)^3}\,\delta(\omega - \omega_s(\mathbf{k})). \tag{23.34}$$

The density of normal modes is also called the phonon density of levels, for if we
describe the lattice in the phonon rather than the normal-mode language, each normal
mode corresponds to a possible level for a single phonon.

By following precisely the same steps that led to the representation (8.63) for the
electron density of levels, one can represent the phonon density of levels in the
alternative form

$$g(\omega) = \sum_s\int\frac{dS}{(2\pi)^3}\,\frac{1}{|\nabla\omega_s(\mathbf{k})|}, \tag{23.35}$$

where the integral is over that surface in the first zone on which $\omega_s(\mathbf{k}) = \omega$. Just as
in the electronic case, because $\omega_s(\mathbf{k})$ is periodic there will be a structure of singularities
in $g(\omega)$, reflecting the fact that the group velocity appearing in the denominator of
(23.35) must vanish at some frequencies. As in the electronic case, the singularities
are known as van Hove singularities.18 A typical density of levels displaying these
singularities is shown in Figure 23.6, and a concrete illustration of how the singularities
arise in the linear chain is given in Problem 3.

17 Compare the quite analogous discussion of the electronic density of levels on pages 143-145.
Generally $g(\omega)$ is taken to give the contributions of all branches of the phonon spectrum, but one can also
define separate $g_s(\omega)$ for each branch.


Figure  23.6 

Phonon  density  of  levels  in  aluminum, 
as  deduced  from  neutron  scattering 
data  (Chapter  24).  The  highest  curve 
is  the  full  density  of  levels.  Separate 
level  densities  for  the  three  branches 
are  also  shown.  (After  R.  Stedman, 
L.  Almqvist,  and  G.  Nilsson,  Phys. 
Rev.  162, 549  (1967).) 


The  Debye  approximation  and  its  limitations  are  quite  compactly  expressed  in 
terms  of  the  density  of  levels.  If  all  three  branches  of  the  spectrum  have  the  linear 
dispersion  relation  (23.21),  and  if  the  wave  vectors  of  the  normal  modes  are  assumed 
to  lie  within  a  sphere  of  radius  kD  rather  than  the  first  Brillouin  zone,  then  (23.34) 
becomes  simply: 


$$g_D(\omega) = 3\int_{k<k_D}\frac{d\mathbf{k}}{(2\pi)^3}\,\delta(\omega - ck) = \frac{3}{2\pi^2}\int_0^{k_D}k^2\,dk\,\delta(\omega - ck) = \begin{cases}\dfrac{3}{2\pi^2}\,\dfrac{\omega^2}{c^3}, & \omega < \omega_D = k_Dc;\\[1ex] 0, & \omega > \omega_D.\end{cases} \tag{23.36}$$

18 The singularities were in fact first noted in the context of the theory of lattice vibrations.

This simple parabolic behavior is clearly a rather gross approximation to the form
characteristic of real solids (Figure 23.6). The choice of $k_D$ does guarantee that the
area under the curve $g_D(\omega)$ will be the same as under the correct curve, and if, in
addition, the velocity c is chosen according to (23.18), then the curves will be in
agreement in the neighborhood of $\omega = 0$. The former property is sufficient to produce
the Dulong and Petit law at high temperatures, and the latter guarantees the correct
specific heat at low temperatures.19
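The statement about the area under the Debye curve can be verified in one line. The following worked step is a sketch that uses only the parabolic form of $g_D(\omega)$ in (23.36), the definition $\omega_D = k_Dc$, and the relation $n = k_D^3/6\pi^2$ of (23.22); it assumes a monatomic Bravais lattice, for which there are $3N$ normal modes in all.

```latex
\int_0^{\omega_D} g_D(\omega)\,d\omega
  = \frac{3}{2\pi^2 c^3}\int_0^{\omega_D}\omega^2\,d\omega
  = \frac{\omega_D^3}{2\pi^2 c^3}
  = \frac{k_D^3}{2\pi^2}
  = 3\,\frac{k_D^3}{6\pi^2}
  = 3n .
```

The total is $3n$, the number of normal modes per unit volume, so a quantity such as the classical thermal energy, which assigns $k_BT$ to every mode, comes out correctly regardless of the detailed shape of the spectrum.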

In a similar way, an Einstein model for an optical branch amounts to the approximation:

$$g_E(\omega) = \int_{\text{zone}}\frac{d\mathbf{k}}{(2\pi)^3}\,\delta(\omega - \omega_E) = n\,\delta(\omega - \omega_E), \tag{23.37}$$


which  can  be  expected  to  give  reasonable  results  provided  that  the  frequency  variation 
of  the  property  Q  being  calculated  is  not  appreciable  over  the  width  of  the  actual 
optical  branch. 


ANALOGY  WITH  THE  THEORY  OF  BLACKBODY  RADIATION 

The photon-phonon analogy we described on page 453 carries over to a correspondence
between the theory of electromagnetic radiation in thermal equilibrium (so-called
blackbody radiation) and the theory of the vibrational energy of a solid that
we have just discussed. Both subjects were a source of mystery in the context of the
classical physics prevailing at the turn of the century. The failure of the Dulong and
Petit law to explain the very low specific heats of solids at low temperatures was
mirrored by the failure of the classical theory to predict an energy density for
blackbody radiation that did not yield an infinite answer when summed over all
frequencies (the Rayleigh-Jeans catastrophe). In both cases the problem came from
the classical result that all normal modes should contribute $k_BT$ to the energy. The
Dulong and Petit law was saved from the self-contradiction afflicting the corresponding
result for the radiation field because the discrete nature of the solid allowed
for only a finite number of degrees of freedom. The two theories are compared in
Table 23.4.


Table 23.4
A COMPARISON OF PHONONS AND PHOTONS

                        PHONONS                                   PHOTONS

Number of               3p modes for each k,                      Two modes for each k,
normal modes            $\omega = \omega_s(\mathbf{k})$           $\omega = ck$ ($c \approx 3 \times 10^{10}$ cm/sec)

Restriction on          k confined to                             k arbitrary
wave vector             first Brillouin zone

Thermal                 $\sum_s\int\frac{d\mathbf{k}}{(2\pi)^3}\frac{\hbar\omega_s(\mathbf{k})}{e^{\beta\hbar\omega_s(\mathbf{k})} - 1}$     $2\int\frac{d\mathbf{k}}{(2\pi)^3}\frac{\hbar ck}{e^{\beta\hbar ck} - 1}$
energy density          (integral over first Brillouin zone)      (integral over all k)

19 One can somewhat improve the overall fit with a refinement of the Debye model that uses three
different sound velocities for the three branches.

Because of the simple general form for the photon dispersion relation, the exact
expression for the thermal energy of blackbody radiation is very similar to the Debye
approximation to the thermal energy of a harmonic crystal. The differences are:

1. The sound velocity is replaced by the velocity of light.

2. The formula for blackbody radiation has an extra factor of 2/3, corresponding to
the fact that there are only two branches to the photon spectrum (electromagnetic
radiation must be transverse: there is no longitudinal branch).

3. The upper limit of the integral is not $k_D$ but $\infty$, since there is no restriction on
the maximum allowed photon wave vector.

Point 3 means that the formulas for blackbody radiation always assume the form
appropriate to the extreme low-temperature limit in the crystalline case. This is
reasonable, since no matter what the temperature the vast majority
of the normal modes of the radiation field will have $\hbar ck$ greater than $k_BT$. Together
with the exact linearity in k of the photon dispersion relation, this means that we
are always in the region where the heat capacity is rigorously cubic. As a result, we
can read off the exact thermal energy density for blackbody radiation from Eq. (23.20)
for the low-temperature specific heat $c_v = \partial u/\partial T$ due to lattice vibrations, by interpreting
c to be the velocity of light and multiplying by 2/3 (to remove the contribution
of the longitudinal acoustic branch). The result is the Stefan-Boltzmann law:

$$u = \frac{\pi^2}{15}\,\frac{(k_BT)^4}{(\hbar c)^3}. \tag{23.38}$$

Similarly, the thermal energy density in the frequency range from $\omega$ to $\omega + d\omega$ is

$$\frac{\hbar\omega\,g(\omega)\,d\omega}{e^{\beta\hbar\omega} - 1}. \tag{23.39}$$

The appropriate level density is just two thirds of the Debye form (23.36), without
the cutoff at $\omega_D$. This yields

$$\frac{\hbar}{\pi^2}\,\frac{\omega^3\,d\omega}{c^3\left(e^{\beta\hbar\omega} - 1\right)}, \tag{23.40}$$

which is the Planck radiation law.
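To attach numbers to the Stefan-Boltzmann result, the sketch below evaluates (23.38) in SI units. The function name is illustrative, and the constants are the standard SI values of $\hbar$, $k_B$, and the speed of light rather than anything quoted in the text.

```python
import math

HBAR = 1.054571817e-34   # J s
K_B = 1.380649e-23       # J / K
C_LIGHT = 2.99792458e8   # m / s

def blackbody_energy_density(temperature):
    """Thermal energy density u(T) in J/m^3 from (23.38)."""
    return (math.pi**2 / 15.0) * (K_B * temperature) ** 4 / (HBAR * C_LIGHT) ** 3

# The coefficient of T^4 is what is usually called the radiation constant.
radiation_constant = blackbody_energy_density(1.0)
print(f"u / T^4  = {radiation_constant:.4e} J m^-3 K^-4")
print(f"u(300 K) = {blackbody_energy_density(300.0):.3e} J/m^3")
```

The same function with c replaced by a sound velocity and the prefactor multiplied by 3/2 gives the low-temperature thermal energy density of the lattice, which is the content of the analogy drawn above.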


PROBLEMS 


1. High Temperature Specific Heat of a Harmonic Crystal

(a) Show that Eq. (23.14), for the leading high-temperature quantum corrections to the law
of Dulong and Petit, can also be written in the form:

$$\frac{\Delta c_v}{c_v^0} = -\frac{1}{12}\,\frac{\int d\omega\,g(\omega)\left(\hbar\omega/k_BT\right)^2}{\int d\omega\,g(\omega)}, \tag{23.41}$$

where $g(\omega)$ is the density of normal modes.

(b) Show that the next term in the high-temperature expansion of $c_v/c_v^0$ is

$$\frac{1}{240}\,\frac{\int d\omega\,g(\omega)\left(\hbar\omega/k_BT\right)^4}{\int d\omega\,g(\omega)}. \tag{23.42}$$

(c) Show that if the crystal is a monatomic Bravais lattice of ions acting only through pair
potentials $\phi(\mathbf{r})$, then (within the harmonic approximation) the second moment of the frequency
distribution that appears in (23.41) is given by

$$\int d\omega\,\omega^2 g(\omega) = \frac{n}{M}\sum_{\mathbf{R}\neq 0}\nabla^2\phi(\mathbf{R}). \tag{23.43}$$


2. Low Temperature Specific Heat in d-Dimensions, and for Nonlinear Dispersion
Laws

(a) Show that Eq. (23.36), for the density of normal modes in the Debye approximation,
gives the exact (within the harmonic approximation) leading low-frequency behavior of $g(\omega)$,
provided that the velocity c is taken to be that given in Eq. (23.18).

(b) Show that in a d-dimensional harmonic crystal, the low-frequency density of normal
modes varies as $\omega^{d-1}$.

(c) Deduce from this that the low-temperature specific heat of a harmonic crystal vanishes
as $T^d$ in d dimensions.

(d) Show that if it should happen that the normal mode frequencies did not vanish linearly
with k, but as $k^\nu$, then the low-temperature specific heat would vanish as $T^{d/\nu}$ in d dimensions.


3. van Hove Singularities

(a) In a linear harmonic chain with only nearest-neighbor interactions, the normal-mode
dispersion relation has the form (cf. Eq. (22.29)) $\omega(k) = \omega_0 |\sin(ka/2)|$, where the constant $\omega_0$ is
the maximum frequency (assumed when $k$ is on the zone boundary). Show that the density of
normal modes in this case is given by

$$g(\omega) = \frac{2}{\pi a \sqrt{\omega_0^2 - \omega^2}}. \tag{23.44}$$

The singularity at $\omega = \omega_0$ is a van Hove singularity.

(b) In three dimensions the van Hove singularities are infinities not in the normal mode
density itself but in its derivative. Show that the normal modes in the neighborhood of a maximum
of $\omega(\mathbf{k})$, for example, lead to a term in the normal-mode density that varies as $(\omega_0 - \omega)^{1/2}$.
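A numerical sanity check of the form (23.44) can be made by counting modes directly. Integrating $a\,g(\omega)$ from 0 to $\omega$ gives $(2/\pi)\arcsin(\omega/\omega_0)$, which should equal the fraction of wave vectors in the first zone whose frequency lies below $\omega$. The sketch below is not a solution of the problem, only a check of the stated result; the function names and the grid size `n_k` are assumptions made for this illustration.

```python
import math

def fraction_below(omega, omega0=1.0, a=1.0, n_k=200_000):
    """Fraction of the chain's normal modes with frequency below omega."""
    count = 0
    for i in range(n_k):
        # Midpoint grid of wave vectors covering the first zone (-pi/a, pi/a).
        k = -math.pi / a + (i + 0.5) * (2.0 * math.pi / a) / n_k
        if omega0 * abs(math.sin(0.5 * k * a)) < omega:
            count += 1
    return count / n_k

def fraction_from_g(omega, omega0=1.0):
    """Integral of a*g(w) from 0 to omega, with g(w) taken from Eq. (23.44)."""
    return (2.0 / math.pi) * math.asin(omega / omega0)

for w in (0.25, 0.5, 0.9, 0.99):
    print(w, fraction_below(w), fraction_from_g(w))
```

The rapid growth of the counted fraction as $\omega$ approaches $\omega_0$ reflects the inverse-square-root singularity.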
It is possible to extract the detailed form of the normal-mode dispersion relations
$\omega_s(\mathbf{k})$ from experiments in which the lattice vibrations exchange energy with an
external probe. The most informative of such probes is a beam of neutrons. One can
view the energy lost (or gained) by a neutron while interacting with a crystal as being
due to the emission (or absorption) of phonons, and by measuring the emergent angles
and energies of the scattered neutrons one can extract direct information about the
phonon spectrum. Similar information can be obtained when the probe is electromagnetic
radiation, the two most important cases being X rays and visible light.

The  broad,  general  principles  underlying  these  experiments  are  much  the  same, 
whether  the  incident  particles  are  neutrons  or  photons,  but  the  information  one 
extracts  from  electromagnetic  probes  is  generally  more  limited  or  more  difficult  to 
interpret.  On  the  other  hand,  electromagnetic  probes — particularly  X-ray  analyses — 
are  of  crucial  importance  for  those  solids  that  are  not  susceptible  to  analysis  through 
neutron scattering. One example¹ is solid helium-3, in which neutron spectroscopy
is  made  impossible  by  the  enormous  cross  section  for  a  helium-3  nucleus  capturing 
a  neutron. 

Neutrons  and  photons  probe  the  phonon  spectrum  in  different  ways  primarily 
because  of  their  very  different  energy-momentum  relations: 

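$$\text{Neutrons:}\quad E_n = \frac{p^2}{2M_n}, \qquad M_n = 1838.65\,m_e = 1.67 \times 10^{-24}\ \text{gm}, \tag{24.1}$$

$$\text{Photons:}\quad E_\gamma = pc, \qquad c = 2.99792 \times 10^{10}\ \text{cm/sec}. \tag{24.2}$$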

Throughout  the  energy  ranges  of  interest  for  the  measurement  of  phonon  dispersion 
relations,  these  two  energy-momentum  relations  are  spectacularly  different  (see  Figure 
24.1).  However,  the  part  of  the  general  analysis  that  does  not  exploit  the  particular 
form  of  the  E  vs.  p  relation  of  the  probe  is  much  the  same  in  either  case.  Therefore, 
although  we  shall  start  with  a  discussion  of  neutron  scattering,  we  shall  be  able  to 
apply  to  photons  those  aspects  of  the  discussion  that  do  not  depend  on  the  particular 
form  (24.1)  of  the  neutron  energy-momentum  relation. 
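To see how different the two relations are on the scale relevant to phonons, one can evaluate both energies at wave vectors comparable to Brillouin zone dimensions (of order an inverse angstrom). The sketch below is illustrative only; the function names are assumptions, and the constants are standard SI values rather than the rounded cgs figures quoted in (24.1) and (24.2).

```python
HBAR = 1.054571817e-34      # J s
M_N = 1.67492749804e-27     # kg, neutron mass
C = 2.99792458e8            # m/s, velocity of light
EV = 1.602176634e-19        # J per eV

def neutron_energy_ev(q_inv_angstrom):
    """E = p^2 / 2M_n for a neutron of wave vector q (in inverse angstroms)."""
    p = HBAR * q_inv_angstrom * 1e10    # momentum in kg m/s
    return p**2 / (2.0 * M_N) / EV

def photon_energy_ev(q_inv_angstrom):
    """E = pc for a photon of wave vector q (in inverse angstroms)."""
    p = HBAR * q_inv_angstrom * 1e10
    return p * C / EV

for q in (0.1, 1.0, 3.0):
    print(f"q = {q} 1/angstrom: neutron {neutron_energy_ev(q):.3e} eV, "
          f"photon {photon_energy_ev(q):.3e} eV")
```

At a wave vector of one inverse angstrom the neutron energy is a few meV, comparable to phonon energies, while the photon energy is in the keV range.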


NEUTRON  SCATTERING  BY  A  CRYSTAL 

Consider a neutron, of momentum $\mathbf{p}$ and energy $E = p^2/2M_n$, that is incident upon
a crystal. Since the neutron interacts strongly only with the atomic nuclei in the
crystal,² it will pass without difficulty into the crystal³ and subsequently emerge with
momentum $\mathbf{p}'$ and energy $E' = p'^2/2M_n$.


¹ A more subtle example is vanadium, in which the natural relative abundances of the vanadium
isotopes conspire numerically with the isotopic variation in neutron scattering amplitude, in such a way
as to cancel out almost completely the informative (so-called coherent) part of the scattering. The
combination of scattering amplitudes can be changed by isotope enrichment.

² The neutron has no electric charge, so it interacts with electrons only through the relatively weak
coupling of its magnetic moment to the magnetic moment of the electrons. This is of considerable
importance in the study of magnetically ordered solids (Chapter 33), but is of little consequence in determining
phonon spectra.

³ Typical nuclear radii are of order $10^{-13}$ cm, and typical internuclear distances in the solid are of
order $10^{-8}$ cm. Hence the nuclei occupy only $10^{-15}$ of the total volume of the solid.
We assume that the ions in the crystal are well described by the harmonic approximation.
Later we shall indicate how our conclusions must be modified by the
inevitable anharmonic terms in the ion-ion interaction. Suppose that at the start of
the experiment the crystal is in a state with phonon occupation numbers⁴ $n_{\mathbf{k}s}$, and
after the experiment, as a result of its interaction with the neutron, the crystal is in
a state with phonon numbers $n'_{\mathbf{k}s}$. Conservation of energy requires that

$$E' - E = -\sum_{\mathbf{k}s} \hbar\omega_{\mathbf{k}s}\,\Delta n_{\mathbf{k}s}, \qquad \Delta n_{\mathbf{k}s} = n'_{\mathbf{k}s} - n_{\mathbf{k}s}; \tag{24.3}$$

i.e., the change in the energy of the neutron is equal to the energy of the phonons
it has absorbed during its passage through the crystal, minus the energy of the phonons
it has emitted.⁵

Thus the change in neutron energy upon passage through the crystal contains
information about the phonon frequencies. A second conservation law is required
to disentangle this information from the scattering data. The second law is known
as the conservation of crystal momentum. It is a very general consequence of a symmetry
of the neutron-ion interaction,

$$H_{n\text{-}i} = \sum_{\mathbf{R}} w(\mathbf{r} - \mathbf{R} - \mathbf{u}(\mathbf{R})). \tag{24.4}$$

⁴ By a state with phonon occupation numbers $n_{\mathbf{k}s}$ we mean one in which $n_{\mathbf{k}s}$ phonons of type $\mathbf{k}s$ are
present, i.e., one in which the $\mathbf{k}s$-th normal mode is in its $n_{\mathbf{k}s}$th excited state.

⁵ A neutron may lose energy or gain it, depending on the balance between the energies of the emitted
and absorbed phonons.
Here $w$ is the (very short-ranged) potential of interaction between a neutron and
an atomic nucleus of the crystal, and $\mathbf{r}$ is the neutron coordinate. The interaction
(24.4) is unaffected by a transformation that shifts the neutron coordinate $\mathbf{r}$ by any
Bravais lattice vector $\mathbf{R}_0$ and also permutes the ion displacement variables $\mathbf{u}(\mathbf{R})$ by
$\mathbf{u}(\mathbf{R}) \rightarrow \mathbf{u}(\mathbf{R} - \mathbf{R}_0)$, for if we make both replacements, then (24.4) becomes:

$$H_{n\text{-}i} \rightarrow \sum_{\mathbf{R}} w(\mathbf{r} + \mathbf{R}_0 - \mathbf{R} - \mathbf{u}(\mathbf{R} - \mathbf{R}_0)) = \sum_{\mathbf{R}} w(\mathbf{r} - (\mathbf{R} - \mathbf{R}_0) - \mathbf{u}(\mathbf{R} - \mathbf{R}_0)). \tag{24.5}$$

Since we are summing over all vectors in the Bravais lattice, (24.5) is precisely the
same as (24.4).⁶
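The invariance argument can be checked numerically in a toy setting. The sketch below uses a one-dimensional ring of sites with periodic boundary conditions (standing in for the infinite crystal) and a Gaussian model potential; both are assumptions made purely for illustration, as are all names and parameter values.

```python
import math
import random

N = 12          # number of lattice sites on the ring (assumed)
A = 1.0         # lattice constant
SIGMA = 0.2     # range of the hypothetical model potential

def w(x):
    """Hypothetical short-ranged potential, made periodic over the ring."""
    length = N * A
    x = (x + 0.5 * length) % length - 0.5 * length   # nearest-image separation
    return math.exp(-x * x / (2.0 * SIGMA**2))

def interaction(r, u):
    """One-dimensional analogue of (24.4): sum over R of w(r - R - u(R))."""
    return sum(w(r - j * A - u[j]) for j in range(N))

random.seed(0)
u = [random.uniform(-0.1, 0.1) for _ in range(N)]   # ion displacements u(R)
r = 0.37                                            # neutron coordinate
m = 5                                               # shift by R0 = m * A

# Permute the displacements: the new u(R) is the old u(R - R0).
u_shifted = [u[(j - m) % N] for j in range(N)]

before = interaction(r, u)
after = interaction(r + m * A, u_shifted)
print(before, after)
```

Shifting the neutron coordinate without permuting the displacements would in general change the value; it is the combined transformation that leaves the sum unchanged.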

One of the fundamental results of quantum theory is that symmetries of the
Hamiltonian imply conservation laws. It is shown in Appendix M that this particular
symmetry implies the conservation law:

$$\mathbf{p}' - \mathbf{p} = -\sum_{\mathbf{k}s} \hbar\mathbf{k}\,\Delta n_{\mathbf{k}s} + (\text{reciprocal lattice vector} \times \hbar). \tag{24.6}$$

If we define the crystal momentum of a phonon to be $\hbar$ times its wave vector, then
(24.6) makes an assertion strikingly similar to momentum conservation: The change
in neutron momentum is just the negative of the change in total phonon crystal momentum,
to within an additive reciprocal lattice vector.

We emphasize, however, that the crystal momentum of a phonon is not, in general,
accompanied by any real momentum of the ionic system. “Crystal momentum” is
simply a name for $\hbar$ times the phonon wave vector.⁷ The name is intended to suggest
that $\hbar\mathbf{k}$ frequently plays a role quite similar to that of a momentum, as it evidently
does in Eq. (24.6). Since a crystal possesses translational symmetry, it is not surprising
that there should be a conservation law rather like momentum conservation;⁸ but
because this symmetry is only that of a Bravais lattice (as opposed to the full
translational symmetry of empty space), it is also not surprising that the conservation law
is weaker than momentum conservation (i.e., crystal momentum is only conserved
to within an additive reciprocal lattice vector).

Because there are two conservation laws, it turns out to be possible to extract the
explicit forms of the $\omega_s(\mathbf{k})$ from the neutron scattering data in a simple way. To
show this we examine the distribution of scattered neutrons emerging from the crystal,
classifying the types of scattering that can take place according to the total number
of phonons that a neutron has exchanged energy with while passing through the
crystal.


⁶ This is strictly the case only for a neutron interacting with an infinite crystal. To the extent that
surface scattering is important (and in neutron scattering it is not), crystal momentum will not be conserved.

⁷ The nomenclature is quite analogous to that used in Chapter 8, where we defined the crystal
momentum of a Bloch electron with wave vector $\mathbf{k}$ to be $\hbar\mathbf{k}$. The identical terminology is deliberate, for
in processes in which phonon and electronic transitions occur together, the total crystal momentum of
the electron-phonon system is conserved (to within a reciprocal lattice vector $\times\ \hbar$). (See Appendix M and
Chapter 26.)

⁸ The law of momentum conservation follows from the complete translational invariance of empty
space.


Zero-Phonon  Scattering 

In this case the final state of the crystal is identical to its initial state. Energy
conservation (Eq. (24.3)) implies that the energy of the neutron is unchanged (i.e., the
scattering is elastic) and crystal momentum conservation (Eq. (24.6)) implies that
the neutron momentum can only change by $\hbar\mathbf{K}$, where $\mathbf{K}$ is a reciprocal lattice vector.
If we write the incident and scattered neutron momenta as:

$$\mathbf{p} = \hbar\mathbf{q}, \qquad \mathbf{p}' = \hbar\mathbf{q}', \tag{24.7}$$

then these restrictions become:

$$q' = q, \qquad \mathbf{q}' = \mathbf{q} + \mathbf{K}. \tag{24.8}$$

Equations (24.8) are precisely the Laue conditions that the incident and scattered
X-ray wave vectors must satisfy in order for elastically scattered X rays to produce
a Bragg peak (see page 99). Since a neutron with momentum $\mathbf{p} = \hbar\mathbf{q}$ can be viewed
as a plane wave with wave vector $\mathbf{q}$, this emergence of the Laue condition is to be
expected. We conclude that elastically scattered neutrons, which create or destroy
no phonons, are found only in directions that satisfy the Bragg condition, and give
precisely the same structural information about the crystal as described in the
discussion of elastic X-ray scattering in Chapter 6.

One-Phonon  Scattering 

It is the neutrons that absorb or emit precisely one phonon that convey the most
important information. In the case of absorption (which is generally the more
important) conservation of energy and crystal momentum implies that

$$\begin{aligned} E' &= E + \hbar\omega_s(\mathbf{k}), \\ \mathbf{p}' &= \mathbf{p} + \hbar\mathbf{k} + \hbar\mathbf{K}, \end{aligned} \tag{24.9}$$

where $\mathbf{k}$ and $s$ are the wave vector and branch index of the absorbed phonon. In the
case of emission we have:

$$\begin{aligned} E' &= E - \hbar\omega_s(\mathbf{k}), \\ \mathbf{p}' &= \mathbf{p} - \hbar\mathbf{k} + \hbar\mathbf{K}, \end{aligned} \tag{24.10}$$

where the phonon has been emitted into branch $s$, with wave vector $\mathbf{k}$.

In either case we can use the crystal momentum law to represent $\mathbf{k}$ in terms of
the neutron momentum transfer, $\mathbf{p}' - \mathbf{p}$. Furthermore, the additive reciprocal lattice
vector that appears in this relation can be ignored, when the resulting expression for
$\mathbf{k}$ is substituted into the energy conservation law, for each $\omega_s(\mathbf{k})$ is a periodic function
in the reciprocal lattice:

$$\omega_s(\mathbf{k} \pm \mathbf{K}) = \omega_s(\mathbf{k}). \tag{24.11}$$


474  Chapter  24  Measuring  Phonon  Dispersion  Relations 


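As a result, the two conservation laws yield one equation:

$$\frac{p'^2}{2M_n} = \frac{p^2}{2M_n} + \hbar\omega_s\!\left(\frac{\mathbf{p}' - \mathbf{p}}{\hbar}\right), \quad \text{phonon absorbed}, \tag{24.12}$$

or

$$\frac{p'^2}{2M_n} = \frac{p^2}{2M_n} - \hbar\omega_s\!\left(\frac{\mathbf{p} - \mathbf{p}'}{\hbar}\right), \quad \text{phonon emitted}. \tag{24.13}$$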


In a given experiment, the incident neutron momentum and energy are usually
specified. Thus for a given phonon dispersion relation $\omega_s(\mathbf{k})$ the only unknowns in
(24.12) and (24.13) are the three components of the final neutron momentum $\mathbf{p}'$.
Quite generally, a single equation relating the three components of a vector $\mathbf{p}'$ will
(if it has any solutions at all) specify a surface (or surfaces) in three-dimensional
$\mathbf{p}'$-space. If we only examine neutrons emerging in a definite direction we will specify
the direction of $\mathbf{p}'$, and can therefore expect to find solutions at only a single point on
the surface (or a finite number of points on the surfaces).⁹

If we select a general direction we will see neutrons scattered by one-phonon
processes only at a few discrete values of $p'$, and correspondingly only at a few discrete
energies $E' = p'^2/2M_n$. Knowing the energy and the direction in which the scattered
neutron emerges, we can construct $\mathbf{p}' - \mathbf{p}$ and $E' - E$, and can therefore conclude
that the crystal has a normal mode whose frequency is $|E' - E|/\hbar$ and wave vector
is $\pm(\mathbf{p}' - \mathbf{p})/\hbar$. We have therefore measured a point in the crystal’s phonon spectrum.
By varying all the parameters at our disposal (incident energy, orientation of the
crystal, and direction of detection) we can collect a large number of such points, and
quite effectively map out the entire phonon spectrum (Figure 24.2). This can be
accomplished, however, only if it is possible to distinguish the neutrons scattered in
one-phonon processes from the others. We consider explicitly the case of two-phonon
processes.
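The reduction of a single one-phonon peak to a point on the dispersion curve, as described above, can be sketched in a few lines. This is a minimal illustration under stated assumptions, not a description of any actual data-analysis software: the function names, the units (meV and inverse angstroms), and the example energies and directions are all hypothetical.

```python
import math

HBAR = 1.054571817e-34      # J s
M_N = 1.67492749804e-27     # kg, neutron mass
MEV = 1.602176634e-22       # J per meV

def momentum(energy_mev, direction):
    """Neutron momentum vector (kg m/s) from its energy and direction of travel."""
    norm = math.sqrt(sum(c * c for c in direction))
    p = math.sqrt(2.0 * M_N * energy_mev * MEV)
    return [p * c / norm for c in direction]

def one_phonon_point(e_in_mev, dir_in, e_out_mev, dir_out):
    """Return (omega in rad/s, k in 1/angstrom) inferred from a one-phonon peak.

    k is (p' - p)/hbar for absorption (E' > E) and (p - p')/hbar for emission
    (E' < E); it is determined only up to a reciprocal lattice vector.
    """
    p_in = momentum(e_in_mev, dir_in)
    p_out = momentum(e_out_mev, dir_out)
    sign = 1.0 if e_out_mev > e_in_mev else -1.0
    omega = abs(e_out_mev - e_in_mev) * MEV / HBAR
    k = [sign * (po - pi) / HBAR * 1e-10 for po, pi in zip(p_out, p_in)]
    return omega, k

# Hypothetical peak: 5 meV neutrons incident along x, detected along y at 25 meV.
omega, k = one_phonon_point(5.0, (1, 0, 0), 25.0, (0, 1, 0))
print(omega, k)
```

In practice the wave vector obtained this way would still have to be reduced to the first Brillouin zone by subtracting the appropriate reciprocal lattice vector, which requires the crystal structure and orientation and is not attempted here.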

Two-Phonon  Scattering 

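In a two-phonon process a neutron can absorb or emit two phonons, or emit one
and absorb another (which can also be described as the scattering of a single phonon).
To be concrete we discuss the case of two-phonon absorption. The conservation laws
will then have the form:

$$\begin{aligned} E' &= E + \hbar\omega_s(\mathbf{k}) + \hbar\omega_{s'}(\mathbf{k}'), \\ \mathbf{p}' &= \mathbf{p} + \hbar\mathbf{k} + \hbar\mathbf{k}' + \hbar\mathbf{K}. \end{aligned} \tag{24.14}$$

If we eliminate $\mathbf{k}'$ through the crystal momentum conservation law, we arrive at a
single restriction:

$$E' = E + \hbar\omega_s(\mathbf{k}) + \hbar\omega_{s'}\!\left(\frac{\mathbf{p}' - \mathbf{p}}{\hbar} - \mathbf{k}\right). \tag{24.15}$$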


⁹ Alternatively, by specifying the direction of $\mathbf{p}'$ we leave a single unknown variable (the magnitude
$p'$) in (24.12) or (24.13) and therefore expect at most a finite number of solutions.


Figure 24.2

Phonon dispersion relations in aluminum, measured along the k-space lines ΓX and ΓKX by
neutron scattering; the frequency axis is in units of $10^{13}$ rad/sec. The estimated error in
frequency is 1 to 2 percent. Each point represents an observed neutron group. (After J. Yarnell
et al., Lattice Dynamics, R. F. Wallis, ed., Pergamon, New York, 1965.) Note that the two
transverse branches are degenerate along ΓX (4-fold axis), but not along ΓK (2-fold axis).
See Chapter 22.

For each fixed value of $\mathbf{k}$, the discussion we gave in analyzing the one-phonon case
can be repeated: For a given direction of detection, scattered neutrons will occur
only at a small set of discrete energies. However, $\mathbf{k}$ can now be varied continuously
throughout the first Brillouin zone, since the wave vector of the absorbed phonons
is not at our disposal. As $\mathbf{k}$ varies, the discrete energies of the emergent neutron will
also vary. Therefore the totality of neutrons emerging from the process in a specified
direction will have a continuous distribution of energies.

Evidently  this  conclusion  is  not  restricted  to  the  particular  type  of  two-phonon 
process  we  have  considered,  nor  even  to  two-phonon  processes.  Only  in  one-phonon 
processes  are  the  conservation  laws  restrictive  enough  to  forbid  all  but  a  discrete  set 
of  energies  for  the  neutrons  scattered  in  a  given  direction.  If  a  neutron  has  exchanged 
energy  with  two  or  more  phonons,  the  number  of  degrees  of  freedom  sufficiently 
exceeds  the  number  of  conservation  laws  that  a  continuum  of  scattered  neutron 
energies  can  be  observed  in  any  direction. 

As a result it is possible to distinguish the one-phonon processes from the rest
(known as the multiphonon background) not by any characteristics of a single
scattered neutron, but by the statistical structure of the energy distribution of the neutrons
scattered in a given direction. The one-phonon processes will contribute sharp peaks
at isolated energies, whereas the multiphonon processes will give a continuous
background (Figure 24.3). The energy and momentum transfer of the one-phonon processes
can therefore be identified as those at which the sharp peaks occur.


Figure 24.3

Relative numbers of scattered neutrons in a given direction as a function of neutron energy.
The smooth curve is the background due to multiphonon processes. In an ideal harmonic
crystal the one-phonon processes would contribute sharp peaks. In a real crystal these peaks
are broadened (dashed curves) due to phonon lifetime effects.


Widths  of  One-Phonon  Peaks 

Some typical neutron distributions are shown in Figure 24.4. Note that although
the one-phonon peaks are generally quite unambiguous, they are not as perfectly sharp
as our analysis has suggested. This is because real crystals are not perfectly harmonic.
The stationary states of the harmonic approximation are only approximate stationary
states: Even if the real crystal is in such a state (characterized by a specific set of
phonon occupation numbers) at one time, it will eventually evolve into a superposition
of other such states (characterized by different phonon occupation numbers). If,
however, the harmonic stationary states are reasonably good approximations to the
exact ones, this decay may be slow enough to permit one to continue to describe
processes occurring within the crystal in terms of phonons, provided that one assigns
to the phonons finite lifetimes, reflecting the eventual decay of the approximate
harmonic stationary state. Associated with a phonon of lifetime $\tau$, there will be an
uncertainty $\hbar/\tau$ in the phonon energy. The energy conservation law determining the
one-phonon peaks will then be correspondingly weakened.


Figure 24.4

Some typical experimental neutron groups. In all cases the number of neutrons emerging in a
fixed direction for a fixed incident energy is plotted against a variable that distinguishes
scattered neutron energies. (a) Copper. (G. Gobert and B. Jacrot, J. Phys. Radium 19 (1959).)
(b) Germanium. (I. Pelah et al., Phys. Rev. 108, 1091 (1957).) Horizontal axis: neutron
wavelength (Å).

These points will be taken up in more detail in Chapter 25. Here we merely note
that the one-phonon peaks, though broadened, are still clearly identifiable. The fact
that they are indeed due to one-phonon processes is strikingly confirmed by the
consistency of the $\omega_s(\mathbf{k})$ curves inferred from their positions, for there is considerable
redundancy in the data furnished by the one-phonon peaks. One can extract
information about a particular phonon in several ways, by considering scattering events
with the same energy exchange, and momentum transfers that differ by a reciprocal
lattice vector.¹⁰

It is important to emphasize that there are indeed solutions to the one-phonon
conservation law (24.12) for a range of energy and momentum transfers sufficient to
permit a systematic mapping out of the phonon spectrum. To see this, suppose first,
for simplicity, that the incident neutron energy $E$ is negligibly small on the scale of
phonon energies. Since the maximum phonon energy is of order $k_B\Theta_D$, and $\Theta_D$ is
typically somewhere from 100 to 1000 K, this means we are dealing with so-called
cold-neutron scattering.


Conservation  Laws  and  One-Phonon  Scattering 


If $E = 0$, then Eq. (24.13) will have no solutions at all (a zero-energy neutron cannot
emit a phonon and conserve energy). However, the conservation law for phonon
absorption (24.12) reduces to

$$\frac{p'^2}{2M_n} = \hbar\omega_s\!\left(\frac{\mathbf{p}'}{\hbar}\right), \tag{24.16}$$

which must have solutions for any direction of $\mathbf{p}'$. This is evident from Figure 24.5.


Figure 24.5

One-dimensional demonstration of the fact that the conservation laws for one-phonon absorption
can always be satisfied for zero-energy incident neutrons. The equation $\hbar^2 k^2/2M_n = \hbar\omega(k)$ is
satisfied wherever the two curves intersect.


¹⁰ It is also possible to extract information about the polarization vectors. This follows from the fact
(derived in Appendix N) that the cross section for a given one-phonon process is proportional to

$$|\boldsymbol{\epsilon}_s(\mathbf{k}) \cdot (\mathbf{p} - \mathbf{p}')|^2,$$

where $\boldsymbol{\epsilon}_s(\mathbf{k})$ is the polarization vector for the phonon involved, and $\mathbf{p}' - \mathbf{p}$ is the neutron momentum transfer.
It follows analytically from the fact that the neutron energy vanishes quadratically
for small $p'$, while $\hbar\omega_s(\mathbf{p}'/\hbar)$ either vanishes linearly (acoustic branch) or approaches
a constant (optical branch). Hence for small enough $p'$ the neutron energy is always
less than that of the phonon for any direction of $\mathbf{p}'$. However, as $p'$ increases, the
neutron energy can increase without bound, while $\hbar\omega_s(\mathbf{p}'/\hbar)$ is bounded above by the
maximum phonon energy in the branch. By continuity there is therefore at least one
value of $p'$ for each direction of $\mathbf{p}'$ for which the left and right sides of (24.16) are
equal. There must be at least one such solution for each branch $s$ of the phonon
spectrum. Usually there is more than one solution (Figure 24.5). This is because the
final neutron energy is comparatively small (even when $\mathbf{p}'/\hbar$ is at the surface of the
Brillouin zone), for a neutron of wave vector $q$ (measured in inverse angstroms) has
an energy

$$E_n = 2.1\,(q[\text{\AA}^{-1}])^2 \times 10^{-3}\ \text{eV}, \qquad \frac{E_n}{k_B} = 24\,(q[\text{\AA}^{-1}])^2\ \text{K}. \tag{24.17}$$

Thus $E_n/k_B$ is small compared with typical $\Theta_D$, even when $q$ is at a zone boundary.
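The continuity argument can be made concrete with a one-dimensional model in the spirit of Figure 24.5. The sketch below looks for the nonzero wave vectors at which the neutron parabola crosses a model acoustic branch $\hbar\omega(k) = \hbar\omega_0|\sin(ka/2)|$. The dispersion law, the values of the maximum phonon energy and lattice constant, and all names are assumptions chosen only for illustration; the coefficient 2.0721 meV Å² is $\hbar^2/2M_n$, consistent with (24.17).

```python
import math

E_COEFF = 2.0721    # meV * angstrom^2: neutron energy is E_COEFF * k^2
HW0 = 20.0          # meV, assumed maximum phonon energy
A = 4.0             # angstrom, assumed lattice constant

def mismatch(k):
    """Left side minus right side of the one-dimensional form of (24.16), in meV."""
    return E_COEFF * k * k - HW0 * abs(math.sin(0.5 * k * A))

def solutions(k_max=4.0, step=1e-3):
    """Positive wave vectors (1/angstrom) where the two curves intersect."""
    roots = []
    n = int(k_max / step)
    for i in range(1, n):
        lo, hi = i * step, (i + 1) * step
        if mismatch(lo) * mismatch(hi) < 0.0:
            # Refine the bracketed crossing by bisection.
            for _ in range(60):
                mid = 0.5 * (lo + hi)
                if mismatch(lo) * mismatch(mid) <= 0.0:
                    hi = mid
                else:
                    lo = mid
            roots.append(0.5 * (lo + hi))
    return roots

roots = solutions()
print(roots, "zone boundary at", math.pi / A)
```

With these assumed parameters the crossings all lie beyond the first zone boundary, which illustrates the remark that the neutron energy is still small there; the periodicity of the dispersion curve is what allows more than one solution.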

When the incident neutron energy is not zero, there will continue to be solutions
corresponding to the absorption of a phonon in each branch (Figure 24.6). If a
certain threshold energy is exceeded, additional solutions will become possible
corresponding to the emission of a phonon. Therefore there is no lack of one-phonon
peaks, and ingenious techniques have been developed for mapping out the phonon
spectrum of a crystal along various directions in k-space with considerable accuracy
(a few percent) and at a large number of points.


Figure 24.6

Graphical solution to the one-phonon conservation laws when the incident neutron has wave
vector $k_i$. The conservation law for phonon absorption can be written

$$E_N(\hbar k + \hbar k_i) - E_N(\hbar k_i) = \hbar\omega(k),$$

where $\hbar k + \hbar k_i$ is the momentum of the scattered neutron, and $E_N(p) = p^2/2M_N$. To draw the
left-hand side of this equation, one displaces the neutron energy-momentum curve horizontally
so that it is centered at $k = -k_i$ rather than $k = 0$, and displaces it downward by an amount
$E_N(\hbar k_i)$. Solutions occur wherever this displaced curve intersects the phonon dispersion
curve $\hbar\omega(k)$. In the present case there are solutions for four different scattered neutron
wave vectors.

ELECTROMAGNETIC  SCATTERING  BY  A  CRYSTAL 

Precisely the same conservation laws (energy and crystal momentum) apply to the
scattering of photons by the ions in a crystal, but because of the very different
quantitative form of the photon energy-momentum relation, simple direct information
about the entire phonon spectrum is much more difficult to extract than from neutron
scattering data. The two most commonly employed electromagnetic techniques, each
with its limitations, are the inelastic scattering of X rays and of visible light.


X-Ray  Measurements  of  Phonon  Spectra 

Our discussion of X-ray scattering in Chapter 6 was based on the model of a static
lattice (which is why it is equivalent to the zero-phonon elastic scattering described in
our discussion of neutron scattering). When the assumption of a rigid static lattice
of ions is relaxed, it is possible for X-ray photons, like neutrons, to be inelastically
scattered with the emission and/or absorption of one or more phonons. However,
the change in the energy of an inelastically scattered photon is extremely difficult to
measure. A typical X-ray energy is several keV ($10^3$ eV), whereas a typical phonon
energy is several meV ($10^{-3}$ eV), and at most a few hundredths of an eV, for $\Theta_D$ of
order room temperature. In general the resolution of such minute photon frequency
shifts is so difficult that one can only measure the total scattered radiation of all
frequencies, as a function of scattering angle, in the diffuse background of radiation
found at angles away from those satisfying the Bragg condition. Because of this
difficulty in energy resolution, the characteristic structure of the one-phonon processes
is lost and their contribution to the total radiation scattered at any angle cannot
be simply distinguished from the contribution of the multiphonon processes.

Some information can be extracted along different lines, however. It is shown in
Appendix N that the contribution of the one-phonon processes to the total intensity
of radiation scattered at a given angle is entirely determined by a simple function of
the frequencies and polarizations of those few phonons taking part in the one-phonon
events. Therefore one can extract the phonon dispersion relations from a measurement
of the intensity of scattered X radiation as a function of angle and incident X-ray
frequency, provided that one can find some way of subtracting from this intensity
the contribution from the multiphonon processes. Generally one attempts to do this
by a theoretical calculation of the multiphonon contribution. In addition, however,
one must allow for the fact that X rays, unlike neutrons, interact strongly with
electrons. There will be a contribution to the intensity due to X rays scattered
inelastically by electrons (the so-called Compton background), which must also be corrected for.

As a result of these considerations X-ray scattering is a far less powerful probe
of the phonon spectrum than neutron scattering. The great virtue of neutrons is
that good energy resolution is possible, and once the scattered energies have been
resolved, the highly informative one-phonon processes are clearly identifiable.

Optical  Measurements  of  Phonon  Spectra 

If photons of visible light (usually from a high intensity laser beam) are scattered
with the emission or absorption of phonons, the energy (or frequency) shifts are
still very small, but they can be measured, generally by interferometric techniques.
Therefore one can isolate the one-phonon contribution to the light scattering, and
extract the values of $\omega_s(\mathbf{k})$ for the phonons participating in the process. Because,
however, the photon wave vectors (of order $10^5$ cm$^{-1}$) are small compared with the
Brillouin zone dimensions (of order $10^8$ cm$^{-1}$), information is provided only about
phonons in the immediate neighborhood of $\mathbf{k} = 0$. The process is referred to as
Brillouin scattering, when the phonon emitted or absorbed is acoustic, and Raman
scattering, when the phonon is optical.

In examining the conservation laws for these processes, one must note that the
photon wave vectors inside the crystal will differ from their free space values by a
factor of the index of refraction of the crystal $n$ (since the frequencies in the crystal
are unchanged, and the velocity is $c/n$). Therefore if the free space wave vectors of
the incident and scattered photons are $\mathbf{q}$ and $\mathbf{q}'$, and the corresponding angular
frequencies are $\omega$ and $\omega'$, conservation of energy and crystal momentum in a
one-phonon process requires

$$\hbar\omega' = \hbar\omega \pm \hbar\omega_s(\mathbf{k}) \tag{24.18}$$

and

$$\hbar n\mathbf{q}' = \hbar n\mathbf{q} \pm \hbar\mathbf{k} + \hbar\mathbf{K}. \tag{24.19}$$

Here the upper sign refers to processes in which a phonon is absorbed (known as
the anti-Stokes component of the scattered radiation) and the lower sign refers to
processes in which a phonon is emitted (the Stokes component). Since the photon
wave vectors $\mathbf{q}$ and $\mathbf{q}'$ are small in magnitude compared with the dimensions of the
Brillouin zone, for phonon wave vectors $\mathbf{k}$ in the first zone the crystal momentum
conservation law (24.19) can be obeyed only if the reciprocal lattice vector $\mathbf{K}$ is zero.
The two types of process are shown in Figure 24.7, and the constraint imposed
by crystal momentum conservation in Figure 24.8. Since the energy of any phonon
is at most of order $\hbar\omega_D \approx 10^{-2}$ eV, the photon energy (typically a few eV), and
hence the magnitude of the photon wave vector, is changed very little; i.e., the
triangle in Figure 24.8 is very nearly isosceles. It follows immediately that the
magnitude $k$ of the phonon wave vector is related to the angular frequency of the light
and the scattering angle $\theta$ by

$$k = 2nq \sin\tfrac{1}{2}\theta = (2\omega n/c) \sin\tfrac{1}{2}\theta. \tag{24.20}$$

The direction of $\mathbf{k}$ is determined by the construction in Figure 24.8, and the frequency,
$\omega_s(\mathbf{k})$, by the measured (small) change in photon frequency.


Figure 24.7

The scattering of a photon through an angle $\theta$ from free space wave vector $\mathbf{q}$
to free space wave vector $\mathbf{q}'$ with (a) the absorption of a phonon of wave vector
$\mathbf{k}$ (anti-Stokes) and (b) the emission of a phonon of wave vector $\mathbf{k}$ (Stokes).
The photon wave vectors in the crystal are $n\mathbf{q}$ and $n\mathbf{q}'$, where $n$ is the index of
refraction.

Figure 24.8

Geometrical derivation of Eq. (24.20). Because the photon energy is virtually
unchanged, the triangle is isosceles. Because the process takes place within
the crystal, the photon wave vectors are $n\mathbf{q}$ and $n\mathbf{q}'$, where $n$ is the index of
refraction of the crystal. The figure is drawn for the case of phonon absorption
(anti-Stokes). It also describes the case of phonon emission (Stokes) if
the direction of $\mathbf{k}$ is reversed.

In the case of Brillouin scattering, the phonon is an acoustic phonon near the
origin of k-space, and $\omega_s(\mathbf{k})$ has the form $\omega_s(\mathbf{k}) = c_s(\hat{\mathbf{k}})k$ (Eq. (22.65)). Equation (24.20)
then relates the sound velocity $c_s(\hat{\mathbf{k}})$ to the scattering angle and shift in photon energy
$\Delta\omega$ by

$$c_s(\hat{\mathbf{k}}) = \frac{\Delta\omega}{2\omega}\,\frac{c}{n}\,\csc\tfrac{1}{2}\theta. \tag{24.21}$$

Some typical data are shown in Figure 24.9.
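As a rough numerical illustration of Eqs. (24.20) and (24.21), the sketch below computes the phonon wave vector selected by a given scattering geometry and then recovers a sound velocity from the resulting frequency shift. All input numbers (wavelength, index of refraction, scattering angle, sound velocity) are assumed for illustration only, and the function names are hypothetical.

```python
import math

C_LIGHT = 2.998e10  # speed of light in cm/s

def phonon_wave_vector(omega: float, n: float, theta: float) -> float:
    """Eq. (24.20): k = (2*omega*n/c) * sin(theta/2), in cm^-1."""
    return 2.0 * omega * n / C_LIGHT * math.sin(theta / 2.0)

def sound_velocity(delta_omega: float, omega: float, n: float, theta: float) -> float:
    """Eq. (24.21): c_s = (delta_omega / (2*omega)) * (c/n) * csc(theta/2), in cm/s."""
    return delta_omega / (2.0 * omega) * (C_LIGHT / n) / math.sin(theta / 2.0)

# Assumed illustrative inputs (not taken from the text).
wavelength_cm = 5.0e-5                            # 500 nm light
omega = 2.0 * math.pi * C_LIGHT / wavelength_cm   # angular frequency of the light
n = 1.5                                           # index of refraction
theta = math.pi / 2.0                             # 90 degree scattering angle
c_s_assumed = 5.0e5                               # sound velocity, cm/s

k = phonon_wave_vector(omega, n, theta)
delta_omega = c_s_assumed * k        # acoustic branch: omega_s = c_s * k
c_s_recovered = sound_velocity(delta_omega, omega, n, theta)
relative_shift = delta_omega / omega
```

The relative frequency shift comes out far below one part in a thousand, which is consistent with the remark that the shifts are very small and call for interferometric techniques.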


WAVE  PICTURE  OF  THE  INTERACTION  OF  RADIATION  WITH 
LATTICE  VIBRATIONS 

In  the  above  discussion  we  have  regarded  both  neutrons  (or  photons)  and  phonons 
as  particles,  for  which  the  crucial  equations  (24.3)  and  (24.6)  express  the  conservation 
of  energy  and  crystal  momentum.  However,  the  same  constraints  can  be  derived  by 
viewing  the  phonons  and  the  incident  radiation  not  as  particles,  but  as  waves.  For 
electromagnetic  scattering  this  is  the  natural  classical  approach,  and  was  the  point 
of  view  from  which  the  subject  was  originally  developed  by  Brillouin.  For  neutron 
scattering,  the  wave  picture  remains  quantum  mechanical,  since  although  the  phonon 
is  no  longer  considered  a  particle,  the  neutron  is  regarded  as  a  wave.  This  alternative 
point  of  view  cannot  contain  any  new  physics,  but  is  nevertheless  worth  keeping  in 
mind  for  the  additional  insights  it  sometimes  affords. 


[Figure 24.9: plots not reproduced. Recoverable labels: "Main beam"; horizontal axis of panel (b), "Shift (cm$^{-1}$)", with ticks from 350 to 150.]


Figure  24.9 

(a)  Characteristic  structure  of  a  Brillouin  spectrum.  Intensity  is  plotted  vs.  frequency.  There  are 
clearly  identifiable  peaks  at  frequencies  above  and  below  the  frequency  of  the  main  laser  beam, 
corresponding  to  one  longitudinal  and  two  transverse  acoustic  branches.  (S.  Fray  et  al.  Light 
Scattering  Spectra  of  Solids ,  G.  B.  Wright,  ed.,  Springer,  New  York,  1969.)  (b)  The  Raman  spectra 
of  CdS  and  CdSe,  revealing  peaks  determined  by  the  longitudinal  and  transverse  optical  phonons. 
(R. K. Chang et al., ibid.)

Consider, then, the interaction of a wave with angular frequency $E/\hbar$ and wave
vector $\mathbf{q} = \mathbf{p}/\hbar$, with a particular normal mode of the crystal with angular frequency
$\omega$ and wave vector $\mathbf{k}$. We suppose that only this particular normal mode is excited;
i.e., we consider the interaction of the wave with one phonon at a time. We also ignore
for the moment the microscopic structure of the crystal, regarding the normal mode
of interest as a wavelike disturbance in a continuous medium. If the disturbance did
not move it would present to the incident radiation a periodic variation in density,
which would act as a diffraction grating (Figure 24.10), the scattered wave being
determined by Bragg's law. However, the disturbance is not stationary, but is moving
with the phonon phase velocity, which is directed along $\mathbf{k}$ and has magnitude $\omega/k$:

$$\mathbf{v} = \frac{\omega}{k}\,\hat{\mathbf{k}}. \tag{24.22}$$

Figure 24.10

Scattering of a neutron by a phonon in a frame of reference in which the
phonon phase velocity is zero. The phonon appears as a static diffraction
grating; i.e., it results in regions of alternating high and low ionic
density. The Bragg condition (p. 97), $m\lambda = 2d\sin\theta$ ($m$ an integer),
can be written as $2\pi m/q = 2d\sin\theta$, or $mk = 2q\sin\theta$, or
$mk = (\mathbf{q}' - \mathbf{q})\cdot\hat{\mathbf{k}}$.
Since Bragg reflection is specular (angle of incidence equals angle of
reflection) and since the magnitude $q'$ equals the magnitude $q$, it follows
that $\mathbf{q}' - \mathbf{q}$ must be parallel to $\mathbf{k}$, and therefore $\mathbf{q}' - \mathbf{q} = m\mathbf{k}$.


This  complication  can  be  dealt  with  by  describing  the  diffraction  in  the  frame  of 
reference  that  moves  with  the  phase  velocity  v.  In  that  frame  the  disturbance  will  be 
stationary,  and  the  Bragg  condition  can  be  applied.  The  wave  vectors  (of  both  the 
lattice  wave  and  the  incident  and  scattered  waves)  are  unaltered  by  a  change  of 
frame,  since  this  affects  neither  the  distance  between  planes  of  constant  phase  nor 
their orientation.11 However, the frequencies undergo a Doppler shift. Denoting
quantities measured in the moving frame by a bar,

$$\bar{\omega} = \omega - \mathbf{k}\cdot\mathbf{v}, \qquad \frac{\bar{E}}{\hbar} = \frac{E}{\hbar} - \mathbf{q}\cdot\mathbf{v}, \qquad \frac{\bar{E}'}{\hbar} = \frac{E'}{\hbar} - \mathbf{q}'\cdot\mathbf{v}. \tag{24.23}$$

Since Bragg reflection by a stationary grating leaves the frequency of the incident
wave unaltered, $\bar{E}'$ must be equal to $\bar{E}$. The transformation law (24.23) then implies
that in the original frame the frequency of the scattered wave must be shifted:

$$\frac{E'}{\hbar} = \frac{E}{\hbar} + (\mathbf{q}' - \mathbf{q})\cdot\mathbf{v}. \tag{24.24}$$

11 More precisely, the change in wave vector is a relativistic effect, which we ignore because the phase
velocity $v$ is small compared with $c$. The Doppler shift formulas (24.23) are also used in their nonrelativistic
forms.

The change in wave vector under a Bragg reflection has the form:

$$\mathbf{q}' = \mathbf{q} + m\mathbf{k}, \tag{24.25}$$

where the integer $m$ is the order of the Bragg reflection (as demonstrated in Figure
24.10).12 This relation holds in either frame, since wave vectors are invariant under
change of frame.

Substituting (24.25) into (24.24), we find that the frequency shift in the original
frame is given by

$$\frac{E'}{\hbar} = \frac{E}{\hbar} + m\mathbf{k}\cdot\mathbf{v}. \tag{24.26}$$

If we substitute into (24.26) the explicit form (24.22) for the phase velocity $\mathbf{v}$, we find
that

$$E' = E + m\hbar\omega. \tag{24.27}$$

Equations  (24.25)  and  (24.27)  reveal  that  an  mth-order  Bragg  reflection  in  the 
moving  frame  corresponds  to  a  process  that  we  would  describe  in  the  laboratory 
frame  as  the  absorption  or  emission  of  m  phonons  of  a  given  type.  Multiphonon 
processes  involving  several  normal  modes  evidently  will  correspond  to  successive 
Bragg  reflections  from  the  corresponding  moving  diffraction  gratings. 
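The algebra leading from the Doppler shift to the $m$-phonon energy condition can be checked numerically. The sketch below is a minimal illustration in arbitrary units with hypothetical function names: it builds the phase velocity directed along $\mathbf{k}$ with magnitude $\omega/k$, applies a wave-vector change $\mathbf{q}' = \mathbf{q} + m\mathbf{k}$, and evaluates the frequency shift $(\mathbf{q}' - \mathbf{q})\cdot\mathbf{v}$, which should equal $m\omega$.

```python
def dot(a, b):
    """Scalar product of two 3-vectors given as tuples."""
    return sum(x * y for x, y in zip(a, b))

def phase_velocity(k, omega):
    """Vector along k with magnitude omega/|k|, i.e. (omega/|k|^2) * k."""
    k_squared = dot(k, k)
    return tuple(omega * component / k_squared for component in k)

def frequency_shift(q, q_prime, v):
    """Shift in E/hbar between scattered and incident wave: (q' - q) . v."""
    difference = tuple(b - a for a, b in zip(q, q_prime))
    return dot(difference, v)

# Arbitrary illustrative values (units with hbar = 1).
k = (0.3, 0.4, 0.0)
omega = 2.0
q = (1.0, -0.5, 0.2)
v = phase_velocity(k, omega)

shifts = {}
for m in (-2, -1, 1, 2):
    q_prime = tuple(qi + m * ki for qi, ki in zip(q, k))  # Bragg condition of order m
    shifts[m] = frequency_shift(q, q_prime, v)
```

Each entry of `shifts` equals `m * omega` to rounding error, independent of the choice of `q`, which is the statement that an $m$th-order reflection corresponds to the absorption or emission of $m$ phonons.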

The  wave  vector  condition  (24.25)  might  appear  to  lack  the  arbitrary  additive 
reciprocal  lattice  vector  that  is  present  in  the  crystal  momentum  conservation  law 
(24.6).  Actually,  it  is  implicit  in  (24.25)  as  well,  as  soon  as  we  acknowledge  that  the 
crystal  is  not  a  continuum,  but  a  discrete  system.  Only  in  a  continuum  is  it  possible 
to  assign  a  unique  wave  vector  k  to  each  normal  mode.  In  a  discrete  lattice  the  normal 
mode  wave  vector  is  defined  only  to  within  an  additive  reciprocal  lattice  vector 
(see  page  439). 

Thus from the wave point of view the law of energy conservation is simply a statement
of the Doppler shift for a wave reflected from a moving diffraction grating; the
law of crystal momentum conservation is the Bragg condition for that grating, the
additive reciprocal lattice vector expressing the variety of orientations that the grating
may be considered to have, due to the discrete periodic nature of the underlying
crystal.

12 Note that $m$ can have either sign, depending on the side of the grating from which the wave is
incident.


PROBLEMS 


1. (a) Draw diagrams showing some possible two-phonon processes in which a neutron enters
with momentum $\mathbf{p}$ and leaves with momentum $\mathbf{p}'$. In labeling the diagrams take due account of
the conservation laws.

(b)  Repeat  (a)  for  three-phonon  processes. 

2.  (a)  Repeat  the  method  of  graphical  solution  given  in  Figure  24.6  for  the  case  of  phonon 
emission. 

(b)  Verify  that  when  the  incident  neutron  energy  is  zero,  no  solutions  are  possible. 

(c)  In  qualitative  terms,  how  does  the  number  of  solutions  depend  on  the  incident  wave 
vector  k,? 

3.  This  problem  is  based  on  Appendices  L  and  N. 

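(a) Using the definition of $W$ given in Eq. (N.17) and the expansion (L.14) for $\mathbf{u}(\mathbf{R})$, show
that the Debye-Waller factor has the form:

$$e^{-2W} = \exp\left\{ -v \sum_s \int \frac{d^d k}{(2\pi)^d}\, \frac{\hbar}{2M\omega_s(\mathbf{k})}\, \bigl(\mathbf{q}\cdot\boldsymbol{\epsilon}_s(\mathbf{k})\bigr)^2 \coth\tfrac{1}{2}\beta\hbar\omega_s(\mathbf{k}) \right\} \tag{24.28}$$

where $v$ is the appropriate cell volume.

(b) Show that $e^{-2W} = 0$ in one and two dimensions. (Consider the behavior of the integrand
for small $k$.) What are the implications of this for the possible existence of one- or two-dimensional
crystalline ordering?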

(c) Estimate the size of the Debye-Waller factor for a three-dimensional crystal.

In Chapter 21 we reviewed the evidence forcing us to abandon the model of a static
lattice of ions,1 and in subsequent chapters we have cautiously relaxed this oversimplification.
We have, however, relied on two less restrictive simplifying assumptions:

1. Assumption of Small Oscillations We have assumed that although the ions are
not rigidly confined to their equilibrium sites, their displacements from those
equilibrium sites are small.

2.  Harmonic  Approximation  We  have  assumed  that  we  can  calculate  accurately 
the  properties  of  solids  by  retaining  only  the  leading  nonvanishing  term  in  the 
expansion  of  the  ionic  interaction  energy  about  its  equilibrium  value. 

The  assumption  of  small  oscillations  appears  to  be  reasonable  in  most  solids  (with 
the  important  exception  of  solid  helium)  for  temperatures  well  below  the  melting 
point.  In  any  event,  it  is  forced  on  us  by  computational  necessity.  When  it  fails,  one 
must  resort  to  very  complex  approximation  schemes  whose  validity  is  far  from  clear. 

When  the  assumption  of  small  oscillations  does  hold,  one  might  be  tempted  to 
conclude  that  corrections  to  the  harmonic  approximation  are  of  interest  only  in 
calculations  of  high  precision.  This  is  incorrect.  There  are  many  important  physical 
phenomena  that  cannot  be  explained  in  a  purely  harmonic  theory  because  they  are 
entirely due to the neglected higher-order terms in the expansion of the ionic
interaction energy about its equilibrium value.

In this chapter we shall survey some of those phenomena whose explanation
requires the presence of such anharmonic terms. We have already encountered two
examples:

1. The quantum theory of the harmonic crystal predicts that the specific heat should
obey the classical law of Dulong and Petit at high temperatures ($T \gg \Theta_D$). The
failure of the high-temperature specific heat to approach this value is an
anharmonic effect (see page 428 and also page 455).

2.  In  our  discussion  of  neutron  scattering  (Chapter  24)  we  argued  that  the  inelastic 
neutron  scattering  cross  section  should  have  sharp  peaks  at  energies  allowed  by 
the  conservation  laws  governing  one-phonon  processes.  However,  the  observed 
peaks,  though  quite  clearly  peaks,  have  a  measurable  width  (see  Figure  24.4).  We 
interpreted  this  broadening  to  be  a  consequence  of  the  fact  that  the  eigenstates 
of  the  harmonic  Hamiltonian  were  not  true  stationary  states  of  the  crystal — i.e., 
that  anharmonic  corrections  to  the  harmonic  approximation  were  significant. 
The  width  of  the  one-phonon  peaks  is  a  direct  measure  of  the  strength  of  the 
anharmonic  part  of  the  ionic  interaction  energy. 

Additional  phenomena  dominated  by  the  anharmonic  terms  can  be  grouped  into 
equilibrium  and  transport  properties: 

1. Equilibrium Properties There is a large class of equilibrium properties in
crystals, observable at any temperature, whose consistent explanation requires
the presence of anharmonic terms in the ionic interaction energy. The most
important of these is thermal expansion. In a rigorously harmonic crystal the
equilibrium size would not depend on temperature. The existence of anharmonic
terms is also implied by the fact that the elastic constants depend on volume and
temperature, and by the fact that the adiabatic and isothermal elastic constants
are not the same.

2. Transport Properties The thermal conductivity of an insulating solid is limited
in a perfect crystal only by the anharmonic terms in the ionic interaction energy.
A rigorously harmonic crystal would have an infinite thermal conductivity. This
is probably the most important transport property determined by the anharmonic
terms, but they also play essential roles in almost any of the processes by which
the lattice vibrations transmit energy.

1 We continue to use the word "ion" in the broadest sense, including in the term, for example, the
neutral atoms or molecules making up a molecular solid.


GENERAL  ASPECTS  OF  ANHARMONIC  THEORIES 

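The standard description of anharmonic terms is simple in principle, though plagued
in practice by notational complexities. The small-oscillation assumption is retained,
which permits one to keep only the leading corrections to the harmonic terms in the
expansion of the ionic interaction energy, $U$, in powers of the ionic displacements $\mathbf{u}$.
Thus one replaces (22.8) and (22.10) by:

$$U = U^{eq} + U^{harm} + U^{anh}, \tag{25.1}$$

where (see (22.10)) the anharmonic correction terms have the form:

$$U^{anh} = \sum_{n=3}^{\infty} \frac{1}{n!} \sum_{\mathbf{R}_1 \cdots \mathbf{R}_n} D^{(n)}_{\mu_1 \cdots \mu_n}(\mathbf{R}_1 \cdots \mathbf{R}_n)\, u_{\mu_1}(\mathbf{R}_1) \cdots u_{\mu_n}(\mathbf{R}_n), \tag{25.2}$$

where

$$D^{(n)}_{\mu_1 \cdots \mu_n}(\mathbf{R}_1 \cdots \mathbf{R}_n) = \left. \frac{\partial^n U}{\partial u_{\mu_1}(\mathbf{R}_1) \cdots \partial u_{\mu_n}(\mathbf{R}_n)} \right|_{\mathbf{u} = 0}. \tag{25.3}$$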

In the spirit of the assumption of small oscillations one might be tempted to retain
only the leading terms (cubic in the $\mathbf{u}$) in $U^{anh}$, and this is frequently done. However,
there are two reasons for retaining the quartic terms as well:

1.  The  Hamiltonian  that  retains  only  the  cubic  anharmonic  terms  is  unstable:  The 
potential  energy  can  be  made  as  large  and  negative  as  desired  by  assigning  suitable 
values  to  the  u’s  (see  Problem  1).  This  implies  that  the  cubic  Hamiltonian  has  no 
ground  state,2  so  by  replacing  the  full  Hamiltonian  with  one  truncated  at  the 
cubic  anharmonic  terms,  we  have  replaced  a  well-defined  physical  problem  by 
one  with  spectacular,  but  artificial,  mathematical  pathologies.  Nevertheless,  the 
additional  cubic  terms  are  frequently  treated  as  a  small  perturbation,  and  physi¬ 
cally  sensible  results  are  found  in  spite  of  the  formal  absurdity  of  the  procedure. 
If  one  insists,  however,  on  dealing  with  a  well-defined  problem,  one  must  retain 
the  quartic  terms  as  well. 


2. Contributions from the cubic terms often behave anomalously not for the
reason given above but because of the rather stringent requirements the
conservation laws impose on processes mediated by these terms. When this happens
quartic terms can be of comparable importance even when the small oscillations
assumption is a good one.

2 See, for example, G. Baym, Phys. Rev. 117, 886 (1960).

It  is  the  almost  universal  practice  to  retain  no  terms  beyond  the  quartic  anharmonic 
terms  in  detailed  calculations,  unless  one  is  engaged  in  proving  very  general  types  of 
results,  or  dealing  with  crystals  (notably  solid  helium)  for  which  the  entire  assumption 
of  small  oscillations  is  of  doubtful  validity.  Furthermore,  there  is  a  tendency  in  practice 
to  keep  only  the  cubic  anharmonic  terms,  though  one  must  always  keep  in  mind  the 
pitfalls  described  above. 
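The instability of a purely cubic truncation is easy to see in a one-dimensional caricature. The sketch below is not the crystal Hamiltonian: it is a single-coordinate model potential with assumed, arbitrary coefficients, meant only to show that the cubic term drives the energy to arbitrarily large negative values while a positive quartic term restores a lower bound.

```python
def model_potential(u, k2=1.0, g3=-0.5, g4=0.0):
    """One-coordinate model: U(u) = k2*u^2/2 + g3*u^3/6 + g4*u^4/24.

    k2, g3 and g4 are hypothetical coefficients standing in for the
    harmonic, cubic and quartic terms of the expansion.
    """
    return 0.5 * k2 * u**2 + g3 * u**3 / 6.0 + g4 * u**4 / 24.0

displacements = [1.0, 10.0, 100.0, 1000.0]

# Cubic truncation: the energy decreases without bound at large displacement.
cubic_only = [model_potential(u) for u in displacements]

# Adding a positive quartic term bounds the potential from below.
with_quartic = [model_potential(u, g4=0.5) for u in displacements]
lowest_sampled = min(model_potential(float(u), g4=0.5) for u in range(-1000, 1001))
```

For small displacements the two versions agree closely, which is why treating the cubic term as a perturbation can still give sensible results even though the truncated problem has no ground state.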


EQUATION  OF  STATE  AND  THERMAL  EXPANSION  OF  A  CRYSTAL 

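To calculate the equation of state, we write the pressure as $P = -(\partial F/\partial V)_T$, where $F$,
the Helmholtz free energy, is given by $F = U - TS$. Since the entropy $S$ and the internal
energy $U$ are related by

$$T\left(\frac{\partial S}{\partial T}\right)_V = \left(\frac{\partial U}{\partial T}\right)_V, \tag{25.4}$$

we can express the pressure entirely in terms of the internal energy, in the form:3

$$P = -\frac{\partial}{\partial V}\left[ U - T\int_0^T \frac{dT'}{T'}\, \frac{\partial}{\partial T'} U(T', V) \right]. \tag{25.5}$$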
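The intermediate steps behind this expression for the pressure can be spelled out as follows. This is a sketch of the standard thermodynamic argument, starting from the relation $T(\partial S/\partial T)_V = (\partial U/\partial T)_V$ and assuming that the entropy vanishes at $T = 0$:

```latex
\begin{align*}
S(T,V) &= \int_0^T \frac{dT'}{T'} \left(\frac{\partial U(T',V)}{\partial T'}\right)_V
  && \text{(integrating in $T$, with } S(0,V) = 0\text{)} \\
F &= U - TS = U - T\int_0^T \frac{dT'}{T'}\,\frac{\partial}{\partial T'} U(T',V) \\
P &= -\left(\frac{\partial F}{\partial V}\right)_T
   = -\frac{\partial}{\partial V}\left[\, U - T\int_0^T \frac{dT'}{T'}\,
     \frac{\partial}{\partial T'} U(T',V) \right]
\end{align*}
```

The only input beyond the definitions is the vanishing of the entropy at zero temperature, which fixes the integration constant in the first line.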

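If the small-oscillations assumption is valid, then the internal energy of an
insulating crystal should be accurately given by the result (23.11) of the harmonic
approximation:

$$U = U^{eq} + \frac{1}{2}\sum_{\mathbf{k}s} \hbar\omega_s(\mathbf{k}) + \sum_{\mathbf{k}s} \frac{\hbar\omega_s(\mathbf{k})}{e^{\beta\hbar\omega_s(\mathbf{k})} - 1}. \tag{25.6}$$

Substituting this into the general form, (25.5), one finds that4

$$P = -\frac{\partial}{\partial V}\left[ U^{eq} + \sum_{\mathbf{k}s} \tfrac{1}{2}\hbar\omega_s(\mathbf{k}) \right] + \sum_{\mathbf{k}s} \left( -\frac{\partial}{\partial V}\bigl(\hbar\omega_s(\mathbf{k})\bigr) \right) \frac{1}{e^{\beta\hbar\omega_s(\mathbf{k})} - 1}. \tag{25.7}$$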

This result has a very simple structure: The first term (which is all that survives at
$T = 0$) is the negative volume derivative of the ground-state energy. At nonzero
temperatures  this  must  be  supplemented  with  the  negative  volume  derivative  of  the 
phonon  energies,  the  contribution  of  each  phonon  level  being  weighted  with  its  mean 
occupation  number. 

According to (25.7), the equilibrium pressure depends on temperature only because
the normal-mode frequencies depend on the equilibrium volume of the crystal. If,
however, the potential energy of the crystal were rigorously of the harmonic form
(Eqs. (22.46) and (22.8))

$$U^{eq} + \frac{1}{2}\sum_{\mathbf{R}\mathbf{R}'} \mathbf{u}(\mathbf{R})\, \mathbf{D}(\mathbf{R} - \mathbf{R}')\, \mathbf{u}(\mathbf{R}'), \tag{25.8}$$

with force constants $\mathbf{D}$ that were independent of the $\mathbf{u}(\mathbf{R})$, then the normal-mode
frequencies could have no volume dependence at all.5

3 We use the fact that the entropy density vanishes at $T = 0$ (third law of thermodynamics) to dispose
of an integration constant.

4 See Problem 2.

To see this, note that to determine the volume dependence of the normal-mode
frequencies, we must examine the small-oscillations problem not only for the original
Bravais lattice given by vectors $\mathbf{R}$, but also for the expanded (or contracted) lattices
given by the vectors6 $\bar{\mathbf{R}} = (1 + \epsilon)\mathbf{R}$, whose volumes differ by the factor $(1 + \epsilon)^3$
from the volume of the original lattice. If the potential energy is rigorously of the
form (25.8) even when the $\mathbf{u}(\mathbf{R})$ are not small, then the new small-oscillations problem
is easily reduced back to the old one. For the ionic positions $\mathbf{r}(\mathbf{R}) = \bar{\mathbf{R}} + \bar{\mathbf{u}}(\mathbf{R})$ can
also be written as $\mathbf{r}(\mathbf{R}) = \mathbf{R} + \mathbf{u}(\mathbf{R})$, provided that the displacements $\mathbf{u}$ with respect
to the original lattice are related to the displacements $\bar{\mathbf{u}}$ with respect to the expanded
(or contracted) lattice by

$$\mathbf{u}(\mathbf{R}) = \epsilon\mathbf{R} + \bar{\mathbf{u}}(\mathbf{R}). \tag{25.9}$$

If the potential energy is rigorously given by (25.8), then to evaluate the energy of the
configuration given by $\mathbf{r}(\mathbf{R}) = \bar{\mathbf{R}} + \bar{\mathbf{u}}(\mathbf{R})$ we need not perform a new expansion of $U$
about the new equilibrium positions $\bar{\mathbf{R}}$, but may simply substitute the equivalent
displacements $\mathbf{u}$ given by (25.9) into (25.8). The resulting expression for the potential
energy of the configuration in which the ions are displaced by $\bar{\mathbf{u}}(\mathbf{R})$ from equilibrium
positions at $\bar{\mathbf{R}}$ is7

$$U^{eq} + \frac{1}{2}\epsilon^2 \sum_{\mathbf{R}\mathbf{R}'} \mathbf{R}\, \mathbf{D}(\mathbf{R} - \mathbf{R}')\, \mathbf{R}' + \frac{1}{2}\sum_{\mathbf{R}\mathbf{R}'} \bar{\mathbf{u}}(\mathbf{R})\, \mathbf{D}(\mathbf{R} - \mathbf{R}')\, \bar{\mathbf{u}}(\mathbf{R}'). \tag{25.10}$$

The first two terms in (25.10) are independent of the new displacements $\bar{\mathbf{u}}$ and give
the potential energy of the new equilibrium configuration. The dynamics are
determined by the term quadratic in the $\bar{\mathbf{u}}$. Since the coefficients in this term are identical
to the coefficients of the corresponding term in (25.8), the dynamics of the oscillations
about the new equilibrium positions will be identical to those of the old. The
normal-mode frequencies are therefore unaffected by the change in equilibrium volume.
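A one-dimensional toy model makes the contrast concrete. The sketch below assumes a chain of equal masses with nearest-neighbor pair potential $\phi(r)$, for which a mode with phase advance $\vartheta$ per site has frequency $2\sqrt{\phi''(a)/M}\,|\sin(\vartheta/2)|$ at spacing $a$. The purely quadratic spring and the Morse potential, together with all parameter values, are illustrative assumptions and are not taken from the text.

```python
import math

def mode_frequency(phi_second_derivative, mass, phase):
    """Nearest-neighbor chain: omega = 2*sqrt(phi''/M)*|sin(phase/2)|."""
    return 2.0 * math.sqrt(phi_second_derivative / mass) * abs(math.sin(phase / 2.0))

def harmonic_phi2(r, spring_constant=1.0):
    """Second derivative of (K/2)*(r - a0)^2: a constant, whatever the spacing r."""
    return spring_constant

def morse_phi2(r, depth=1.0, b=2.0, a0=1.0):
    """Second derivative of the Morse potential depth*(1 - exp(-b*(r - a0)))^2."""
    x = math.exp(-b * (r - a0))
    return 2.0 * depth * b * b * (2.0 * x * x - x)

def frequencies_vs_strain(phi2, strains, phase=math.pi / 2.0, mass=1.0, a0=1.0):
    """Frequency of one fixed mode as the spacing is scaled to (1 + strain)*a0."""
    return [mode_frequency(phi2((1.0 + e) * a0), mass, phase) for e in strains]

strains = [0.0, 0.01, 0.02]
harmonic_frequencies = frequencies_vs_strain(harmonic_phi2, strains)
morse_frequencies = frequencies_vs_strain(morse_phi2, strains)
```

In this sketch the harmonic chain returns the same frequency at every strain, while the Morse chain softens as it expands; that softening is the volume dependence of the frequencies that a rigorously harmonic potential cannot produce.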

Because the normal-mode frequencies of a rigorously harmonic crystal are
unaffected by a change in volume, the pressure given by (25.7) depends only on volume,
but not on temperature. Thus in a rigorously harmonic crystal the pressure required
to maintain a given volume does not vary with temperature. Since

$$\left(\frac{\partial V}{\partial T}\right)_P = -\frac{(\partial P/\partial T)_V}{(\partial P/\partial V)_T}, \tag{25.11}$$

it also follows that the equilibrium volume cannot vary with temperature at fixed
pressure. Thus the coefficient of thermal expansion,8

$$\alpha = \frac{1}{l}\left(\frac{\partial l}{\partial T}\right)_P = \frac{1}{3V}\left(\frac{\partial V}{\partial T}\right)_P = \frac{1}{3B}\left(\frac{\partial P}{\partial T}\right)_V, \tag{25.12}$$

must vanish.

5 This is a generalization of the familiar observation that the frequency of a harmonic oscillator
does not depend on the amplitude of vibration.

6 For simplicity we consider only monatomic Bravais lattices whose symmetry is such that a uniform
isotropic expansion (or contraction) yields a new equilibrium configuration (as opposed, for example, to
a crystal of orthorhombic symmetry, where the scale factor $(1 + \epsilon)$ would differ along different crystal
axes). The final result, however, is quite general.

7 Terms linear in the $\bar{\mathbf{u}}$ must vanish if the new sites $\bar{\mathbf{R}}$ do indeed give an equilibrium configuration
of the crystal.

The absence of thermal expansion in a rigorously harmonic lattice implies
thermodynamically several other anomalies. The constant-volume and constant-pressure
specific heats are related by:

$$c_p = c_v - \frac{T\,(\partial P/\partial T)_V^2}{V\,(\partial P/\partial V)_T}, \tag{25.13}$$

and must therefore be identical in such a solid. So must the adiabatic and isothermal
compressibilities, since

$$\frac{c_p}{c_v} = \frac{(\partial P/\partial V)_S}{(\partial P/\partial V)_T}. \tag{25.14}$$

Results  such  as  these  are  anomalous,  because  in  real  crystals  the  force  constants  D 
in  the  harmonic  approximation  to  the  potential  energy  do  depend  on  the  equilibrium 
lattice  about  which  the  harmonic  expansion  is  made.  Implicit  in  this  dependence  is 
the  fact  that  in  real  crystals  the  harmonic  approximation  is  not  exact.  Indeed,  it  is 
possible to express the amount by which the normal-mode frequencies change when
the equilibrium lattice vectors are changed from $\mathbf{R}$ to $(1 + \epsilon)\mathbf{R}$, in terms of the
coefficients of the anharmonic terms appearing in the expansion of the potential
energy9 about the equilibrium positions $\mathbf{R}$. In this way, measurements of the thermal
expansion  coefficient  can  be  made  to  yield  information  about  the  magnitude  of  the 
anharmonic  corrections  to  the  energy. 


THERMAL EXPANSION; THE GRÜNEISEN PARAMETER


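Having recognized that the phonon frequencies of a real crystal do depend on the
equilibrium volume, we may continue with the analysis of the equation of state (25.7).
Substituting this form of the pressure into (25.12), we find that the coefficient of thermal
expansion may be written as

$$\alpha = \frac{1}{3B}\sum_{\mathbf{k}s} \left( -\frac{\partial}{\partial V}\hbar\omega_s(\mathbf{k}) \right) \frac{\partial}{\partial T} n_s(\mathbf{k}), \tag{25.15}$$

where $n_s(\mathbf{k}) = [e^{\beta\hbar\omega_s(\mathbf{k})} - 1]^{-1}$. If we compare this with the formula (23.12) for the
specific heat, which can be written in the form

$$c_v = \sum_{\mathbf{k}s} \frac{\hbar\omega_s(\mathbf{k})}{V}\, \frac{\partial}{\partial T} n_s(\mathbf{k}), \tag{25.16}$$

the following representation is suggested for the thermal expansion coefficient $\alpha$: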


8  We  continue  to  assume  a  symmetric  enough  crystal  that  all  linear  dimensions  scale  in  the  same 
way  with  temperature.  Crystals  of  noncubic  symmetry  have  direction-dependent  expansion  coefficients. 
We have introduced the bulk modulus $B$, defined (Eq. (2.35)) by $B = -V(\partial P/\partial V)_T$.

9 See Problem 4.

First, define a quantity

$$c_{vs}(\mathbf{k}) = \frac{\hbar\omega_s(\mathbf{k})}{V}\, \frac{\partial}{\partial T} n_s(\mathbf{k}), \tag{25.17}$$

which is the contribution of the normal mode $\mathbf{k}, s$ to the specific heat. Next, define
a quantity $\gamma_{\mathbf{k}s}$, known as the Grüneisen parameter for the mode $\mathbf{k}s$, as the negative
logarithmic derivative of the frequency of the mode with respect to volume; i.e.,

$$\gamma_{\mathbf{k}s} = -\frac{V}{\omega_s(\mathbf{k})}\, \frac{\partial \omega_s(\mathbf{k})}{\partial V} = -\frac{\partial(\ln \omega_s(\mathbf{k}))}{\partial(\ln V)}. \tag{25.18}$$

Finally, define an overall Grüneisen parameter

$$\gamma = \frac{\sum_{\mathbf{k}s} \gamma_{\mathbf{k}s}\, c_{vs}(\mathbf{k})}{\sum_{\mathbf{k}s} c_{vs}(\mathbf{k})}, \tag{25.19}$$

as the weighted average of the $\gamma_{\mathbf{k}s}$, in which the contribution of each normal mode
is weighted by its contribution to the specific heat. Using these definitions, we may
write (25.15) in the simple form:

$$\alpha = \frac{\gamma c_v}{3B}. \tag{25.20}$$

The coefficient of thermal expansion is represented in this rather peculiar way
because in the simplest models the volume dependence of the normal-mode
frequencies is contained in a universal multiplicative factor, and therefore the $\gamma_{\mathbf{k}s}$ are
the same for all normal modes. Under these circumstances, (25.15) reduces directly
to (25.20) without need for the intervening definitions. In a Debye model, for example,
all the normal-mode frequencies scale linearly with the cutoff frequency $\omega_D$, and
therefore

$$\gamma_{\mathbf{k}s} \equiv -\frac{\partial(\ln \omega_D)}{\partial(\ln V)}. \tag{25.21}$$

Since the bulk modulus appearing in the denominator of (25.20) is only weakly
temperature-dependent,10 theories with constant $\gamma_{\mathbf{k}s}$ predict that the coefficient of
thermal expansion should have the same temperature dependence as the specific
heat. In particular, it should approach a constant at temperatures large compared
with $\Theta_D$, and should vanish as $T^3$ as $T \to 0$.

The representation (25.20) preserves these two limiting forms. In any real solid
the $\gamma_{\mathbf{k}s}$ will not be the same for all normal modes, and $\gamma$ will therefore depend on
temperature. Nevertheless, Eq. (25.19) implies that $\gamma$ approaches a constant value
as $T \to 0$, and a (different) constant value at temperatures large compared with
$\Theta_D$. Consequently, the limiting temperature dependence of the thermal expansion
coefficient, even in the general case, will be

$$\alpha \sim T^3, \quad T \to 0; \qquad \alpha \sim \text{constant}, \quad T \gg \Theta_D. \tag{25.22}$$

Some Grüneisen parameters and their variation with temperature are given in
Table 25.1 and Figure 25.1.


10 In any event, $B$ is directly measurable, so its slight temperature dependence can be allowed for.

Table 25.1

LINEAR EXPANSION COEFFICIENTS AND
GRÜNEISEN PARAMETERS FOR SOME ALKALI HALIDES

T (K)        LiF     NaCl    NaI     KCl     KBr     KI      RbI     CsBr
0      α(a)  0       0       0       0       0       0       0       0
       γ     1.70    0.90    1.04    0.32    0.29    0.28    -0.18   2.0
20     α     0.063   0.62    5.1     0.74    2.23    4.5     6.0     10.0
       γ     1.60    0.96    1.22    0.53    0.74    0.79    0.85    --
65     α     3.6     15.8    27.3    17.5    22.5    26.0    28.0    35.2
       γ     1.59    1.39    1.64    1.30    1.42    1.35    1.35    --
283    α     32.9    39.5    45.1    36.9    38.5    40.0    39.2    47.1
       γ     1.58    1.57    1.71    1.45    1.49    1.47    --      2.0

(a) The units of the linear expansion coefficient α are $10^{-6}$ K$^{-1}$.
Source: G. K. White, Proc. Roy. Soc. London A286, 204 (1965).


Figure 25.1

Grüneisen parameter vs. $T/\Theta_D$ for some alkali halide crystals.
(From G. K. White, Proc. Roy. Soc. London A286, 204 (1965).)




THERMAL  EXPANSION  OF  METALS 

The above discussion assumes that the only degrees of freedom are ionic, i.e., that
the solid is an insulator. In the case of a metal, we can estimate the effects of the
additional electronic degrees of freedom through Eq. (25.12). Again, the bulk modulus
is very weakly temperature-dependent and can be replaced by its value at $T = 0$.
For a rough estimate of the electronic contribution to $(\partial P/\partial T)_V$ we simply add to
the contribution from the lattice vibrations that of a free electron gas. Since the free
electron gas equation of state is (see Eq. (2.101))

$$P = \frac{2}{3}\,\frac{U}{V}, \tag{25.23}$$

it follows that

$$\left(\frac{\partial P^{el}}{\partial T}\right)_V = \frac{2}{3}\, c_v^{el}, \tag{25.24}$$

and therefore the coefficient of thermal expansion becomes

$$\alpha = \frac{1}{3B}\left( \gamma c_v^{ion} + \frac{2}{3}\, c_v^{el} \right). \tag{25.25}$$

Since the Grüneisen parameter is typically of order unity, the electronic
contribution to the temperature dependence of the expansion coefficient will be appreciable
only at temperatures at which the electronic contribution to the specific heat is
comparable to that of the ions, i.e., at temperatures of the order of 10 K or less
(see Eq. (23.30)).11 Thus the most striking predicted difference between the
expansion coefficient of metals and insulators is that at very low temperatures $\alpha$ should
vanish linearly in $T$ in metals but as $T^3$ in insulators. This behavior is confirmed
by experiment.12

Some characteristic thermal expansion coefficients for metals are given in Table
25.2.
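The competition between the lattice and electronic terms at low temperature can be sketched numerically. The example below assumes the expansion coefficient has the form $\alpha = (1/3B)(\gamma c_v^{ion} + \tfrac{2}{3} c_v^{el})$, which follows from Eq. (25.12) with the free electron pressure $P = 2U/3V$, and inserts the low-temperature forms $c_v^{ion} = aT^3$ and $c_v^{el} = gT$. The coefficients `a_ion` and `g_el`, like all numerical values, are hypothetical and chosen only for illustration.

```python
def expansion_coefficient_metal(T, bulk_modulus, gamma, a_ion, g_el):
    """alpha = (gamma*c_ion + (2/3)*c_el) / (3*B) with low-temperature specific heats."""
    c_ion = a_ion * T**3   # lattice specific heat, Debye T^3 law
    c_el = g_el * T        # electronic specific heat, linear in T
    return (gamma * c_ion + (2.0 / 3.0) * c_el) / (3.0 * bulk_modulus)

def crossover_temperature(gamma, a_ion, g_el):
    """Temperature at which gamma*a_ion*T^3 equals (2/3)*g_el*T."""
    return (2.0 * g_el / (3.0 * gamma * a_ion)) ** 0.5

# Hypothetical coefficients in arbitrary consistent units.
gamma, a_ion, g_el, bulk_modulus = 1.5, 1.0e-4, 1.0e-2, 1.0

T_cross = crossover_temperature(gamma, a_ion, g_el)
alpha_over_T_cold = expansion_coefficient_metal(0.01, bulk_modulus, gamma, a_ion, g_el) / 0.01
electronic_limit = (2.0 / 9.0) * g_el / bulk_modulus
```

Well below the crossover temperature, $\alpha/T$ approaches the constant `electronic_limit`, which is the linear-in-$T$ behavior expected for metals; well above it, the $T^3$ lattice term takes over.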


LATTICE  THERMAL  CONDUCTIVITY :  THE  GENERAL  APPROACH 

As we have noted in Chapters 22 and 23, thermal energy can be stored in the
vibrational normal modes of the crystal. Since these modes are elastic waves, one can
also transport thermal energy through the lattice of ions by setting up suitable wave
packets of normal modes, just as one can send impulses down a stretched elastic
string by plucking one end. At low temperatures the fact that the allowed energies
of a normal mode are quantized is of critical importance, and it is much more
convenient to describe this transport of energy in the phonon language.


11 The electrons will, of course, make a substantial contribution to the (roughly temperature
independent) bulk modulus (see pp. 39, 40).

12 See G. K. White, Proc. Roy. Soc. London A286, 204 (1965), and K. Andres, Phys. Kondens. Mater.
2, 294 (1964).
Table 25.2

LINEAR EXPANSION COEFFICIENTS FOR
SELECTED METALS AT ROOM TEMPERATURE

METAL    COEFFICIENT(a)        METAL    COEFFICIENT(a)

Li       45                    Ca       22.5
Na       71                    Ba       18
K        83                    Nb       7.1
Rb       66                    Fe       11.7
Cs       97                    Zn       61 (∥)
                                        14 (⊥)
Cu       17.0                  Al       23.6
Ag       18.9                  In       -7.5 (∥)
                                        50 (⊥)
Au       13.9                  Pb       28.8
Be       9.4 (∥)               Ir       6.5
         11.7 (⊥)
Mg       25.7 (∥)
         24.3 (⊥)

(a) The units are $10^{-6}\ \mathrm{K}^{-1}$. In the noncubic cases separate coefficients are listed
for expansion parallel (∥) and perpendicular (⊥) to the axis of highest symmetry.
Source: W. B. Pearson, A Handbook of Lattice Spacings and Structures of Metals
and Alloys, Pergamon, New York, 1958.


In using the phonon picture to describe energy transport one considers phonons
whose positions are localized within a definite region, which is small on the scale of
the macroscopic dimensions of the crystal, though it must be large on the scale of the
interionic spacing. Since a single normal mode with a definite wave vector k involves
the motion of ions throughout the entire crystal, a state consisting of a single phonon
with wave vector k cannot describe a localized disturbance of the crystal. However,
by superposing states of the crystal in each of which a normal mode with wave vector in
some small neighborhood $\Delta k$ of k is excited, one can construct localized phonon-like
disturbances. The justification for the change from wave to particle language relies
on the properties of wave packets. Rather than exploring the detailed and relatively
unilluminating mathematics of wave packets, we simply stress the analogy with the
electronic case,13 and allow ourselves the same liberty with phonons: By sacrificing
some precision in the specification of phonon wave vector we can construct phonon
wave functions14 that are localized on a scale $\Delta x \approx 1/\Delta k$.
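
The trade-off between spread in wave vector and spatial localization can be checked numerically. The following is a minimal sketch, not the construction used in the text: it superposes one-dimensional plane waves with an assumed Gaussian weight of width `sigma_k` about a central wave vector `k0` and measures the root-mean-square width of the resulting packet. All names and parameter values are illustrative assumptions. For a Gaussian weight the product of the two widths should come out close to 1/2, consistent with $\Delta x \approx 1/\Delta k$ up to a factor of order unity.

```python
import cmath
import math

def packet_widths(sigma_k=0.05, k0=1.0, n_k=201, n_x=401):
    """Return (rms width in x, rms width in k) of a Gaussian wave packet.

    The packet is psi(x) = sum_k A(k) exp(i k x) with
    A(k) = exp(-(k - k0)^2 / (4 sigma_k^2)), so |A(k)|^2 has rms width sigma_k.
    """
    # Wave vectors spanning +-6 sigma_k about k0.
    ks = [k0 + sigma_k * (-6.0 + 12.0 * i / (n_k - 1)) for i in range(n_k)]
    amps = [math.exp(-((k - k0) ** 2) / (4.0 * sigma_k ** 2)) for k in ks]

    # Positions spanning several expected packet widths.
    x_max = 4.0 / sigma_k
    xs = [x_max * (-1.0 + 2.0 * j / (n_x - 1)) for j in range(n_x)]

    density = []
    for x in xs:
        psi = sum(a * cmath.exp(1j * k * x) for a, k in zip(amps, ks))
        density.append(abs(psi) ** 2)

    norm_x = sum(density)
    mean_x = sum(x * d for x, d in zip(xs, density)) / norm_x
    var_x = sum((x - mean_x) ** 2 * d for x, d in zip(xs, density)) / norm_x

    weights = [a * a for a in amps]
    norm_k = sum(weights)
    mean_k = sum(k * w for k, w in zip(ks, weights)) / norm_k
    var_k = sum((k - mean_k) ** 2 * w for k, w in zip(ks, weights)) / norm_k

    return math.sqrt(var_x), math.sqrt(var_k)

width_x, width_k = packet_widths()
print("Delta x =", width_x, " Delta k =", width_k, " product =", width_x * width_k)
```

Halving `sigma_k` doubles the spatial width, which is the sense in which precision in wave vector is traded for localization.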

In a perfectly harmonic crystal the phonon states are stationary states. Therefore,
if a distribution of phonons is established that carries a thermal current (for example,
by having an excess of phonons with similarly directed group velocities) that distribution
will remain unaltered in the course of time, and the thermal current will
remain forever undegraded. A perfectly harmonic crystal would have an infinite thermal
conductivity.15

13 See pages 50 and 216. The point at issue here is completely parallel to that we discussed in replacing
the wave-mechanical description of electrons with the classical picture of localized particles. The transport
of energy by thermal currents can be viewed in the same way as the transport of charge by electrical currents.
The carriers are now phonons and the quantity carried by each phonon is its energy $\hbar\omega_s(\mathbf{k})$.

14 Note that the uncertainty in wave vector must be small compared with the dimensions of the
Brillouin zone if we are to assign a wave vector with the conventional properties to the phonon. Since
the zone dimensions are on the order of the inverse lattice constant, $\Delta x$ must be large compared with the
interionic spacing, and as one might have guessed the phonons are not localized on the microscopic scale.

The  thermal  conductivity  of  real  insulators16  is  not  infinite  for  several  reasons: 

1.  The  inevitable  lattice  imperfections,  impurities,  isotopic  inhomogeneities,  and  the 
like  (Chapter  30)  that  afflict  real  crystals  act  as  scattering  centers  for  the  phonons, 
and  help  to  degrade  any  thermal  current. 

2.  Even  in  a  perfect,  pure  crystal,  the  phonons  would  eventually  be  scattered  by  the 
surface  of  the  sample,  and  this  would  limit  the  thermal  current. 

3.  Even  in  a  perfect,  pure,  and  infinite  crystal,  the  stationary  states  of  the  harmonic 
Hamiltonian  are  only  approximate  stationary  states  of  the  full  anharmonic 
Hamiltonian,  and  therefore  a  state  with  a  definite  set  of  phonon  occupation 
numbers  will  not  remain  unchanged  in  the  course  of  time. 

In  the  present  discussion  we  are  primarily  interested  in  the  last  point,  which  is  the 
only  intrinsic  source  of  thermal  resistance  that  cannot  in  principle  be  systematically 
reduced  by  making  bigger  and  better  crystals. 

The point of view generally taken in discussing this effect of anharmonicity is to
view the anharmonic corrections to the harmonic Hamiltonian $H_0$ as perturbations
which cause transitions from one harmonic eigenstate to another, i.e., which lead
to the creation, destruction, or scattering of phonons. Thus the anharmonic part of
the  ionic  interaction  plays  the  same  role  in  the  theory  of  heat  transport  in  an  insulator 
as  impurities  or  the  electron-phonon  interaction  do  in  the  theory  of  the  transport 
of  charge  in  a  metal. 

If  the  anharmonic  terms  are  small17  compared  with  the  harmonic  part  of  the 
Hamiltonian,  then  it  should  suffice  to  calculate  their  effects  in  perturbation  theory, 
and  this  is  what  is  generally  done.  It  can  be  shown  (Appendix  O)  that  in  lowest-order 
perturbation  theory  an  anharmonic  term  of  degree  n  in  the  ionic  displacements  u 
can  cause  transitions  between  two  eigenstates  of  the  harmonic  Hamiltonian,  precisely 
n  of  whose  phonon  occupation  numbers  differ.  Thus  the  cubic  anharmonic  term  can 
cause  the  following  kinds  of  transitions: 

1. All phonon occupation numbers in the initial and final states are unchanged
except that $n_{\mathbf{k}s} \to n_{\mathbf{k}s} - 1$, $n_{\mathbf{k}'s'} \to n_{\mathbf{k}'s'} + 1$, and $n_{\mathbf{k}''s''} \to n_{\mathbf{k}''s''} + 1$. Evidently such a
transition can be thought of as an event in which a phonon from branch s with
wave vector k decays into two with wave vectors and branch indices k's' and k''s''.

2. All phonon occupation numbers in the initial and final states are unchanged
except that $n_{\mathbf{k}s} \to n_{\mathbf{k}s} - 1$, $n_{\mathbf{k}'s'} \to n_{\mathbf{k}'s'} - 1$, and $n_{\mathbf{k}''s''} \to n_{\mathbf{k}''s''} + 1$. Such a transition
can be viewed as an event in which two phonons with wave vectors and branch
indices ks and k's' merge to form a single phonon in branch s'' with wave vector
k''.


15 This is analogous to the fact (see page 141) that electrons in a perfect periodic potential (without
defects or lattice vibrations) would be perfect electrical conductors.

16 We speak of insulators, though our remarks also apply to the ionic contribution to the thermal
conductivity of metals. The latter, however, is generally masked by an electronic contribution that is one
or two orders of magnitude larger.

17 The fact that one-phonon peaks are clearly identifiable in the scattering of neutrons from crystals
is some indication of this (see Chapter 24).


Such  processes  are  often  depicted  schematically  as  in  Figure  25.2. 


Figure  25.2 

Processes produced in lowest-order perturbation theory by cubic and quartic anharmonic terms.
(a) Cubic: one phonon decays into two. (b) Cubic: two phonons merge into one. (c) Quartic: one
phonon decays into three. (d) Quartic: two phonons turn into two others (phonon-phonon
scattering). (e) Quartic: three phonons merge into one.


The  two  other  cubic  processes  that  come  to  mind  (three  phonons  disappearing  or 
three  new  phonons  being  created)  are  forbidden  by  energy  conservation.  Since  the 
total  energy  of  the  three  phonons  must  be  positive,  energy  would  be  lost  in  a  three- 
phonon  annihilation,  and  found  from  nowhere  in  a  three-phonon  creation. 

In a similar vein, the quartic terms can yield transitions which are simply characterized
as ones in which one phonon decays into three, three merge into one, or two
of a given kind are replaced by two others (see Figure 25.2).
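
The counting of cubic and quartic processes above follows one simple rule: a term of degree n changes n occupation numbers by one each, and energy conservation excludes the two cases in which all n phonons are created or all n are destroyed. The sketch below enumerates the surviving possibilities. The function name and the (destroyed, created) bookkeeping are illustrative assumptions, and the rule covers only lowest-order perturbation theory.

```python
def anharmonic_processes(n):
    """List (phonons destroyed, phonons created) for a degree-n anharmonic term.

    Every phonon has positive energy, so a process that only creates or only
    destroys phonons cannot conserve energy and is dropped.
    """
    allowed = []
    for destroyed in range(n + 1):
        created = n - destroyed
        if destroyed >= 1 and created >= 1:
            allowed.append((destroyed, created))
    return allowed

for degree, name in ((3, "cubic"), (4, "quartic")):
    for destroyed, created in anharmonic_processes(degree):
        print(name, ":", destroyed, "phonon(s) in ->", created, "phonon(s) out")
```

For n = 3 this gives decay (1 in, 2 out) and merging (2 in, 1 out); for n = 4 it gives the three quartic processes of Figure 25.2.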

Higher-order  anharmonic  terms  contribute  transitions  as  well,  but,  in  the  spirit 
of  the  assumption  of  small  oscillations,  the  cubic  and  quartic  terms  are  expected  to 
be  the  most  important.  Frequently,  only  transitions  caused  by  the  cubic  terms  are 
considered.  However,  as  we  have  mentioned,  the  conservation  laws  often  impose  very 
stringent  restrictions  on  the  processes  produced  by  the  cubic  terms.  As  a  result,  even 
though  the  quartic  terms  are  smaller  than  the  cubic,  so  few  cubic  processes  may  be 
allowed that both types of process may yield comparable transition rates.

In the discussion that follows we shall not appeal to any detailed features of the
anharmonic terms, other than those embodied in the conservation laws of energy
and crystal momentum.18 If the phonon occupation numbers are $n_{\mathbf{k}s}$ before the
transition and $n'_{\mathbf{k}s}$ after, then energy conservation requires

$$ \sum_{\mathbf{k}s} \hbar\omega_s(\mathbf{k})\, n_{\mathbf{k}s} = \sum_{\mathbf{k}s} \hbar\omega_s(\mathbf{k})\, n'_{\mathbf{k}s}, \tag{25.26} $$

and crystal momentum conservation requires

$$ \sum_{\mathbf{k}s} \mathbf{k}\, n_{\mathbf{k}s} = \sum_{\mathbf{k}s} \mathbf{k}\, n'_{\mathbf{k}s} + \mathbf{K}, \tag{25.27} $$

where K is some reciprocal lattice vector.

18 Equation (M.18). A full discussion of crystal momentum conservation is given in Appendix M.
See also page 471.
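
To make the role of the reciprocal lattice vector K in Eq. (25.27) concrete, here is a minimal one-dimensional sketch for two phonons merging into one. The helper names `fold` and `merge`, the lattice constant `a`, and the sample wave vectors are assumptions made for illustration. Only crystal momentum is tracked here; whether a given merger also conserves energy depends on the dispersion relation and is not checked.

```python
import math

def fold(k, a=1.0):
    """Reduce a 1D wave vector to the first Brillouin zone [-pi/a, pi/a)."""
    G = 2.0 * math.pi / a          # shortest nonzero reciprocal lattice vector
    return (k + G / 2.0) % G - G / 2.0

def merge(k1, k2, a=1.0):
    """Merge two phonons; return (k3, m) with k1 + k2 = k3 + m * (2 pi / a).

    k3 is the wave vector of the final phonon in the first zone and m is the
    integer multiple of 2 pi / a that plays the role of K in Eq. (25.27).
    """
    G = 2.0 * math.pi / a
    k3 = fold(k1 + k2, a)
    m = round((k1 + k2 - k3) / G)
    return k3, m

# Two long-wavelength phonons: the sum stays inside the zone, so K = 0.
print(merge(0.1 * math.pi, 0.2 * math.pi))

# Two phonons well out in the zone: the sum must be brought back by 2 pi / a.
print(merge(0.6 * math.pi, 0.7 * math.pi))
```

The first case has K = 0. In the second the final phonon travels in the direction opposite to the two initial ones, which is how a nonzero K can reverse the flow of energy.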

These transitions are usually referred to by the term “collisions” to emphasize the
analogy with electronic transport. Here, however, the term includes processes in
which a single phonon decays into several, several merge into one, and similar
generalized “collisions” that a theory without number conservation permits. To the
extent that the small oscillations assumption is valid and high-order anharmonic
terms are unimportant, only a small number of phonons will participate in any given
“collision.” One can then treat the transport of energy by phonons with a Boltzmann
equation (Chapter 16) containing collision terms describing those processes in which
phonons can be scattered with appreciable probability. For a more elementary qualitative
theory, one may even introduce a single phonon relaxation time $\tau$, which
specifies the probability per unit time of a phonon suffering any of the various types
of collision.19

LATTICE  THERMAL  CONDUCTIVITY:  ELEMENTARY 
KINETIC  THEORY 

We  shall  not  explore  here  the  detailed  approach  to  lattice  thermal  conductivity 
embodied  in  the  phonon  Boltzmann  equation,  but  shall,  instead,  illustrate  some  of 
the  important  physical  features  of  the  problem  using  an  elementary  relaxation-time 
approximation,  analogous  to  that  used  in  our  discussion  of  electronic  transport  in 
metals  in  Chapters  1  and  2. 

For  simplicity,  we  continue  to  treat  only  monatomic  Bravais  lattices,  where  the 
phonon  spectrum  has  only  acoustic  branches.  Since  we  are  concerned  primarily  with 
qualitative  features  of  thermal  conduction  rather  than  precise  results,  we  shall  also 
make the Debye approximation, when it is convenient, letting the phonon dispersion
relation be $\omega = ck$ for all three acoustic branches.

Suppose a small temperature gradient is imposed along the x-direction in an
insulating crystal (Figure 25.3). As in the Drude model (cf. page 6) we assume that
collisions maintain local thermodynamic equilibrium in a particularly simple way.
Those phonons emerging from collisions at position x are taken to contribute to the
nonequilibrium energy density an amount proportional to the equilibrium energy
density at temperature T(x): $u(x) = u^{\mathrm{eq}}[T(x)]$. Each phonon at a given point will
contribute to the thermal current density in the x-direction an amount equal to the
product of the x-component of its velocity with its contribution to the energy density.20
However, the average contribution of a phonon to the energy density depends
on the position of its last collision. There is thus a correlation between where a phonon

19  In  precise  analogy  with  the  electron  relaxation  time  introduced  in  the  discussion  of  the  Drude 
model.  The  argument  that  follows  is  also  analogous  to  the  electronic  case  except  that  the  phonons  are 
uncharged  (no  thermoelectric  field),  the  phonon  number  density  depends  on  temperature,  and  phonons 
need  not  be  conserved,  particularly  at  the  ends  of  specimens. 

20 For a more detailed description of this procedure see the quite analogous discussion of the electronic
contribution to the thermal conductivity of a metal on pages 20-22.
Figure 25.3

Heat propagation by phonons in the presence of a uniform temperature gradient dT/dx along the
x-axis. The thermal current at $x_0$ is carried by phonons whose last collision was, on the
average, a distance $\ell = c\tau$ away from $x_0$. Phonons with velocities making an angle $\theta$ with
the x-axis at $x_0$ collided last at a point P a distance $\ell\cos\theta$ up the temperature gradient, and
therefore carry an energy density $u(x_0 - \ell\cos\theta)$ with x-velocity $c\cos\theta$. The net thermal
current is proportional to the product of these quantities averaged over all solid angles.


has  come  from  (i.e.,  the  direction  of  its  velocity)  and  its  contribution  to  the  average 
energy  density,  which  results  in  a  net  thermal  current. 

To estimate this thermal current we average the product of the energy density
and the x-velocity over all the places where the phonon’s last collision might have
occurred. Assuming, in the spirit of the Drude model, that the collision occurred a
distance $\ell = c\tau$ from the point $x_0$ (at which the current is to be calculated), in a
direction making an angle $\theta$ to the x-axis (Figure 25.3) we have


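$$ j = \langle c_x\, u(x_0 - \ell\cos\theta) \rangle_\theta = \int_0^\pi c\cos\theta\; u(x_0 - \ell\cos\theta)\, \frac{2\pi\sin\theta\, d\theta}{4\pi} = \frac{1}{2}\int_{-1}^{1} \mu\, d\mu\; c\, u(x_0 - \ell\mu). \tag{25.28} $$

To linear order in the temperature gradient we then have

$$ j = -c\ell\, \frac{\partial u}{\partial x}\, \frac{1}{2}\int_{-1}^{1} \mu^2\, d\mu = \frac{1}{3}\, c\ell\, \frac{\partial u}{\partial T}\left(-\frac{\partial T}{\partial x}\right), \tag{25.29} $$

or

$$ j = \kappa\left(-\frac{\partial T}{\partial x}\right), \tag{25.30} $$

where the thermal conductivity $\kappa$ is given by

$$ \kappa = \frac{1}{3}\, c_v c\,\ell = \frac{1}{3}\, c_v c^2 \tau. \tag{25.31} $$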

Here $c_v$ is the specific heat of the phonons and is one of the quantities that determine
the temperature dependence of $\kappa$. The other21 is the phonon collision rate, $\tau^{-1}$. We
have discussed $c_v(T)$ in Chapter 23, but the question of the temperature dependence
of $\tau^{-1}$ is one of great subtlety and complexity, which took many years to understand
fully. The problems that arise depend on whether the high-temperature ($T \gg \Theta_D$) or
low-temperature ($T \ll \Theta_D$) regime is considered.

21 In the Debye model the phonon speed c is a temperature-independent constant. Even in a more
accurate model, in which $c^2$ should be replaced by some suitable average, it will not make a strong
contribution to the temperature dependence of $\kappa$, in contrast to a classical gas, where $c^2 \sim k_B T$.
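
The factor 1/3 in the kinetic-theory result $\kappa = \frac{1}{3} c_v c\,\ell$ comes entirely from the angular average over the direction of the last collision. The sketch below evaluates that average numerically for an assumed linear temperature profile and an assumed constant specific heat, and compares the result with $-\kappa\, dT/dx$. All names and numbers are illustrative, in arbitrary units.

```python
import math

def heat_current(u, x0, c, ell, n=2000):
    """Midpoint-rule estimate of
    j = (1/2) * integral_0^pi  c cos(theta) u(x0 - ell cos(theta)) sin(theta) d(theta).
    """
    h = math.pi / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * h
        total += c * math.cos(theta) * u(x0 - ell * math.cos(theta)) * math.sin(theta)
    return 0.5 * total * h

# Assumed parameters, arbitrary units.
c_v = 2.0        # specific heat per unit volume, taken constant
c = 3.0          # phonon speed
tau = 0.1        # relaxation time
ell = c * tau    # mean free path
grad_T = 0.5     # uniform temperature gradient dT/dx
T_mid = 10.0     # temperature at x = 0

def energy_density(x):
    """Local equilibrium energy density u = c_v * T(x) for a linear profile."""
    return c_v * (T_mid + grad_T * x)

j_numeric = heat_current(energy_density, 0.0, c, ell)
kappa = c_v * c * ell / 3.0
print("numerical j     :", j_numeric)
print("-kappa * dT/dx  :", -kappa * grad_T)
```

The uniform part of the energy density drops out of the average because as many phonons arrive from one side as from the other. Only the gradient term survives, weighted by the mean of $\cos^2\theta$ over the sphere, which is 1/3.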

Case 1 ($T \gg \Theta_D$) At high temperatures the total number of phonons present in the
crystal is proportional to T because the thermal equilibrium phonon occupation
numbers reduce to:

$$ n_s(\mathbf{k}) = \frac{1}{e^{\hbar\omega_s(\mathbf{k})/k_B T} - 1} \approx \frac{k_B T}{\hbar\omega_s(\mathbf{k})}. \tag{25.32} $$

Since  a  given  phonon  that  contributes  to  the  thermal  current  is  more  likely  to  be 
scattered  the  more  other  phonons  there  are  present  to  do  the  scattering,  we  should 
expect  the  relaxation  time  to  decline  with  increasing  temperature.  Furthermore,  since 
at  high  temperatures  the  phonon  specific  heat  obeys  the  law  of  Dulong  and  Petit 
and  is  temperature-independent,  we  should  expect  the  thermal  conductivity  itself  to 
decline  with  increasing  temperature,  in  the  high-temperature  regime. 
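
The high-temperature form of the occupation number is easy to check numerically. In the sketch below `x` stands for $\hbar\omega_s(\mathbf{k})/k_B T$, and the function name and sample values are illustrative assumptions. The leading correction to $1/x$ is the constant $-1/2$, so the relative error of the linear-in-T approximation is about $x/2$.

```python
import math

def bose(x):
    """Mean occupation number 1/(e^x - 1), with x = hbar*omega / (k_B T)."""
    return 1.0 / math.expm1(x)

for x in (1.0, 0.1, 0.01):
    exact = bose(x)
    classical = 1.0 / x
    print("x =", x, " exact =", exact, " k_B T / hbar omega =", classical,
          " relative error =", (classical - exact) / exact)
```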

This is confirmed by experiment. The rate of decline is generally given by

$$ \kappa \sim \frac{1}{T^x}, \tag{25.33} $$

where x is somewhere between 1 and 2. The precise theory of the power law is quite
complex,  having  to  do  with  competition  between  scattering  processes  produced  by 
cubic  and  quartic  anharmonic  terms.22  This  is  a  case  where  processes  governed  by 
the  cubic  terms  are  so  stringently  limited  by  the  conservation  laws  that  the  quartic 
terms,  even  when  very  much  smaller,  can  produce  enough  processes  to  even  up  the 
balance. 

Case 2 ($T \ll \Theta_D$) At any temperature T, only phonons with energies comparable to
or less than $k_B T$ will be present in appreciable numbers. In particular, when $T \ll \Theta_D$,
the phonons present will have $\omega_s(\mathbf{k}) \ll \omega_D$, and $k \ll k_D$. With this in mind, consider
a phonon collision mediated by the cubic or quartic anharmonic terms. Since only
a small number of phonons are involved, the total energy and total crystal momentum
of those phonons about to participate in the collision must be small compared with
$\hbar\omega_D$ and $k_D$. Since energy is conserved in the collision, the total energy of the phonons
emerging from the collision must continue to be small compared with $\hbar\omega_D$. This is
only possible if the wave vector of each, and hence their total wave vector, is small
compared with $k_D$. However, both initial and final total wave vectors can be small
compared with $k_D$ (which is comparable in size to a reciprocal lattice vector) only if
the additive reciprocal lattice vector K appearing in the crystal momentum conservation
law is zero. Thus, at very low temperatures, the only collisions occurring with
appreciable probability are those that conserve the total crystal momentum exactly,
and not just to within an additive reciprocal lattice vector.

This very important conclusion is sometimes stated in terms of a distinction
between so-called normal and umklapp processes: A normal process is a phonon
collision in which the total initial and final crystal momenta are strictly equal; in an
umklapp process they differ by a nonzero reciprocal lattice vector. Evidently this
distinction depends on the primitive cell in which one chooses to specify the phonon
wave vector (Figure 25.4). That cell is almost always taken to be the first Brillouin
zone.23 The effect of low temperatures on crystal momentum conservation is sometimes
summarized in the assertion that at sufficiently low temperatures the only scattering
processes that can occur at an appreciable rate are normal processes: umklapp
processes are “frozen out.”


22 See, for example, C. Herring, Phys. Rev. 95, 954 (1954), and references cited therein.

23 The rarity of collisions that add a reciprocal lattice vector to the total crystal momentum at low
temperatures can be unambiguously described in terms of the freezing out of umklapp processes, provided
that the primitive cell contains a neighborhood of the point k = 0 that is large enough to include all
phonon wave vectors k with $\hbar\omega_s(\mathbf{k})$ large compared with $k_B T$. The first Brillouin zone is obviously such
a choice.


Figure 25.4

An umklapp process (illustrated in the two-dimensional square lattice). The points are
reciprocal lattice points, the square region in (a) is the first Brillouin zone, and the
parallelogram in (b) is an alternative primitive cell. Two phonons with wave vectors k and k'
are allowed by crystal momentum conservation to merge into a single phonon of wave vector k''.
If all phonon wave vectors are specified in the first Brillouin zone, then k'' differs from
k + k' by the nonzero reciprocal lattice vector $\mathbf{K}_0$, and the process is an umklapp process.
If, however, all phonon wave vectors are specified in the primitive cell shown in (b), then
k'' = k + k', and the process is a normal process. Given the primitive cell in which phonon
wave vectors are to be specified, the distinction between umklapp and normal processes is
unambiguous, for every phonon level has a unique wave vector in that cell, and the sums of the
initial and final wave vectors are uniquely determined. The process is normal if the two sums
agree, and umklapp if they differ by a nonzero reciprocal lattice vector. However, by changing
to another primitive cell, one can make some umklapp processes normal, and vice versa. (Note
that the vectors k and k' are the same in (a) and (b).)


The freezing out of umklapp processes is of critical significance for the low-temperature
thermal conductivity. If only normal processes occur, then the total
phonon wave vector

$$ \sum_s \sum_{\text{1st B.Z.}} \mathbf{k}\, n_s(\mathbf{k}) \tag{25.34} $$

will be conserved. However, in the thermal equilibrium state, with mean phonon
occupation numbers

$$ n_s(\mathbf{k}) = \frac{1}{e^{\hbar\omega_s(\mathbf{k})/k_B T} - 1}, \tag{25.35} $$

the total phonon wave vector (25.34) vanishes, since $\omega_s(-\mathbf{k}) = \omega_s(\mathbf{k})$. Thus if one
should start with a phonon distribution with nonvanishing total crystal momentum,
then normal collisions alone cannot bring about full thermodynamic equilibrium,
even if there is no temperature gradient present. In fact, it can be shown24 that in the
absence of a temperature gradient, if all collisions conserve crystal momentum, then
the phonon distribution function will relax to the steady-state form:

$$ n_s^{\mathbf{w}}(\mathbf{k}) = \frac{1}{e^{[\hbar\omega_s(\mathbf{k}) - \mathbf{w}\cdot\mathbf{k}]/k_B T} - 1}, \tag{25.36} $$

where w is a constant determined by the condition that

$$ \sum_{\mathbf{k}s} \mathbf{k}\, n_s^{\mathbf{w}}(\mathbf{k}) \tag{25.37} $$

be equal to the initial total crystal momentum.

Evidently the distribution function (25.36) is not symmetric in k and will, in
general, support a nonvanishing thermal current density:25

$$ \mathbf{j}^q = \frac{1}{V} \sum_{\mathbf{k}s} \hbar\omega_s(\mathbf{k})\, \frac{\partial\omega_s(\mathbf{k})}{\partial\mathbf{k}}\, n_s^{\mathbf{w}}(\mathbf{k}) \neq 0. \tag{25.38} $$

This is tantamount to the assertion that in the absence of umklapp processes the thermal
conductivity of an insulating crystal is infinite.26
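
A one-dimensional toy calculation illustrates why a drifting distribution of this kind carries heat. The sketch below assumes a monatomic chain with dispersion $\omega(k) = \omega_{\max}\,|\sin(ka/2)|$, works in units where $\hbar = k_B = a = \omega_{\max} = 1$, and sums over a symmetric grid of wave vectors in the first zone. The function name, the grid, and the values of `w` and `T` are illustrative assumptions. With w = 0 the contributions from k and -k cancel in both the total crystal momentum and the heat current; with w ≠ 0 neither sum vanishes.

```python
import math

def drifting_sums(w, T, n_k=400):
    """Return (mean crystal momentum, mean heat current) per mode for a 1D chain.

    Occupation numbers are n(k) = 1/(exp((omega - w k)/T) - 1), the 1D analogue
    of a drifting Bose distribution. The drift parameter w must be small
    enough that omega(k) - w k stays positive throughout the zone.
    """
    momentum = 0.0
    current = 0.0
    dk = 2.0 * math.pi / n_k
    for i in range(n_k):
        k = -math.pi + (i + 0.5) * dk            # symmetric grid, never exactly 0
        omega = abs(math.sin(k / 2.0))            # phonon frequency
        velocity = 0.5 * math.cos(k / 2.0) * (1.0 if k > 0 else -1.0)  # d(omega)/dk
        n = 1.0 / math.expm1((omega - w * k) / T)
        momentum += k * n
        current += omega * velocity * n
    return momentum / n_k, current / n_k

print("no drift  :", drifting_sums(0.0, 0.2))
print("with drift:", drifting_sums(0.1, 0.2))
```

In this toy model the heat current has the same sign as the total crystal momentum, in line with the statement that the current flows roughly along the direction of the total crystal momentum.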


24 Equation (25.36) can be derived with the aid of a phonon Boltzmann equation; see, for example,
J. M. Ziman, Electrons and Phonons, Oxford, 1960, Chapter VIII. We shall not derive it here, since the only
point we wish to draw from it is intuitively quite plausible: If the steady-state distribution function leads
to a nonzero total crystal momentum, then it must violate the symmetry leading to vanishing thermal
current. The thermal current will therefore also be nonzero, barring fortuitous cancellations. A similar
point arises in the theory of the low temperature electrical resistivity of a metal. See pp. 527, 528.

25  That  current  will  be  parallel  to  the  total  crystal  momentum,  in  crystals  of  cubic  symmetry,  and, 
more  generally,  will  flow  in  a  direction  not  too  far  from  the  direction  of  the  total  crystal  momentum. 

26  Implicit  in  this  discussion  is  the  fact  that  phonons  can  appear  and  disappear  at  the  ends  of  the 
sample.  This  becomes  clear  when  one  tries  to  apply  the  same  argument  to  a  dilute  classical  gas,  in  which 
collisions  conserve  real  momentum.  Such  a  gas,  contained  in  a  long  cylindrical  vessel,  does  not  have 
infinite  thermal  conductivity.  Our  argument  fails  in  this  case  because  the  gas  cannot  penetrate  the  ends 
of  the  vessel,  and  the  resulting  accumulation  of  molecules  at  the  ends  leads  to  diffusion  currents  which 
reduce  the  total  momentum  back  to  zero.  Phonons,  however,  though  they  can  be  reflected,  can  also  be 
absorbed  at  the  ends  of  a  cylindrical  crystalline  sample  by  transferring  their  energy  to  the  heat  baths  at 
either end. There is therefore no inconsistency in assuming a steady-state distribution with net nonvanishing
crystal  momentum  throughout  the  specimen.  The  thermal  current  in  a  crystal  in  the  absence  of  umklapp 
processes  is  more  analogous  to  the  transport  of  heat  by  convection  in  a  gas  flowing  through  an  open-ended 
cylinder. 
A perfect infinite anharmonic crystal has a finite thermal conductivity at low
temperatures only because there will still be some small probability of crystal-momentum-destroying
umklapp processes that do degrade the thermal current. Since
the change in total crystal momentum in an umklapp process is equal to a nonvanishing
reciprocal lattice vector (of a size comparable to $k_D$), this means that at
least one of the phonons involved in a cubic or quartic umklapp collision must itself
have a crystal momentum that is not small compared with $k_D$. That same phonon
must also have an energy that is not small compared with $\hbar\omega_D$, and energy conservation
therefore requires at least one phonon with an energy not small compared with
$\hbar\omega_D$ to be present before the collision. When T is small compared with $\Theta_D$, the mean
number of such phonons is

$$ n_s(\mathbf{k}) = \frac{1}{e^{\hbar\omega_s(\mathbf{k})/k_B T} - 1} \approx \frac{1}{e^{\Theta_D/T} - 1} \approx e^{-\Theta_D/T}. \tag{25.39} $$



Thus, as the temperature drops, the number of phonons that can participate in
umklapp processes drops exponentially. Without the umklapp processes the thermal
conductivity would be infinite, and we therefore expect that the effective relaxation
time appearing in the thermal conductivity must vary as

$$ \tau \sim e^{T_0/T} \tag{25.40} $$

at temperatures well below $\Theta_D$, where $T_0$ is a temperature of order $\Theta_D$. Determining
$T_0$ precisely requires analysis of great complexity, an analysis which also leads to
powers of T multiplying the exponential. But these are small corrections to the
dominant exponential behavior, the qualitative form of which is a straightforward
consequence of the freezing out of umklapp processes.
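
The sketch below gives a feeling for how quickly the population of phonons able to take part in umklapp processes collapses. It evaluates the Bose factor for a phonon of energy $k_B\Theta_D$ at several reduced temperatures $t = T/\Theta_D$ and compares it with the pure exponential $e^{-\Theta_D/T}$. Taking the relevant phonon energy to be exactly $k_B\Theta_D$ is an illustrative assumption; the text only requires it to be of that order.

```python
import math

def umklapp_population(t):
    """Mean occupation of a mode with hbar*omega = k_B*Theta_D at t = T/Theta_D."""
    return 1.0 / math.expm1(1.0 / t)

for t in (0.5, 0.2, 0.1, 0.05):
    exact = umklapp_population(t)
    exponential = math.exp(-1.0 / t)
    print("T/Theta_D =", t, " Bose factor =", exact,
          " exp(-Theta_D/T) =", exponential)
```

Between $T = \Theta_D/10$ and $T = \Theta_D/20$ the population falls by a factor of roughly $e^{10}$, which is why power-law prefactors matter little in this regime.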

When  the  temperature  reaches  a  point  where  the  exponential  increase  in  the 
thermal  conductivity  sets  in,  the  conductivity  increases  so  rapidly  with  decreasing 
temperature  that  the  phonon  mean  free  path  soon  becomes  comparable  to  the  mean 
free  path  due  to  the  scattering  of  phonons  by  lattice  imperfections  or  impurities,  or 
even  to  the  mean  free  path  describing  the  scattering  of  phonons  by  the  sides  of  the 
finite specimen. Once this happens, the mean free path in (25.31) ceases to be the
intrinsic one due to the anharmonic terms, and must be replaced by a temperature-independent
length determined by the spatial distribution of imperfections or the
size27 of the specimen. The temperature dependence of $\kappa$ then becomes that of the
specific heat, which declines as $T^3$ at temperatures well below $\Theta_D$.28

Surveying the full temperature range, we expect the very low temperature thermal
conductivity to be limited by temperature-independent scattering processes determined
by the geometry and purity of the sample. It will therefore rise as $T^3$ with
the phonon specific heat. This rise continues until a temperature is reached at which
umklapp processes become frequent enough to yield a mean free path shorter than


27 This regime is known as the Casimir limit. See H. B. G. Casimir, Physica 5, 595 (1938).

28 The degree to which this result is dependent on crystallinity is indicated by experiments on glasses
and amorphous materials, where, for T < 1 K, the thermal conductivity rises roughly as $T^2$. See
R. C. Zeller and R. O. Pohl, Phys. Rev. B4, 2029 (1971).




the temperature-independent one. At this point the thermal conductivity will reach
a maximum, beyond which it will decline rapidly due to the factor $e^{T_0/T}$, reflecting the
exponential increase in the frequency of umklapp processes with rising temperature.
The decline in $\kappa$ continues up to temperatures well above $\Theta_D$, but the drastic exponential
decline is quickly replaced by a slow power law, reflecting simply the increasing
number of phonons available to participate in scattering at high temperatures.

Some  typical  measured  thermal  conductivities  that  illustrate  these  general  trends 
are  displayed  in  Figure  25.5. 
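
The qualitative shape described above (a $T^3$ rise, a maximum, then a decline) can be reproduced with a crude toy model. The sketch below combines the Debye specific heat with a mean free path obtained by adding a boundary scattering rate and an umklapp scattering rate. The interpolation $\ell_U = \ell_0\,(e^{T_0/T} - 1)$, which behaves as $e^{T_0/T}$ at low temperatures and as $1/T$ at high temperatures, is an assumption made here for illustration, as are the parameter values and the reduced units ($t = T/\Theta_D$, lengths in units of the sample size). It is not the theory of the text and should not be expected to fit real data.

```python
import math

def debye_cv(t, n=4000):
    """Debye specific heat in units of its Dulong-Petit value, at t = T/Theta_D."""
    x_max = 1.0 / t
    h = x_max / n
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * h
        e = math.exp(-x)
        total += x ** 4 * e / (1.0 - e) ** 2   # same as x^4 e^x / (e^x - 1)^2
    return 3.0 * t ** 3 * total * h

def mean_free_path(t, boundary=1.0, ell0=1.0e-3, t0=0.5):
    """Combine boundary and (assumed) umklapp mean free paths by adding rates."""
    ell_umklapp = ell0 * math.expm1(t0 / t)
    return 1.0 / (1.0 / boundary + 1.0 / ell_umklapp)

def kappa(t, c=1.0):
    """Kinetic-theory conductivity (1/3) c_v c ell in reduced units."""
    return debye_cv(t) * c * mean_free_path(t) / 3.0

temps = [0.01 * 1.1 ** i for i in range(60)]
values = [kappa(t) for t in temps]
t_peak = temps[values.index(max(values))]
print("conductivity peaks near T/Theta_D =", round(t_peak, 3))
print("kappa(0.02)/kappa(0.01) =", kappa(0.02) / kappa(0.01))
```

In this model a larger `boundary` length raises the low-temperature conductivity and shifts the maximum to lower temperature, the same trend that Figure 25.5 shows for crystals of different cross section.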


Figure 25.5

Thermal conductivity of isotopically pure crystals of LiF. Below about 10 K the conductivity
is limited by surface scattering. Therefore the temperature dependence comes entirely from
the $T^3$ dependence of the specific heat, and the larger the cross-sectional area of the
sample, the larger the conductivity. As the temperature rises, umklapp processes become
less rare, and the conductivity reaches a maximum when the mean free path due to
phonon-phonon scattering is comparable to that due to surface scattering. At still
higher temperatures the conductivity falls because the phonon-phonon scattering rate is
rapidly increasing, while the phonon specific heat is starting to level off.
(After P. D. Thatcher, Phys. Rev. 156, 975 (1967).)

Crystal dimensions:

(A) 7.55 × 6.97 mm
(B) 4.24 × 3.77 mm
(C) 2.17 × 2.10 mm
(D) 1.23 × 0.91 mm

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SECOND  SOUND 

As  we  have  had  occasion  to  note,  there  is  an  analogy  between  the  phonons  in  an 
insulator  and  the  molecules  of  an  ordinary  classical  gas.  Like  the  molecules  of  a  gas, 
phonons  can  exchange  energy  and  (crystal)  momentum  in  collisions,  and  can  transport 
thermal  energy  from  one  region  to  another.  Unlike  the  molecules  in  a  gas,  however, 
the  number  of  phonons  need  not  be  conserved  in  a  collision  or  at  the  surfaces  of 
the  containing  vessel  (the  container,  in  the  case  of  phonons,  being  the  crystal  itself). 
Finally,  though  momentum  is  always  conserved  in  intermolecular  collisions  in  a  gas, 
phonon  crystal  momentum  is  conserved  only  in  normal  collisions,  and  therefore 
crystal  momentum  conservation  is  a  good  conservation  law  only  to  the  extent  that 
the  temperature  is  low  enough  to  freeze  out  umklapp  processes.  These  similarities 
and  differences  are  summarized  in  Table  25.3. 


Table 25.3

THE CLASSICAL GAS vs. THE PHONON GAS

                          CLASSICAL GAS OF MOLECULES     PHONON GAS

Container                 A vessel with impenetrable     A crystal, which is the medium
                          walls                          that sustains the phonons

Collisions                Molecules collide with each    Phonons collide with each other,
                          other and with walls of        with surface of crystal, and
                          vessel                         with impurities

Energy conserved in       Yes                            Yes
collisions

(Crystal) momentum        Yes (except at walls)          Yes (except at surfaces and in
conserved in collisions                                  collisions with impurities),
                                                         provided that $T \ll \Theta_D$, so that
                                                         umklapp processes are frozen out

Number conserved in       Yes                            No
collisions

One  of  the  most  striking  phenomena  observed  in  an  ordinary  gas  is  sound,  a 
wavelike  oscillatory  disturbance  in  the  local  density  of  molecules.  According  to 
elementary  kinetic  theory  sound  can  propagate  in  a  gas  provided  that: 


(a)  Collisions  between  the  molecules  conserve  number,  energy,  and  momentum. 

(b) The collision rate, $1/\tau$, is large compared with the frequency of the sound wave,
$\nu = \omega/2\pi$:

$$\omega \ll \frac{1}{\tau}. \tag{25.41}$$

Condition  (b)  insures  that  at  any  instant  in  the  oscillatory  cycle  collisions  occur 
rapidly  enough  to  bring  about  a  local  state  of  thermodynamic  equilibrium,  in  which 
the instantaneous local density, pressure, and temperature are related by the equilibrium
equation of state for the uniform gas. The conservation laws (condition (a))
are essential for the establishment of this equilibrium. The momentum conservation
law  is  of  critical  importance,  in  requiring  the  instantaneous  local  equilibrium  con¬ 
figuration  to  have  a  net  nonvanishing  momentum  (sometimes  referred  to  as  “local 
equilibrium  in  a  moving  frame  of  reference”),  which  is  the  kinematical  basis  for  the 
oscillation. 

In  considering  whether  sound  has  an  analogue  in  the  phonon  gas,  one  must 
recognize  that  the  phonon  gas  differs  in  two  relevant  ways  from  an  ordinary  gas: 

1.  Number  is  not  conserved  in  collisions. 

2.  Crystal  momentum  is  not  conserved  exactly,  although  it  becomes  conserved  to 
increasingly  greater  accuracy  as  the  temperature  drops  and  umklapp  processes 
are  “frozen  out.” 

The  first  point  presents  no  serious  problems:  The  loss  of  one  of  the  conservation 
laws reflects the fact that the equilibrium phonon distribution function

$$n_s(\mathbf{k}) = \frac{1}{e^{\hbar\omega_s(\mathbf{k})/k_B T} - 1} \tag{25.42}$$

is  determined  entirely  by  the  temperature,  while  the  equilibrium  distribution  func¬ 
tion  in  an  ideal  gas  depends  on  both  temperature  and  density.  Since  local  equilibrium 
is  specified  by  one  less  variable  in  the  phonon  gas,  one  less  conservation  law  is  required 
to  maintain  it. 

However,  momentum  conservation  is  quite  essential  for  the  propagation  of  sound, 
and  this  means  that  the  rate  of  umklapp  collisions,  which  destroy  crystal  momentum, 
must  be  small  compared  with  the  frequency  of  the  oscillation: 

$$\frac{1}{\tau_u} \ll \omega. \tag{25.43}$$

This  condition  has  no  analogue  in  the  theory  of  sound  in  an  ordinary  gas.  However, 
the  analogue  of  Eq.  (25.41)  must  continue  to  hold,  where  the  relevant  relaxation  time 
is that describing the momentum-conserving normal collisions, $\tau_N$,

$$\omega \ll \frac{1}{\tau_N}, \tag{25.44}$$

for  it  is  still  essential  that  local  thermodynamic  equilibrium  be  maintained  on  a  time 
scale  short  compared  to  the  period  of  an  oscillation.  Combining  conditions  (25.43) 
and  (25.44),  we  find  that  the  frequency  must  lie  in  the  “window” 

$$\frac{1}{\tau_u} \ll \omega \ll \frac{1}{\tau_N}. \tag{25.45}$$

Therefore  the  analogue  of  sound  in  the  phonon  gas  will  exist  at  temperatures  low 
enough  for  the  rate  of  normal  collisions  to  be  substantially  greater  than  the  rate  of 
umklapps  and  at  frequencies  intermediate  between  the  two  collision  rates.  The 
phenomenon,  known  as  second  sound,  can  be  regarded  as  an  oscillation  in  the  local 
phonon  number  density  (just  as  ordinary  sound  is  an  oscillation  in  the  local  density 
of molecules) or, perhaps more pertinently for phonons (since their chief attribute
is that they carry energy), as an oscillation in the local energy density. Since the local
equilibrium number and energy densities of the phonon gas in a crystal are uniquely
determined by the local temperature, second sound will manifest itself as a wavelike
oscillation  in  the  temperature.  Its  observation  is  most  favorable  in  solids  of  very  high 
isotopic  purity  (since  any  deviation  from  a  perfect  Bravais  lattice,  including  the 
presence  of  occasional  ions  of  different  isotopic  mass,  will  lead  to  collisions  in  which 
crystal  momentum  is  not  conserved)  and  with  fairly  large  anharmonic  terms  (since 
a  high  rate  of  normal  phonon  collisions  is  required  to  maintain  local  thermodynamic 
equilibrium).  These  considerations  make  solid  helium  and  the  ionic  crystal  sodium 
fluoride  promising  media  for  the  observation  of  second  sound.  In  both  crystals  heat 
impulses  have  indeed  been  observed  to  propagate  at  the  velocity  predicted  by  the 
second-sound  wave  equation,  rather  than  in  the  diffusional  manner  associated  with 
ordinary  thermal  conduction.29  The  prediction  and  detection  of  second  sound  is  one 
of  the  great  triumphs  of  the  theory  of  lattice  vibrations. 
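
The frequency window (25.45) can be made concrete with a small helper. This is a minimal sketch: the function name, the `margin` factor used to quantify "much less than", and the relaxation times in the usage note are all hypothetical choices made for illustration.

```python
def second_sound_window(tau_N, tau_u, margin=10.0):
    """Return (omega_min, omega_max) with 1/tau_u << omega << 1/tau_N, or None.

    tau_N  : relaxation time for normal (momentum-conserving) collisions
    tau_u  : relaxation time for umklapp (momentum-destroying) collisions
    margin : assumed factor by which each inequality must hold
    """
    omega_min = margin / tau_u            # must exceed the umklapp rate
    omega_max = 1.0 / (margin * tau_N)    # must stay below the normal rate
    if omega_min >= omega_max:
        return None                       # no window: second sound cannot propagate
    return omega_min, omega_max

if __name__ == "__main__":
    print(second_sound_window(tau_N=1e-9, tau_u=1e-5))
    print(second_sound_window(tau_N=1e-7, tau_u=1e-7))
```

A window exists only when normal collisions are much more frequent than umklapp collisions, which is why low temperatures and isotopically pure crystals are favored.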


PROBLEMS 


1.  Instability  of  a  Theory  with  Only  Cubic  Anharmonicity 

Show  that  if  only  cubic  corrections  to  the  harmonic  potential  energy  are  retained,  then  that 
potential  energy  can  be  made  negative  and  arbitrarily  large  in  magnitude  by  a  suitable  choice 
of  the  ionic  displacements  u(R).  (Hint:  Take  an  arbitrary  set  of  displacements  and  consider  the 
effect  on  the  total  potential  energy  of  multiplying  them  all  by  a  scale  factor  and  changing  all 
of  their  signs.) 

2.  Equation  of  State  of  the  Harmonic  Crystal 

Derive  the  form  (25.7)  for  the  pressure  in  the  harmonic  approximation,  by  substituting  the 
harmonic  form  (25.6)  of  the  internal  energy  U  into  the  general  thermodynamic  relation  (25.5). 
(Hint: Change the integration variable from $T'$ to $x = \hbar\omega_s(\mathbf{k})/T'$, and integrate by parts with
respect  to  x,  taking  due  care  of  the  integrated  terms.) 

3. Grüneisen Parameters in One Dimension

Consider a one-dimensional array of N atoms interacting through pair potentials $\phi(r)$ and
constrained to have a length L = Na (i.e., the equilibrium lattice constant is constrained to be a).

(a) Show that if only nearest-neighbor interactions are appreciable, then the k-dependent
Grüneisen parameters are in fact independent of k and given by

$$\gamma = -\frac{a}{2}\,\frac{\phi'''(a)}{\phi''(a)}. \tag{25.46}$$

(b) Show that if next-nearest-neighbor interactions are retained, the Grüneisen parameters
for the individual normal modes will, in general, depend on the wave vector.
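
As a numerical sanity check on part (a), not a substitute for the derivation, one can compare a finite-difference estimate of $-\partial \ln\omega/\partial \ln L$ with the formula $-\tfrac{a}{2}\phi'''(a)/\phi''(a)$. The sketch below assumes a Lennard-Jones pair potential, the standard nearest-neighbor chain frequencies $\omega = 2\sqrt{\phi''(a)/M}\,|\sin(ka/2)|$, and a mode label $k = 2\pi n/Na$ that scales with the chain length; all of these are illustrative assumptions.

```python
import math

def phi2(r, eps=1.0, sigma=1.0):
    """Second derivative of the (assumed) Lennard-Jones pair potential."""
    return 4 * eps * (156 * sigma**12 / r**14 - 42 * sigma**6 / r**8)

def phi3(r, eps=1.0, sigma=1.0):
    """Third derivative of the same potential."""
    return 4 * eps * (-2184 * sigma**12 / r**15 + 336 * sigma**6 / r**9)

def omega(n, N, a, M=1.0):
    """Frequency of mode n (k = 2*pi*n/(N*a)) with nearest-neighbor forces only."""
    return 2.0 * math.sqrt(phi2(a) / M) * abs(math.sin(math.pi * n / N))

def gruneisen_numeric(n, N, a, rel_step=1e-6):
    """gamma = -d ln(omega) / d ln(L), L = N*a, by central difference in a."""
    h = rel_step * a
    dlog = math.log(omega(n, N, a + h)) - math.log(omega(n, N, a - h))
    return -a * dlog / (2 * h)

def gruneisen_formula(a):
    """The closed form quoted in part (a)."""
    return -0.5 * a * phi3(a) / phi2(a)

if __name__ == "__main__":
    a = 1.15                               # hypothetical lattice constant (units of sigma)
    for n in (1, 5, 10):
        print(n, gruneisen_numeric(n, 32, a), gruneisen_formula(a))
```

The numerical values agree for every mode tested, consistent with the claim that the nearest-neighbor Grüneisen parameter does not depend on k.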


29 The observation of second sound in solid $^4$He is reported by Ackerman et al., Phys. Rev. Lett. 16,
789 (1966), and in solid $^3$He by C. C. Ackerman and W. C. Overton, Jr., Phys. Rev. Lett. 22, 764 (1969).
The onset of second sound in NaF is reported by T. F. McNelly et al., Phys. Rev. Lett. 24, 100 (1970). The
subject is reviewed by C. C. Ackerman and R. A. Guyer, Annals of Physics 50, 128 (1968).

4. General Form of the Grüneisen Parameters

If the harmonic approximation is not made, the full ionic potential energy of a monatomic Bravais
lattice will have the form

$$U^{\mathrm{eq}} + \frac{1}{2!}\sum_{\substack{\mu\nu \\ \mathbf{R}\mathbf{R}'}} u_\mu(\mathbf{R})\,u_\nu(\mathbf{R}')\,D_{\mu\nu}(\mathbf{R}-\mathbf{R}') + \frac{1}{3!}\sum_{\substack{\mu\nu\lambda \\ \mathbf{R}\mathbf{R}'\mathbf{R}''}} u_\mu(\mathbf{R})\,u_\nu(\mathbf{R}')\,u_\lambda(\mathbf{R}'')\,D_{\mu\nu\lambda}(\mathbf{R},\mathbf{R}',\mathbf{R}'') + \cdots, \tag{25.47}$$

where u(R) gives the displacement from the equilibrium position R.

(a) Show that if the expansion is made not about the equilibrium positions R, but about
sites $\bar{\mathbf{R}} = (1 + \eta)\mathbf{R}$, then the coefficients of the quadratic term in the new expansion are given to
linear order in $\eta$ by

$$\bar{D}_{\mu\nu}(\mathbf{R}-\mathbf{R}') = D_{\mu\nu}(\mathbf{R}-\mathbf{R}') + \eta\,\delta D_{\mu\nu}(\mathbf{R}-\mathbf{R}'), \tag{25.48}$$

where

$$\delta D_{\mu\nu}(\mathbf{R}-\mathbf{R}') = \sum_{\lambda\mathbf{R}''} D_{\mu\nu\lambda}(\mathbf{R},\mathbf{R}',\mathbf{R}'')\,R''_\lambda. \tag{25.49}$$

Note that only the cubic term in (25.47) contributes to this order in $\eta$.

(b) Show that the Grüneisen parameter for the normal-mode ks is given by

$$\gamma_{\mathbf{k}s} = -\frac{\boldsymbol{\epsilon}(\mathbf{k}s)\cdot\delta\mathbf{D}(\mathbf{k})\cdot\boldsymbol{\epsilon}(\mathbf{k}s)}{6M\omega_s(\mathbf{k})^2}. \tag{25.50}$$


5.  Three-Phonon  Processes  in  One  Dimension 

Consider  a  process  in  which  two  phonons  combine  to  give  a  third  (or  one  phonon  decays  into 
two  others).  Let  all  phonons  be  acoustic,  assume  that  the  two  transverse  branches  lie  below  the 
longitudinal branch, and assume that $d^2\omega/dk^2 < 0$ for all three branches.

(a)  Interpreting  the  conservation  laws  graphically  (as,  for  example,  in  Figure  24.5),  show 
that  there  can  be  no  process  in  which  all  three  phonons  belong  to  the  same  branch. 

(b)  Show  that  the  only  possible  processes  are  those  in  which  the  single  phonon  is  in  a  branch 
higher than at least one of the members of the pair; i.e.,

Transverse + transverse ↔ longitudinal

or

Transverse + longitudinal ↔ longitudinal.


The general theory of lattice vibrations expounded in Chapters 22 and 23 is applicable
to  both  metals  and  insulators.  However,  its  detailed  application  to  metals  is  compli¬ 
cated  by  two  aspects  of  the  metallic  state: 

1. The Ions Are Charged. This leads to difficulties associated with the very long
range  of  the  direct  electrostatic  interaction  between  ions.1 

2. Conduction Electrons Are Present. Even the simplest theory of lattice vibrations
in  a  metal  must  acknowledge  the  presence  of  a  set  of  electrons  that  cannot  be 
considered  rigidly  bound  up  in  the  ion  cores.  The  conduction  electrons  interact 
with  the  ions  through  electrostatic  forces  which  are  just  as  strong  as  the  direct 
Coulomb  forces  between  ions,  and  it  is  therefore  essential  to  know  what  they 
too  are  doing  in  the  course  of  a  lattice  vibration. 

As  it  turns  out,  these  mobile  conduction  electrons  provide  just  the  mechanism 
required  to  remove  the  problems  connected  with  the  long  range  of  the  direct  electro¬ 
static  interaction  between  ions. 


ELEMENTARY  THEORY  OF  THE  PHONON  DISPERSION 
RELATION 

Suppose  that  we  were  to  ignore  the  forces  the  conduction  electrons  exert  on  the 
ions.  Then  the  theory  of  metallic  lattice  vibrations  would  be  just  the  theory  of  the 
normal  modes  of  a  set  of  N  charged  particles  of  charge  Ze  and  mass  M  in  a  volume 
V.  In  the  limit  of  long  wavelengths,  except  for  the  difference  in  particle  mass  and 
charge,2  this  is  just  the  problem  we  analyzed  in  Chapter  1  (pages  19,  20),  where  we 
found  that  an  electron  gas  could  sustain  density  oscillations  at  the  plasma  frequency 
$\omega_p$, given by

$$\omega_p^2 = \frac{4\pi n_e e^2}{m}. \tag{26.1}$$

Making the substitutions $e \to Ze$, $m \to M$, $n_e \to n_i = n_e/Z$, we can conclude in the
same way that a set of charged point ions should undergo long-wavelength vibrations
at an ionic plasma frequency $\Omega_p$, given by

$$\Omega_p^2 = \frac{4\pi n_i (Ze)^2}{M}. \tag{26.2}$$


This  contradicts  the  conclusion  in  Chapter  22  that  the  long-wavelength  normal¬ 
mode  frequencies  of  a  monatomic  Bravais  lattice  should  vanish  linearly  with  k.  That 
result  is  inapplicable  because  the  approximation  (22.64)  leading  to  the  linear  form 


1 This problem also arises in ionic crystals, as we have seen in Chapter 20. We will discuss the theory
of their lattice vibrations in Chapter 27.

2 We also ignore the fact that, unlike an electron gas, the ions are distributed at lattice sites in
equilibrium, and have short-range repulsive interactions. A more careful analysis is given in Problem 1.

for $\omega(\mathbf{k})$ at small k is only valid if the forces between ions separated by R are negligibly
small for R of order 1/k. But the inverse square force falls off so slowly with distance
that no matter how small k is, interactions of ions separated by $R \gtrsim 1/k$ can contribute
substantially to the dynamical matrix (22.59).3 Nevertheless, the phonon
spectra of metals clearly possess branches in which $\omega$ vanishes linearly with k. This
can be seen directly from neutron scattering, and also from the $T^3$ term in the specific
heat,4 characteristic of such a linear dependence.5

To  understand  why  the  phonon  dispersion  is  linear  at  small  k  it  is  essential,  when 
considering  ionic  motion,  to  take  the  conduction  electrons  into  account. 

The  response  of  the  electrons  is  treated  in  the  adiabatic  approximation  (see 
Chapter  22,  p.  425)  which  assumes  that  at  any  time  the  electrons  take  on  the  con¬ 
figurations  they  would  have  if  the  ions  were  frozen  into  their  instantaneous  positions. 
Furthermore  we  have  seen  in  Chapter  17  that  in  the  presence  of  an  external  charge 
distribution  (in  this  case  the  instantaneous  distribution  of  the  ions)  the  electron  gas 
distributes  itself  so  as  to  shield  or  screen  the  fields  produced  by  that  distribution. 
Thus  as  the  ions  execute  their  comparatively  sluggish  vibrations,  the  nimble  con¬ 
duction  electrons  are  continually  redistributing  themselves  so  as  to  cancel  out  the 
long-range  part  of  the  ionic  field,  yielding  an  effective  ionic  field  that  is  short-ranged, 
and  therefore  capable  of  leading  to  a  phonon  dispersion  relation  that  is  linear  in  k 
at  long  wavelengths. 

One  often  refers  to  the  original  direct  Coulomb  interaction  between  ions  as  the 
“bare”  ion-ion  interaction,  and  the  short-range  effective  interaction  produced  by 
conduction  electron  screening  as  the  “dressed”  interaction. 

To estimate the actual phonon frequencies from this picture we consider the ionic
configuration in a phonon of wave vector k to constitute, as far as the conduction
electrons are concerned, an external charge density6 with wave vector k. According
to (17.36) the field associated with such a distribution is reduced (by virtue of the
screening action of the electrons) by $1/\epsilon(\mathbf{k})$, where $\epsilon(\mathbf{k})$ is the electron gas dielectric
constant. Since the square of the phonon frequency, $\omega(\mathbf{k})^2$, is proportional to the
restoring force, and hence to the field, we must reduce (26.2) by $1/\epsilon(\mathbf{k})$, for it was
derived without taking screening into account. This yields a “dressed” phonon
frequency given by

$$\omega(\mathbf{k})^2 = \frac{\Omega_p^2}{\epsilon(\mathbf{k})}. \tag{26.3}$$

As $k \to 0$ the dielectric constant is given by the Thomas-Fermi form (17.51):

$$\epsilon(\mathbf{k}) = 1 + \frac{k_0^2}{k^2}, \tag{26.4}$$


3  Consider,  for  example,  a  plane  sheet  of  charged  particles,  which  gives  rise  to  an  electric  field 
independent  of  the  distance  from  the  sheet.  See  also  Chapter  22,  footnote  28. 

4  See  Problem  2. 

5  See  Problem  2,  Chapter  23. 

6  This  ignores  complications  stemming  from  the  discreteness  of  the  lattice  (and  hence  the  ambiguity 
of k to within an additive reciprocal lattice vector).

and therefore, as $k \to 0$,

$$\omega(\mathbf{k}) \approx ck, \qquad c^2 = \frac{\Omega_p^2}{k_0^2} = \frac{Zm}{M}\,\frac{\omega_p^2}{k_0^2}. \tag{26.5}$$


To see that this gives a reasonable value for the phonon velocity, we estimate $k_0$ by
its free electron value7 (17.55) given by

$$\frac{4\pi e^2}{k_0^2} = \frac{\hbar^2\pi^2}{m k_F}, \tag{26.6}$$

and evaluate the electron plasma frequency using (2.21),

$$n_e = \frac{k_F^3}{3\pi^2}. \tag{26.7}$$

Thus the velocity of sound is given by

$$c^2 = \frac{1}{3}\,Z\,\frac{m}{M}\,v_F^2, \tag{26.8}$$

which is known as the Bohm-Staver relation.8

Since the electron-ion mass ratio is typically of order $10^{-4}$ or $10^{-5}$, this predicts
a sound velocity about a hundredth of the Fermi velocity, or of order $10^6$ cm/sec,
in agreement with observed orders of magnitude. Alternatively, since

$$\frac{\Theta_D}{T_F} = \frac{\hbar c k_D/k_B}{\tfrac{1}{2}\hbar k_F v_F/k_B} = 2\,\frac{k_D}{k_F}\,\frac{c}{v_F} \sim \frac{c}{v_F}, \tag{26.9}$$


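7 Alternatively, one can use the exact long-wavelength relation (17.50)

$$\frac{4\pi e^2}{k_0^2} = \frac{1}{\partial n_e/\partial\mu}$$

together with the thermodynamic identity

$$\frac{n_e}{\partial n_e/\partial\mu} = \frac{\partial P}{\partial n_e},$$

to write (26.5) in the form

$$c^2 = \frac{\partial P}{\partial\rho_{\mathrm{ion}}}, \qquad \rho_{\mathrm{ion}} = \frac{M n_e}{Z}.$$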

Since  continuum  mechanics  predicts  (ignoring  anisotropy)  that  the  sound  velocity  of  any  medium  is 
given  by  the  square  root  of  the  derivative  of  pressure  with  respect  to  mass  density,  (26.5)  is  as  good  as 
the  approximation  that  the  compressibility  of  a  metal  is  dominated  by  the  electronic  contribution  (the 
mass  density,  of  course,  is  dominated  by  the  ionic  contribution)  and  (26.8)  is  as  good  as  the  approximation 
that  the  compressibility  is  dominated  by  the  free  electron  contribution.  By  coincidence,  this  is  very  nearly 
the  case  in  the  alkali  metals  (see  page  39).  However,  it  is  evident  that  (26.8)  overlooks,  at  the  least,  both 
electron-electron  interactions  and  ionic  core-core  repulsion. 

8 D. Bohm and T. Staver, Phys. Rev. 84, 836 (1950).

Eq. (26.8) accounts for the fact that the Debye temperature in a metal is typically
of  order  room  temperature,  while  the  Fermi  temperature  is  several  tens  of  thousands 
of  degrees  Kelvin. 
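
The chain of estimates from (26.2) to (26.8) can be checked numerically. The sketch below works in CGS units and uses approximate free-electron values for sodium (electron density, valence, atomic mass) as illustrative inputs; these numbers and the function names are assumptions made for the example. Like the text, it ignores the discreteness of the lattice and ionic core repulsion.

```python
import math

# Approximate CGS constants
E_CHARGE = 4.803e-10      # esu
M_ELECTRON = 9.109e-28    # g
HBAR = 1.0546e-27         # erg s
AMU = 1.6605e-24          # g

def fermi_wave_vector(n_e):
    return (3 * math.pi**2 * n_e) ** (1.0 / 3.0)              # from (26.7)

def ionic_plasma_frequency(n_e, Z, M):
    n_i = n_e / Z
    return math.sqrt(4 * math.pi * n_i * (Z * E_CHARGE)**2 / M)   # (26.2)

def thomas_fermi_k0(n_e):
    k_F = fermi_wave_vector(n_e)
    return math.sqrt(4 * M_ELECTRON * E_CHARGE**2 * k_F / (math.pi * HBAR**2))  # (26.6)

def dressed_omega(k, n_e, Z, M):
    eps = 1.0 + (thomas_fermi_k0(n_e) / k)**2                  # (26.4)
    return ionic_plasma_frequency(n_e, Z, M) / math.sqrt(eps)  # (26.3)

def bohm_staver_velocity(n_e, Z, M):
    v_F = HBAR * fermi_wave_vector(n_e) / M_ELECTRON
    return math.sqrt(Z * M_ELECTRON / (3 * M)) * v_F           # (26.8)

if __name__ == "__main__":
    n_e, Z, M = 2.65e22, 1, 22.99 * AMU    # sodium, approximate values
    print("Omega_p  =", ionic_plasma_frequency(n_e, Z, M), "rad/s")
    print("c (26.8) =", bohm_staver_velocity(n_e, Z, M), "cm/s")
    print("omega/k at small k =", dressed_omega(1e3, n_e, Z, M) / 1e3, "cm/s")
```

The small-k slope of the dressed dispersion reproduces the Bohm-Staver velocity, while at large k the screened frequency approaches the bare ionic plasma frequency.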


KOHN  ANOMALIES 

The  assumption  that  the  Coulombic  part  of  the  effective  ion-ion  interaction  is  reduced 
by  the  electronic  dielectric  constant  also  has  implications  for  normal  modes  of  short 
wavelength. At wave vectors that are not small compared with $k_F$, the Thomas-Fermi
dielectric constant (26.4) must be replaced by the more accurate Lindhard result,9
which is singular10 when the wave vector q of the disturbance has magnitude $2k_F$.
W. Kohn pointed out11 that this singularity should be conveyed, via the screened
ion-ion interaction, into the phonon spectrum itself, resulting in weak but discernible
“kinks” (infinities in $\partial\omega/\partial q$) at values of q corresponding to extremal diameters of the
Fermi surface.

Highly accurate neutron measurements of $\omega(\mathbf{q})$ are required to reveal these
anomalies.  Such  measurements  have  been  performed12  and  indicate  a  structure  of 
singularities  that  is  consistent  with  the  Fermi  surface  geometry  deduced  from  quite 
unrelated  data. 
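
The logarithmic singularity behind the Kohn anomaly can be seen in a short numerical sketch. It assumes the standard static free-electron Lindhard function $F(x) = \tfrac12 + \frac{1-x^2}{4x}\ln\left|\frac{1+x}{1-x}\right|$ with $x = q/2k_F$, which enters the dielectric constant as $\epsilon(q) = 1 + (k_0^2/q^2)F(x)$; the function names are chosen here for illustration.

```python
import math

def lindhard_F(x):
    """Static Lindhard function F(x), x = q / (2 k_F), for free electrons."""
    if x == 1.0:
        return 0.5                          # finite limiting value at q = 2 k_F
    return 0.5 + (1 - x * x) / (4 * x) * math.log(abs((1 + x) / (1 - x)))

def lindhard_dF(x):
    """Analytic derivative dF/dx; it diverges logarithmically as x -> 1."""
    return 1 / (2 * x) - (1 + x * x) / (4 * x * x) * math.log(abs((1 + x) / (1 - x)))

if __name__ == "__main__":
    for d in (1e-1, 1e-3, 1e-6):
        print(f"x = 1 - {d:g}: F = {lindhard_F(1 - d):.6f}, dF/dx = {lindhard_dF(1 - d):.3f}")
```

F itself stays finite and continuous at $q = 2k_F$, but its slope grows without bound, which is the feature that is carried into the phonon spectrum.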

DIELECTRIC  CONSTANT  OF  A  METAL 

Our  discussion  of  screening  in  Chapter  17  was  based  on  the  electron  gas  model,  which 
treats  the  ions  as  an  inert  uniform  background  of  positive  charge.  This  overlooks 
the  fact  that  an  external  source  of  charge  can  induce  fields  in  a  metal  by  distorting 
the  charge  distribution  of  the  ions,  as  well  as  that  of  the  electrons.  There  are  circum¬ 
stances  in  which  one  is  legitimately  interested  in  the  screening  action  of  the  electrons 
alone.13  However,  one  also  often  wishes  to  consider  the  screening  of  an  external 
source  by  all  the  charged  particles  in  the  metal — ions  and  electrons.  We  are  now  in 
a  position  to  consider  an  elementary  treatment  of  this  additional  ionic  source  of 
screening. 

We  define,  in  the  usual  way  (see  Chapter  17),  the  total  dielectric  constant  of  the 
metal  as  the  proportionality  constant  relating  the  Fourier  transform  of  the  total 
potential in the metal to the Fourier transform of the potential of the external charge:

$$\epsilon\,\phi^{\mathrm{total}} = \phi^{\mathrm{ext}}. \tag{26.10}$$

It is instructive to relate the total dielectric constant $\epsilon$ to the dielectric constant
$\epsilon^{\mathrm{el}}$ of the electron gas alone, the dielectric constant $\epsilon^{\mathrm{ion}}_{\mathrm{bare}}$ of the bare ions alone, and

9  See  Chapter  17,  page  343. 

10  Its  derivative  has  a  logarithmic  divergence. 

11 W. Kohn, Phys. Rev. Lett. 2, 393 (1959).

12  R.  Stedman  et  al.,  Phys.  Rev.  162,  545  (1967). 

13  For  example,  in  our  derivation  of  the  Bohm-Staver  relation  (26.8).  In  that  argument  the  ions 
were  treated  as  a  source  of  charge  external  to  the  electron  gas,  and  not  as  additional  constituents  of  the 
screening medium.

the dielectric constant $\epsilon^{\mathrm{ion}}_{\mathrm{dressed}}$ appropriate to the dressed ions, i.e., describing a set
of “particles” (ions with their clouds of screening electrons) interacting through a
screened interaction $V^{\mathrm{eff}}$.

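If we were to consider the medium to be the electrons alone, including the ions in
with the explicitly external sources through a total “external” potential $\phi^{\mathrm{ext}} + \phi^{\mathrm{ion}}$,
then we could write

$$\epsilon^{\mathrm{el}}\phi^{\mathrm{total}} = \phi^{\mathrm{ext}} + \phi^{\mathrm{ion}}. \tag{26.11}$$

On the other hand, we could also consider the medium to consist of the bare ions
alone, regarding the electrons as an additional source of external potential. We
should then have

$$\epsilon^{\mathrm{ion}}_{\mathrm{bare}}\phi^{\mathrm{total}} = \phi^{\mathrm{ext}} + \phi^{\mathrm{el}}. \tag{26.12}$$

Adding these last two equations together and subtracting the definition (26.10) of $\epsilon$
we find

$$(\epsilon^{\mathrm{el}} + \epsilon^{\mathrm{ion}}_{\mathrm{bare}} - \epsilon)\,\phi^{\mathrm{total}} = \phi^{\mathrm{ext}} + \phi^{\mathrm{el}} + \phi^{\mathrm{ion}}; \tag{26.13}$$

but since $\phi^{\mathrm{total}} = \phi^{\mathrm{ext}} + \phi^{\mathrm{el}} + \phi^{\mathrm{ion}}$, we have deduced that14

$$\epsilon = \epsilon^{\mathrm{el}} + \epsilon^{\mathrm{ion}}_{\mathrm{bare}} - 1. \tag{26.14}$$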

Equation  (26.14)  gives  the  dielectric  constant  of  the  metal  in  terms  of  those  of  the 
electrons  and  the  bare  ions.  However,  it  is  often  more  convenient  to  deal  not  with 
bare,  but  with  the  dressed  ions.  By  dressed  ions  one  means  the  ions  together  with 
their  cloud  of  screening  electrons;  i.e.,  particles  that  give  rise  to  an  effective  potential 
that is the bare ionic potential screened by electrons. The dielectric constant $\epsilon^{\mathrm{ion}}_{\mathrm{dressed}}$
describes the total potential that would be established by such a set of particles in
the presence of a given external potential. To describe the response of a metal (as
opposed to a set of dressed ions) to an external potential we must note that in addition
to “dressing” the ions, the electrons also screen the external potential; i.e., the “external
potential” to be screened by the dressed ions is not the bare external potential, but
the electronically screened one.

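Thus the response of a metal to a potential $\phi^{\mathrm{ext}}$ can be viewed as the response of
a set of dressed ions to a potential $(1/\epsilon^{\mathrm{el}})\phi^{\mathrm{ext}}$, and we have

$$\phi^{\mathrm{total}} = \frac{1}{\epsilon^{\mathrm{ion}}_{\mathrm{dressed}}}\,\frac{1}{\epsilon^{\mathrm{el}}}\,\phi^{\mathrm{ext}}. \tag{26.15}$$

Comparing this with the definition (26.10) of $\epsilon$, we have:

$$\frac{1}{\epsilon} = \frac{1}{\epsilon^{\mathrm{ion}}_{\mathrm{dressed}}}\,\frac{1}{\epsilon^{\mathrm{el}}}, \tag{26.16}$$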


14 In terms of the polarizability $\alpha = (\epsilon - 1)/4\pi$, this is simply a special case of the assertion that
the  polarizability  of  a  medium  composed  of  several  types  of  carriers  is  the  sum  of  the  polarizabilities  of 
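the individual carriers.

which should be used in place of (26.14) if we wish a description based on dressed,
rather than bare, ions. The two formulations must, of course, be equivalent. Writing
(26.14) in the form:

$$\frac{1}{\epsilon} = \frac{1}{\epsilon^{\mathrm{el}}}\,\frac{1}{1 + (\epsilon^{\mathrm{ion}}_{\mathrm{bare}} - 1)/\epsilon^{\mathrm{el}}}, \tag{26.17}$$

we see that consistency with (26.16) requires that15

$$\epsilon^{\mathrm{ion}}_{\mathrm{dressed}} = 1 + \frac{1}{\epsilon^{\mathrm{el}}}\left(\epsilon^{\mathrm{ion}}_{\mathrm{bare}} - 1\right). \tag{26.18}$$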


To study the rough quantitative significance of the ionic contribution to the
dielectric constant, we use the simplest expressions available for $\epsilon^{\mathrm{el}}$ and $\epsilon^{\mathrm{ion}}_{\mathrm{bare}}$. For
the former we use the Thomas-Fermi result (26.4).16 For the latter we can simply
transcribe the result (1.37) for the dielectric constant of a gas of charged particles,
provided that we replace the electronic plasma frequency (26.1) by that of the ions
(26.2).17 Thus with

$$\epsilon^{\mathrm{ion}}_{\mathrm{bare}} = 1 - \frac{\Omega_p^2}{\omega^2}, \tag{26.19}$$

the total dielectric constant (26.14) becomes

$$\epsilon = 1 + \frac{k_0^2}{q^2} - \frac{\Omega_p^2}{\omega^2}, \tag{26.20}$$

and the dielectric constant (26.18) of the dressed ions becomes

$$\epsilon^{\mathrm{ion}}_{\mathrm{dressed}} = 1 - \frac{\Omega_p^2/\epsilon^{\mathrm{el}}}{\omega^2} = 1 - \frac{\omega(\mathbf{q})^2}{\omega^2}, \tag{26.21}$$

where we have used the screening relation (26.3) to introduce the dressed phonon
frequency $\omega(\mathbf{q})$. Note that $\epsilon^{\mathrm{ion}}_{\mathrm{dressed}}$ has the same form as $\epsilon^{\mathrm{ion}}_{\mathrm{bare}}$, with the bare phonon
frequency $\Omega_p$ replaced by the dressed frequency.


15 In terms of polarizabilities this is the reasonable assertion that

$$\alpha^{\mathrm{ion}}_{\mathrm{dressed}} = \frac{\alpha^{\mathrm{ion}}_{\mathrm{bare}}}{\epsilon^{\mathrm{el}}}.$$

16 By using the static electronic dielectric constant we are restricting our attention to disturbances
of wave vector q whose frequencies are low enough to satisfy $\omega \ll qv_F$.

17 Equation (26.19) (like (1.37), from which it is taken) ignores the dependence on wave vector q.
This is valid if the characteristic particle velocity carries one a distance small compared with a wavelength
of the disturbance, in a period of the disturbance: $v/\omega \ll 1/q$, or $\omega \gg qv$. Since a typical ionic velocity
is very much smaller than $v_F$, there is a large range of frequencies and wave vectors for which one may
consistently use $\epsilon(\mathbf{q}, \omega) \approx \epsilon(\mathbf{q}, \omega = 0)$ for the electrons (cf. footnote 16) and $\epsilon(\mathbf{q}, \omega) \approx \epsilon(\mathbf{q} = 0, \omega)$ for the
bare ions.

We may substitute (26.21) into the form (26.16) for the total dielectric constant,
to find:

$$\frac{1}{\epsilon} = \frac{1}{1 + k_0^2/q^2}\,\frac{\omega^2}{\omega^2 - \omega(\mathbf{q})^2}, \tag{26.22}$$


which  is,  of  course,  equivalent  to  (26.20). 
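
The equivalence of the bare-ion and dressed-ion descriptions can be verified numerically. The following minimal sketch assumes the Thomas-Fermi electronic dielectric constant (26.4) and the bare ionic form (26.19), and works in dimensionless units with $k_0 = \Omega_p = 1$ by default; the function names are illustrative.

```python
def eps_el(q, k0=1.0):
    """Thomas-Fermi electronic dielectric constant, Eq. (26.4)."""
    return 1.0 + (k0 / q)**2

def eps_ion_bare(omega, Omega_p=1.0):
    """Dielectric constant of the bare ions, Eq. (26.19)."""
    return 1.0 - (Omega_p / omega)**2

def eps_total_from_bare(q, omega, k0=1.0, Omega_p=1.0):
    """Total dielectric constant built from bare ions, Eq. (26.14)."""
    return eps_el(q, k0) + eps_ion_bare(omega, Omega_p) - 1.0

def eps_total_from_dressed(q, omega, k0=1.0, Omega_p=1.0):
    """Total dielectric constant built from dressed ions, Eqs. (26.16) and (26.21)."""
    omega_q_sq = Omega_p**2 / eps_el(q, k0)        # dressed frequency squared, Eq. (26.3)
    eps_ion_dressed = 1.0 - omega_q_sq / omega**2
    return eps_el(q, k0) * eps_ion_dressed

if __name__ == "__main__":
    for q, w in [(0.1, 0.05), (1.0, 0.5), (3.0, 2.0)]:
        print(q, w, eps_total_from_bare(q, w), eps_total_from_dressed(q, w))
```

Both routes give the same number at every (q, ω) tried, as the algebra requires.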

The  most  important  consequences  of  the  form  (26.22)  of  the  total  dielectric  constant 
arise  in  treating  the  effective  electron-electron  interaction  in  a  metal.  We  therefore 
continue  our  discussion  from  this  point  of  view. 


EFFECTIVE  ELECTRON-ELECTRON  INTERACTION 


In Chapter 17 we argued that for many purposes the Fourier transform of the electron-electron
Coulomb interaction should be screened by the electronic dielectric constant,

$$\frac{4\pi e^2}{k^2} \to \frac{4\pi e^2}{k^2\epsilon^{\mathrm{el}}} = \frac{4\pi e^2}{k^2 + k_0^2}, \tag{26.23}$$

to represent the effect of the other electrons in screening the interaction between a
given pair. However, the ions also screen interactions, and we should have used not
$\epsilon^{\mathrm{el}}$ but the full dielectric constant. Using the form (26.22) we find that (26.23) should
be replaced by

$$\frac{4\pi e^2}{k^2} \to \frac{4\pi e^2}{k^2\epsilon} = \frac{4\pi e^2}{k^2 + k_0^2}\left(1 + \frac{\omega(\mathbf{k})^2}{\omega^2 - \omega(\mathbf{k})^2}\right). \tag{26.24}$$


Thus  the  effect  of  the  ions  is  to  multiply  (26.23)  by  a  correction  factor  that  depends 
on  frequency  as  well  as  wave  vector.  The  frequency  dependence  reflects  the  fact  that 
the  screening  action  of  the  ions  is  not  instantaneous,  but  limited  by  the  (small  on  the 
scale  of  vF)  velocity  of  propagation  of  elastic  waves  in  the  lattice.  As  a  result  the 
part  of  the  effective  electron-electron  interaction  mediated  by  the  ions  is  retarded. 

In  using  (26.24)  as  an  effective  interaction  between  a  pair  of  electrons,  one  needs 
to  know  how  co  and  k  depend  on  the  quantum  numbers  of  the  pair.  From  the  anal¬ 
ysis  of  Chapter  17  we  know  that  when  the  effective  interaction  is  taken  to  have  the 
frequency-independent  form  (26.23),  then  k  is  to  be  taken  as  the  difference  in  the  wave 
vectors  of  the  two  electronic  levels.  By  analogy,  when  the  effective  interaction  is 
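frequency-dependent, we shall take $\omega$ as the difference in the angular frequencies
(i.e., the energies divided by $\hbar$) of the levels. Thus, given two electrons with wave
vectors k and k' and energies $\mathcal{E}_{\mathbf{k}}$ and $\mathcal{E}_{\mathbf{k}'}$, we take their effective interaction to be18

$$v^{\mathrm{eff}}_{\mathbf{k},\mathbf{k}'} = \frac{4\pi e^2}{q^2 + k_0^2}\left[1 + \frac{\omega(\mathbf{q})^2}{\omega^2 - \omega(\mathbf{q})^2}\right], \qquad \mathbf{q} = \mathbf{k} - \mathbf{k}', \quad \omega = \frac{\mathcal{E}_{\mathbf{k}} - \mathcal{E}_{\mathbf{k}'}}{\hbar}. \tag{26.25}$$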


18 This form was arrived at through the work of Fröhlich and of Bardeen and Pines (H. Fröhlich,
Phys. Rev. 79, 845 (1950); J. Bardeen and D. Pines, Phys. Rev. 99, 1140 (1955)). The argument we have given
should be viewed more as an indication of the plausibility of (26.25) than as a derivation. A systematic
derivation of (26.25), together with a specification of the circumstances under which it can be used as an
effective interaction, requires the use of field theoretic ("Green's function") methods.

There are two important qualitative features19 of $v^{\text{eff}}$:

1. The dressed phonon frequency $\omega(\mathbf{q})$ is of order $\omega_D$ or less. Thus when the energies
of the two electrons differ by much more than $\hbar\omega_D$, the phonon correction to
their effective interaction is negligibly small. Since the range of variation of
electronic energies, $\mathcal{E}_F$, is typically $10^2$ to $10^3$ times $\hbar\omega_D$, only electrons with
energies quite close together have an interaction appreciably affected by the
phonons.

2. When, however, the electronic energy difference is less than $\hbar\omega_D$, the phonon
contribution has the opposite sign from the electronically screened interaction,
and is larger in magnitude; i.e., the sign of the effective electron-electron
interaction is reversed. This phenomenon, known as "overscreening," is a crucial
ingredient in the modern theory of superconductivity. We shall return to it in
Chapter 34.
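
The two features above can be checked numerically. The following is a minimal sketch, assuming only the bracketed factor $1 + \omega(\mathbf{q})^2/[\omega^2 - \omega(\mathbf{q})^2]$ of (26.25); the function name `phonon_factor` and the sample frequencies are illustrative choices, not part of the text.

```python
def phonon_factor(omega, omega_q):
    """Bracketed factor of (26.25): 1 + omega_q^2 / (omega^2 - omega_q^2).

    omega   : (E_k - E_k') / hbar, the electronic frequency difference
    omega_q : dressed phonon frequency omega(q), in the same units
    The factor is singular at omega == omega_q, so that point is avoided below.
    """
    return 1.0 + omega_q**2 / (omega**2 - omega_q**2)


if __name__ == "__main__":
    omega_q = 1.0  # illustrative value, arbitrary units
    for omega in (0.0, 0.5, 2.0, 10.0, 100.0):
        factor = phonon_factor(omega, omega_q)
        print(f"omega/omega_q = {omega:6.1f}   factor = {factor:+.4f}")
```

For $\omega < \omega(\mathbf{q})$ the factor is negative (the sign reversal of feature 2), while for $\omega \gg \omega(\mathbf{q})$ it approaches 1, so that only the electronically screened interaction survives (feature 1).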


PHONON CONTRIBUTION TO THE ELECTRONIC
ENERGY-WAVE VECTOR RELATION


Aside from its crucial role in the theory of superconductivity, the effective interaction
(26.25) has important implications for the less dramatic properties of conduction
electrons. In Chapter 17 we noted that the simplest correction to the electronic
energy $\mathcal{E}_{\mathbf{k}}$ due to electron-electron interactions was the exchange term of Hartree-Fock
theory:

$$ \Delta\mathcal{E}_{\mathbf{k}} = -\int \frac{d\mathbf{k}'}{(2\pi)^3}\, \frac{4\pi e^2}{|\mathbf{k} - \mathbf{k}'|^2}\, f(\mathcal{E}_{\mathbf{k}'}). \tag{26.26} $$


We observed that this correction led to a spurious singularity in $\partial\mathcal{E}/\partial k$ at $k = k_F$,
which we removed by screening the interaction in (26.26) with the electronic dielectric
constant. In a more accurate treatment the screening should be described not by the
electronic dielectric constant alone, but by the full dielectric constant of the metal.
This suggests, in conjunction with (26.25), that the screened modification of the
Hartree-Fock exchange term should be taken to be

$$ \Delta\mathcal{E}_{\mathbf{k}} = -\int \frac{d\mathbf{k}'}{(2\pi)^3}\, \frac{4\pi e^2}{|\mathbf{k} - \mathbf{k}'|^2 + k_0^2}\, f(\mathcal{E}_{\mathbf{k}'}) \left\{ 1 + \frac{\omega(\mathbf{k} - \mathbf{k}')^2}{[(\mathcal{E}_{\mathbf{k}} - \mathcal{E}_{\mathbf{k}'})/\hbar]^2 - \omega(\mathbf{k} - \mathbf{k}')^2} \right\}. \tag{26.27} $$

Because the ionic contribution to the screening depends on the electronic energy,
this is a rather complicated integral equation for $\mathcal{E}_{\mathbf{k}}$. However, by exploiting the fact
that the phonon energy $\hbar\omega(\mathbf{k} - \mathbf{k}')$ is very small compared with $\mathcal{E}_F$, it is possible to


19 The vanishing of (26.25) at $\omega = 0$ should not be taken seriously. It would mean that for very
slowly varying disturbances the ions have time to adjust themselves so as to cancel perfectly the field of
the electrons. This cannot be so, if only because electrons are point particles, while ions have impenetrable
cores. This was ignored in the derivation of the bare ionic dielectric constant, which took into account
only the Coulomb interactions between ions. More accurate calculations that take finite core effects into
account eliminate the perfect cancellation.

extract the most important information from (26.27) without solving the full integral
equation (see Problem 3). The most important conclusions are:

1. The value of $\mathcal{E}_F$ and the shape of the Fermi surface are unaffected by the ionic
correction to the screening; i.e., they are correctly given by ignoring the second term
within the braces in (26.27).

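2. When $\mathcal{E}_{\mathbf{k}}$ is close to $\mathcal{E}_F$ on the scale of $\hbar\omega_D$, one finds that

$$ \mathcal{E}_{\mathbf{k}} - \mathcal{E}_F = \frac{\mathcal{E}^{TF}_{\mathbf{k}} - \mathcal{E}_F}{1 + \lambda}, \tag{26.28} $$

where $\mathcal{E}^{TF}_{\mathbf{k}}$ is the energy calculated in the absence of the ionic correction to the
screening, and $\lambda$ is given by an integral over the Fermi surface:

$$ \lambda = \int \frac{dS'}{8\pi^3 \hbar v(\mathbf{k}')}\, \frac{4\pi e^2}{(\mathbf{k} - \mathbf{k}')^2 + k_0^2}. \tag{26.29} $$

In particular, this means that the phonon corrections to the electronic velocity and
density of levels at the Fermi surface are given by20

$$ \mathbf{v}(\mathbf{k}) = \frac{1}{\hbar}\frac{\partial\mathcal{E}}{\partial\mathbf{k}} = \frac{1}{1 + \lambda}\,\mathbf{v}^0(\mathbf{k}), \qquad g(\mathcal{E}_F) = (1 + \lambda)\, g^0(\mathcal{E}_F). \tag{26.30} $$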


These corrections apply only to one-electron energy levels well within $\hbar\omega_D$ of $\mathcal{E}_F$.
However, at temperatures well below room temperature ($k_BT \ll \hbar\omega_D$) these are
precisely the electronic levels that determine the great bulk of metallic properties, and
therefore corrections due to ionic screening must be taken into account. This becomes
particularly clear when we estimate the size of $\lambda$.

Since $k_0$ is of order $k_F$ (see (17.55)), we have that

$$ \lambda \lesssim \frac{4\pi e^2}{k_0^2} \int \frac{dS'}{8\pi^3 \hbar v(\mathbf{k}')}. \tag{26.31} $$

However, from (17.50) and (8.63), we find:

$$ \frac{4\pi e^2}{k_0^2} = \frac{\partial\mu}{\partial n} = \frac{1}{g(\mathcal{E}_F)} = \left[\int \frac{dS'}{4\pi^3 \hbar v(\mathbf{k}')}\right]^{-1}. \tag{26.32} $$

Thus $\lambda$ in this simple model is less than, but of order, unity. As a result, in many
metals the correction due to ionic screening of the electron-electron interaction (more
commonly known as the phonon correction) is the major reason for deviations of
the density of levels from its free electron value, being more important than either
band structure effects or corrections due to direct electron-electron interactions.21
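
As a brief worked check of this estimate (an added step, assuming only (26.31) and (26.32) as stated), note that the surface integral in (26.31) is exactly half of the one whose inverse appears in (26.32):

$$ \lambda \lesssim \frac{4\pi e^2}{k_0^2} \int \frac{dS'}{8\pi^3 \hbar v(\mathbf{k}')} = \left[\int \frac{dS'}{4\pi^3 \hbar v(\mathbf{k}')}\right]^{-1} \cdot \frac{1}{2} \int \frac{dS'}{4\pi^3 \hbar v(\mathbf{k}')} = \frac{1}{2}. $$

Within this simple model the bound is therefore $\lambda \lesssim 1/2$, which is the sense in which $\lambda$ is less than, but of order, unity.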


20 We assume a spherical Fermi surface, so $\lambda$ is constant. The superscript 0 indicates the
Thomas-Fermi value.

21 In determining the effect of the electron-phonon interaction on various one-electron properties
it is not enough simply to replace the uncorrected density of levels by Eq. (26.30). One must, in general,
reexamine the full derivation in the presence of the effective interaction (26.27). One finds, for example,
that the specific heat (Eq. (2.80)) should be corrected by the factor $(1 + \lambda)$, but that the Pauli susceptibility
(Eq. (31.69)) should not (see Chapter 31, page 663, footnote 29).

3. When $\mathcal{E}_{\mathbf{k}}$ is several times $\hbar\omega_D$ from $\mathcal{E}_F$, then

(26.33)

and the phonon correction rapidly becomes insignificant.

These results are summarized in Figure 26.1.


Figure 26.1

Correction to the electronic $\mathcal{E}$ vs. $k$ relation due to screening by the ions
(electron-phonon correction). The correction (lighter curve) is appreciable only within
$\hbar\omega_D$ of $\mathcal{E}_F$, where the slope of the uncorrected curve can be considerably reduced.


THE  ELECTRON-PHONON  INTERACTION 


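According to (26.27) the ionic screening adds to the energy of an electron with wave
vector $\mathbf{k}$ an amount

$$ v^{\text{eff}}_{\mathbf{k},\mathbf{k}'} = \frac{1}{V} \left( \frac{4\pi e^2}{(\mathbf{k} - \mathbf{k}')^2 + k_0^2} \right) \left( \frac{[\hbar\omega(\mathbf{k} - \mathbf{k}')]^2}{[\hbar\omega(\mathbf{k} - \mathbf{k}')]^2 - (\mathcal{E}_{\mathbf{k}} - \mathcal{E}_{\mathbf{k}'})^2} \right) \tag{26.34} $$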


for each occupied electronic level $\mathbf{k}'$ (with the same spin). There is, however, an
alternative way of deriving the effect of lattice deformations on the electronic energy.
Without making any explicit reference to screening, one can simply calculate the
change in the total energy of the metal due to the fact that the electrons (charged
particles) can interact with the phonons (density waves in the charged lattice of ions).
If this interaction is described by some interaction Hamiltonian $V^{ep}$, then the change
in the energy of the metal due to the interaction will be given in second-order
perturbation theory by an expression of the form:

$$ \Delta E = \sum_i \frac{|\langle 0 | V^{ep} | i \rangle|^2}{E_0 - E_i}. \tag{26.35} $$


The most important excited states $|i\rangle$ are taken to be those in which an electron,
which had wave vector $\mathbf{k}$ in the ground state, has emitted a phonon with wave vector
$\mathbf{q}$, ending up itself with wave vector $\mathbf{k}'$ in the excited state. (Processes in which a
phonon is absorbed cannot occur at $T = 0$ since no phonons are present. Multiphonon
transitions can be shown to be of considerably less importance.)

Since total crystal momentum must be conserved,22 it follows that $\mathbf{k}' + \mathbf{q} = \mathbf{k}$,
to within a reciprocal lattice vector. The energy of the intermediate state differs from
that of the ground state by the energy of the extra phonon and the new electronic
level, minus the energy of the old electronic level:

$$ E_i - E_0 = \mathcal{E}_{\mathbf{k}'} + \hbar\omega(\mathbf{k} - \mathbf{k}') - \mathcal{E}_{\mathbf{k}}. \tag{26.36} $$


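One such intermediate state is possible for every pair of occupied and unoccupied
one-electron levels in the original ground-state configuration. If we let $g_{\mathbf{k}\mathbf{k}'}$ be the
matrix element of $V^{ep}$ between the ground state and such an excited state, then
the sum on $i$ in (26.35) is just a sum over all pairs of wave vectors of occupied and
unoccupied levels, and we have:

$$ \Delta E = \sum_{\mathbf{k}\mathbf{k}'} n_{\mathbf{k}} (1 - n_{\mathbf{k}'}) \frac{|g_{\mathbf{k}\mathbf{k}'}|^2}{\mathcal{E}_{\mathbf{k}} - \mathcal{E}_{\mathbf{k}'} - \hbar\omega(\mathbf{k} - \mathbf{k}')}. \tag{26.37} $$

It is natural23 to identify the $v^{\text{eff}}$ in (26.34) with

$$ \frac{\partial^2 \Delta E}{\partial n_{\mathbf{k}}\, \partial n_{\mathbf{k}'}}. \tag{26.38} $$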


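Equation (26.37) then gives:

$$ v^{\text{eff}}_{\mathbf{k}\mathbf{k}'} = -|g_{\mathbf{k}\mathbf{k}'}|^2 \left[ \frac{1}{\mathcal{E}_{\mathbf{k}} - \mathcal{E}_{\mathbf{k}'} - \hbar\omega(\mathbf{k} - \mathbf{k}')} + \frac{1}{\mathcal{E}_{\mathbf{k}'} - \mathcal{E}_{\mathbf{k}} - \hbar\omega(\mathbf{k}' - \mathbf{k})} \right] = |g_{\mathbf{k}\mathbf{k}'}|^2\, \frac{2\hbar\omega(\mathbf{k} - \mathbf{k}')}{[\hbar\omega(\mathbf{k} - \mathbf{k}')]^2 - (\mathcal{E}_{\mathbf{k}} - \mathcal{E}_{\mathbf{k}'})^2}. \tag{26.39} $$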
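
The passage from the two energy denominators to the single fraction in (26.39) is elementary algebra. Spelled out as an added check, with the shorthand $a = \mathcal{E}_{\mathbf{k}} - \mathcal{E}_{\mathbf{k}'}$ and $w = \hbar\omega(\mathbf{k} - \mathbf{k}')$ (introduced here only for this step), and assuming $\omega(-\mathbf{q}) = \omega(\mathbf{q})$ and $|g_{\mathbf{k}\mathbf{k}'}| = |g_{\mathbf{k}'\mathbf{k}}|$:

$$ -\left[ \frac{1}{a - w} + \frac{1}{-a - w} \right] = -\frac{(-a - w) + (a - w)}{(a - w)(-a - w)} = -\frac{-2w}{w^2 - a^2} = \frac{2w}{w^2 - a^2}. $$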


22  See  Appendix  M,  page  788. 

23 This argument (which is not rigorous) is in the spirit of the Landau approach (Chapter 17),
which holds that the appropriate quasiparticle energies follow from the first derivative (with respect to
occupation number) of (26.37) and the modification of those energies due to the occupation of those levels,
from the second derivative.

By requiring this form of the effective interaction to agree with the result (26.34),
we can deduce the electron-phonon coupling constant:

$$ |g_{\mathbf{k}\mathbf{k}'}|^2 = \frac{1}{V}\, \frac{4\pi e^2}{|\mathbf{k} - \mathbf{k}'|^2 + k_0^2}\, \frac{1}{2} \hbar\omega(\mathbf{k} - \mathbf{k}'). \tag{26.40} $$


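The most important feature of this result is that, in the limit of small $|\mathbf{k} - \mathbf{k}'|$,
$g^2$ vanishes linearly with $|\mathbf{k} - \mathbf{k}'|$. If we write the free electron estimate (26.6) for $k_0$
in the form:

$$ \frac{4\pi e^2}{k_0^2} = \frac{2\mathcal{E}_F}{3 n_e}, \tag{26.41} $$

we find that

$$ |g_{\mathbf{k}\mathbf{k}'}|^2 \approx \frac{\hbar\omega(\mathbf{k} - \mathbf{k}')\,\mathcal{E}_F}{3 n_e V} = \frac{\hbar\omega(\mathbf{k} - \mathbf{k}')\,\mathcal{E}_F}{3NZ}, \qquad |\mathbf{k} - \mathbf{k}'| \ll k_0. \tag{26.42} $$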


The  fact  that  the  square  of  the  electron-phonon  coupling  constant  vanishes  linearly 
with  the  wave  vector  of  the  phonon  has  important  consequences  for  the  theory  of 
the  electrical  resistivity  of  a  metal. 


THE  TEMPERATURE-DEPENDENT  ELECTRICAL  RESISTIVITY 
OF  METALS 

We  have  noted24  that  Bloch  electrons  in  a  perfect  periodic  potential  can  sustain  an 
electric current even in the absence of any driving electric field; i.e., their conductivity
is  infinite.  The  finite  conductivity  of  metals  is  entirely  due  to  deviations  in  the  lattice 
of  ions  from  perfect  periodicity.  The  most  important  such  deviation  is  that  associated 
with  the  thermal  vibrations  of  the  ions  about  their  equilibrium  positions,  for  it  is  an 
intrinsic  source  of  resistivity,  present  even  in  a  perfect  sample  free  from  such  crystal 
imperfections  as  impurities,  defects,  and  boundaries. 

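The quantitative theory of the temperature dependence of the resistivity provided
by the lattice vibrations starts from the observation that the periodic potential of a
set of rigid ions,

$$ U^{\text{per}}(\mathbf{r}) = \sum_{\mathbf{R}} V(\mathbf{r} - \mathbf{R}), \tag{26.43} $$

is only an approximation to the true, aperiodic potential:

$$ U(\mathbf{r}) = \sum_{\mathbf{R}} V[\mathbf{r} - \mathbf{R} - \mathbf{u}(\mathbf{R})] = U^{\text{per}}(\mathbf{r}) - \sum_{\mathbf{R}} \mathbf{u}(\mathbf{R}) \cdot \nabla V(\mathbf{r} - \mathbf{R}) + \cdots. \tag{26.44} $$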

The  difference  between  these  two  forms  can  be  considered  as  a  perturbation  that  acts 
on  the  stationary  one-electron  levels  of  the  periodic  Hamiltonian,  causing  transitions 
among  Bloch  levels  that  lead  to  the  degradation  of  currents. 


24 See, for example, Chapter 12, page 215.

As is generally the case with transitions caused by lattice vibrations, they can be
considered here as processes in which an electron absorbs or emits a phonon (or
phonons), changing its energy by the phonon energy and its wave vector (to within
a reciprocal lattice vector) by the phonon wave vector. Indeed, this picture of the
scattering of electrons by lattice vibrations is very similar to the picture in Chapter 24
of the scattering of neutrons by lattice vibrations.

The simplest theories of the lattice contribution to the resistivity of metals assume
that the scattering is dominated by processes in which an electron emits (or absorbs)
a single phonon. If the electronic transition is from a level with wave vector $\mathbf{k}$ and
energy $\mathcal{E}_{\mathbf{k}}$ to one with wave vector $\mathbf{k}'$ and energy $\mathcal{E}_{\mathbf{k}'}$, then energy and crystal momentum
conservation25 require that the energy of the phonon involved satisfy

$$ \mathcal{E}_{\mathbf{k}} = \mathcal{E}_{\mathbf{k}'} \pm \hbar\omega(\mathbf{k} - \mathbf{k}'), \tag{26.45} $$

where the plus (minus) sign is appropriate to phonon emission (absorption) (and
where we assume that $\omega(-\mathbf{q}) = \omega(\mathbf{q})$). This equation can be viewed as a constraint
on the wave vectors $\mathbf{q}$ of phonons capable of participating in a one-phonon process
with an electron with wave vector $\mathbf{k}$, namely

$$ \omega(\mathbf{q}) = \pm \frac{1}{\hbar} \left[ \mathcal{E}_{\mathbf{k}+\mathbf{q}} - \mathcal{E}_{\mathbf{k}} \right]. \tag{26.46} $$

As in the case of neutron scattering, this constraint, being a single restriction,
determines a two-dimensional surface of allowed wave vectors in the three-dimensional
phonon wave vector space. Indeed, since $\hbar\omega(\mathbf{q})$ is a minute energy on the electronic
energy scale, the surface of allowed $\mathbf{q}$ for a given $\mathbf{k}$ is very close to the set of vectors
connecting $\mathbf{k}$ to all other points on the constant energy surface $\mathcal{E}_{\mathbf{k}'} = \mathcal{E}_{\mathbf{k}}$ (Figure 26.2).


Figure 26.2

Construction of the wave vectors of those phonons allowed by the conservation laws to
participate in a one-phonon scattering event with an electron with wave vector $\mathbf{k}$.
Because the phonon energy is at most $\hbar\omega_D \ll \mathcal{E}_F$, the surface containing the tips of
the phonon wave vectors originating from $\mathbf{k}$ differs only slightly from the Fermi
surface. At temperatures well below $\Theta_D$ the only phonons that can actually participate
in scattering events have wave vectors whose tips lie within the small sphere of
size $k_BT/\hbar c$ about the tip of the wave vector $\mathbf{k}$.


25 See Appendix M.

At high temperatures ($T \gg \Theta_D$) the number of phonons in any normal mode is
given by

$$ n(\mathbf{q}) = \frac{1}{e^{\hbar\omega(\mathbf{q})/k_BT} - 1} \approx \frac{k_BT}{\hbar\omega(\mathbf{q})}. \tag{26.47} $$

Thus the total number of phonons on the surface of allowed wave vectors for the
scattering of a given electron is directly proportional to $T$. Since the number of
scatterers increases linearly with $T$, so will the resistivity:

$$ \rho \sim T, \qquad T \gg \Theta_D. \tag{26.48} $$
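
The quality of the high-temperature approximation in (26.47) is easy to check numerically. The sketch below is illustrative only: the function names and the sample values of $x = \hbar\omega(\mathbf{q})/k_BT$ are assumptions made for this example.

```python
import math


def bose_occupation(x):
    """Mean phonon number 1/(exp(x) - 1), with x = hbar*omega(q)/(k_B*T)."""
    return 1.0 / math.expm1(x)


def classical_occupation(x):
    """High-temperature limit k_B*T/(hbar*omega(q)) = 1/x of (26.47)."""
    return 1.0 / x


if __name__ == "__main__":
    # Small x corresponds to T well above the temperature hbar*omega/k_B.
    for x in (0.01, 0.1, 1.0, 5.0):
        exact = bose_occupation(x)
        approx = classical_occupation(x)
        print(f"x = {x:5.2f}   exact = {exact:10.4f}   k_B T/(hbar omega) = {approx:10.4f}")
```

For $x \ll 1$ the two expressions agree to within a fraction of order $x/2$, while for $x$ of several units the exact occupation is exponentially smaller than the linear estimate, which is why the linear-in-$T$ argument is restricted to $T \gg \Theta_D$.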


At low temperatures ($T \ll \Theta_D$) things are rather more complicated. We first note
that only phonons with $\hbar\omega(\mathbf{q})$ comparable to or less than $k_BT$ can be absorbed or
emitted by electrons. In the case of absorption this is immediately obvious, since
these are the only phonons present in appreciable numbers. It is also true in the
case of emission, for in order to emit a phonon an electron must be far enough above
the Fermi level for the final electronic level (whose energy is lower by $\hbar\omega(\mathbf{q})$) to be
unoccupied; since levels are occupied only to within order $k_BT$ above $\mathcal{E}_F$, and
unoccupied only to within order $k_BT$ below, only phonons with energies $\hbar\omega(\mathbf{q})$ of order
$k_BT$ can be emitted.

Well below the Debye temperature, the condition $\hbar\omega(\mathbf{q}) \lesssim k_BT$ requires $q$ to be
small compared with $k_D$. In this regime $\omega$ is of order $cq$, so the wave vectors $\mathbf{q}$ of the
phonons are of order $k_BT/\hbar c$ or less. Thus within the surface of phonons that the
conservation laws permit to be absorbed or emitted, only a subsurface of linear
dimensions proportional to $T$, and hence of area proportional to $T^2$, can actually
participate.

We conclude that the number of phonons that can scatter an electron declines
as $T^2$ well below the Debye temperature. However, the electronic scattering rate
declines even faster, for when $q$ is small the square of the electron-phonon coupling
constant (26.42) vanishes linearly with $q$. Well below $\Theta_D$ the physically relevant
phonons have wave vectors $q$ of order $k_BT/\hbar c$, and therefore the scattering rate
(which is proportional to the square of the coupling constant) for those processes
that can take place declines linearly with $T$.

Combining these two features, we conclude that for $T$ well below $\Theta_D$ the net
electron-phonon scattering rate declines as $T^3$:

$$ \frac{1}{\tau} \sim T^3, \qquad T \ll \Theta_D. \tag{26.49} $$


However, low-temperature electron-phonon scattering is one of those cases in
which the rate at which the current degrades is not simply proportional to the
scattering rate. This is because well below $\Theta_D$ any given one-phonon process can change
the electronic wave vector by only a very small amount (namely the wave vector of
the participating phonon, which is small compared with $k_D$ or $k_F$). Provided that the
electronic velocity $\mathbf{v}(\mathbf{k})$ does not undergo large variations between Fermi surface
points separated by very small $\mathbf{q}$, the velocity will also not change very much in a
single scattering event. Thus as the temperature declines, the scattering becomes more
concentrated in the forward direction and is therefore less effective in degrading a
current.

The quantitative consequences of this for the low-temperature phonon resistivity
are suggested by the analysis of Chapter 16 (pages 324-326). There we showed, in
the case of elastic scattering in an isotropic metal, that the effective scattering rate
appearing in the resistivity is proportional to an angular average of the actual
scattering rate, weighted with the factor $1 - \cos\theta$, where $\theta$ is the scattering angle (Figure
26.3). At very low temperatures phonon scattering is very nearly elastic (the energy
change being small compared with $\hbar\omega_D$), and we can apply this result with some
confidence, at least in metals with isotropic Fermi surfaces. Since $\sin(\theta/2) = q/2k_F$
(Figure 26.3), $1 - \cos\theta = 2\sin^2(\theta/2) = \tfrac{1}{2}(q/k_F)^2$. But $q = O(k_BT/\hbar c)$ for $T$ well
below $\Theta_D$, and this introduces a final factor of $T^2$ into the low-temperature resistivity.
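
The geometric weight in this argument can be evaluated directly. The following is a small sketch, assuming elastic scattering on a spherical Fermi surface of radius $k_F$; the function name `transport_weight` and the choice of units with $k_F = 1$ are illustrative assumptions.

```python
import math


def transport_weight(q, k_F):
    """Weight 1 - cos(theta) for elastic scattering through wave vector q.

    The scattering angle follows from sin(theta/2) = q / (2 k_F),
    valid for 0 <= q <= 2 k_F on a spherical Fermi surface.
    """
    theta = 2.0 * math.asin(q / (2.0 * k_F))
    return 1.0 - math.cos(theta)


if __name__ == "__main__":
    k_F = 1.0  # units in which k_F = 1
    for q in (0.4, 0.2, 0.1):
        w = transport_weight(q, k_F)
        print(f"q/k_F = {q:4.2f}   1 - cos(theta) = {w:.6f}   (q/k_F)^2/2 = {0.5 * q * q:.6f}")
```

Since the typical $q$ is proportional to $T$ well below $\Theta_D$, halving the temperature halves $q$ and reduces the weight by a factor of four, which is the additional factor of $T^2$.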

Figure 26.3

Small-angle scattering on a spherical Fermi surface. Since the scattering is nearly elastic,
$k \approx k' \approx k_F$. When the phonon wave vector $\mathbf{q}$ (and hence $\theta$) is small, we have
$\theta/2 \approx q/2k_F$. (Figure label: $|\mathbf{q}| = 2k_F \sin(\theta/2)$.)

The additional factor of $T^2$, expressing the growing predominance of forward
scattering with declining temperature, arises even in anisotropic metals (with certain
exceptions noted in the following section). When combined with the $T^3$ dependence
of the scattering rate, it leads to the "Bloch $T^5$ law":

$$ \rho \sim T^5, \qquad T \ll \Theta_D. \tag{26.50} $$
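
Collecting the three temperature-dependent factors identified in the preceding paragraphs gives the exponent directly. This is only a bookkeeping summary of the argument above, not an independent derivation:

$$ \rho \;\propto\; \underbrace{T^2}_{\text{area of accessible phonon surface}} \times \underbrace{T}_{|g|^2 \,\propto\, q} \times \underbrace{T^2}_{1 - \cos\theta \,\propto\, q^2} \;=\; T^5, \qquad T \ll \Theta_D. $$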


MODIFICATION  OF  THE  T5  LAW  BY  UMKLAPP  PROCESSES 

The factor of $T^2$ in the low-temperature resistivity due to the dominance of forward
scattering depends on the assumption that electronic levels with nearly the same
wave vectors have nearly the same velocities. For sufficiently complex Fermi surfaces,
or when interband scattering is possible, this need not be so. In such cases the current
can still be efficiently degraded even though the change in wave vector (but not
velocity) is small in each scattering event, and the low-temperature resistivity need
not decline as rapidly as $T^5$.

One of the most important examples of a small change in wave vector causing
a large change in velocity arises when a nearly free electron Fermi surface approaches
a Bragg plane (Figure 26.4). A small wave vector $\mathbf{q}$ can then join Fermi surface
levels on opposite sides of the plane, with nearly oppositely directed velocities. Such
events are called "umklapp processes."26 Within the nearly free electron point of
view, one can regard the large change in velocity as due to a phonon-induced Bragg
reflection.27

Figure 26.4

A simple umklapp process. The wave vectors $\mathbf{k}$ and $\mathbf{k} + \mathbf{q}$ differ by an amount
small compared with $k_F$ (or $k_D$), but the velocities $\mathbf{v}(\mathbf{k})$ and $\mathbf{v}(\mathbf{k} + \mathbf{q})$ are not close
together.


PHONON  DRAG 

Peierls28 pointed out a way in which the low-temperature resistivity might decline
more rapidly than $T^5$. This behavior has yet to be observed, owing, presumably, to


26 Compare the discussion in Chapter 25, pages 501-502.

27 If the Fermi surface is displayed in the first zone rather than in an extended-zone scheme, then the
change in wave vector is only small modulo a reciprocal lattice vector. Umklapp processes are sometimes
said to be those in which the additive reciprocal lattice vector appearing in the crystal momentum
conservation law is nonzero. As emphasized in Chapter 25, this distinction is not independent of the choice
of primitive cell. The crucial point for electron-phonon scattering is whether small changes in electron
crystal momentum (to within a possible additive reciprocal lattice vector) can result in large changes in
electron velocity. When put this way, the criterion is independent of primitive cell.

28 R. E. Peierls, Ann. Phys. (5) 12, 154 (1932).

its being masked by temperature-independent scattering by defects (which eventually
dominates the resistivity at low enough temperatures).

The derivation of the $T^5$ law assumes that the phonons are in thermal equilibrium,
whereas in fact the nonequilibrium nature of the current-carrying electronic
distribution should lead, through electron-phonon scattering, to a phonon distribution
that is also out of equilibrium. Suppose (to take a simple case) that the Fermi surface
lies within the first Brillouin zone. We define umklapp processes as those in which
total crystal momentum is not conserved, under the convention that the primitive
cell in which individual electron and phonon wave vectors are specified is the first
zone. If the total crystal momentum of the combined electron-phonon system were
initially nonzero, then in the absence of umklapp processes it would remain nonzero at
all subsequent times, even in the absence of an electric field,29 and the electron-phonon
system could not come to complete thermal equilibrium. Instead, the electrons and
phonons would drift along together, maintaining their nonzero crystal momentum
and also a nonzero electric current.

Metals (free from defects) have finite conductivities only because umklapp
processes can occur. These do degrade the total crystal momentum and make it possible
for a current to decay in the absence of a driving electric field. If, however, the Fermi
surface is within the interior of the zone, then there is a minimum phonon wave
vector and energy (Figure 26.5) below which umklapp processes cannot occur. When
$k_BT$ is well below this energy, the number of phonons available for such events should
become proportional to $\exp(-\hbar\omega_{\min}/k_BT)$, and therefore the resistivity should drop
exponentially in $1/T$.
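
To see how much more abrupt this exponential freeze-out is than a $T^5$ power law, one can compare the two under a halving of the temperature. The sketch below is illustrative only; the function name and the characteristic temperature `T_min` (defined here as $\hbar\omega_{\min}/k_B$) are assumptions made for this example, not values from the text.

```python
import math


def umklapp_phonon_factor(T, T_min):
    """Boltzmann factor exp(-hbar*omega_min/(k_B*T)), with T_min = hbar*omega_min/k_B."""
    return math.exp(-T_min / T)


if __name__ == "__main__":
    T_min = 10.0  # illustrative value, arbitrary temperature units
    for T in (5.0, 2.0, 1.0):
        drop_exponential = umklapp_phonon_factor(T, T_min) / umklapp_phonon_factor(T / 2.0, T_min)
        drop_power_law = 2.0**5  # reduction of T^5 when T is halved
        print(f"T = {T:4.1f}   exponential drop on halving T = {drop_exponential:.3e}   T^5 drop = {drop_power_law:.0f}")
```

Halving the temperature always reduces $T^5$ by the fixed factor 32, whereas the exponential factor falls by $\exp(T_{\min}/T)$, which exceeds 32 once $T$ is below about $T_{\min}/3.5$ and grows without bound as $T \to 0$.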

Figure 26.5

Extended-zone picture of a metal whose Fermi surface is completely contained in the
first zone. Here $q_{\min}$ is the minimum wave vector for a phonon that can participate in
an umklapp process. At temperatures below those corresponding to the energy of this
phonon, the contribution from umklapp scattering should drop exponentially.

29 Compare the very similar discussion of the thermal conductivity of an insulator in Chapter 25.

PROBLEMS


1. A More Detailed Treatment of the Phonon Dispersion Relation in Metals
In deriving the Bohm-Staver relation (26.8) we regarded the ions as point particles, interacting
only through Coulomb forces. A more realistic model would take the ions as extended
distributions of charge, and allow for the impenetrability of the ion cores by an effective ion-ion interaction
in addition to the Coulomb interaction. Since the core-core repulsion is short-ranged, it leads
to no difficulties in the usual treatment of lattice vibrations, and can be described by a dynamical
matrix $D^c$ in the manner described in Chapter 22. We can therefore treat lattice vibrations in
metals by the methods of Chapter 22, provided that we take the full dynamical matrix $D$ to be
$D^c$ plus a term arising from the Coulomb interactions between the ionic charge distributions, as
screened by the electrons.

Let us take the ion at position $\mathbf{R} + \mathbf{u}(\mathbf{R})$ to have a charge distribution $\rho[\mathbf{r} - \mathbf{R} - \mathbf{u}(\mathbf{R})]$, so
that the electrostatic force on such an ion is given by $\int d\mathbf{r}\, \mathbf{E}(\mathbf{r})\, \rho[\mathbf{r} - \mathbf{R} - \mathbf{u}(\mathbf{R})]$, where $\mathbf{E}(\mathbf{r})$ is the
electric field reduced by electronic screening,30 due to all the other ions (whose charge density is
$\sum_{\mathbf{R}' \neq \mathbf{R}} \rho[\mathbf{r} - \mathbf{R}' - \mathbf{u}(\mathbf{R}')]$).

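(a) Expand this additional electrostatic interaction to linear order in the ionic displacements
$\mathbf{u}$, and, assuming that the electronic screening is described by the static dielectric constant31 $\epsilon(\mathbf{q})$,
show that the dynamical matrix appearing in Eq. (22.57) must now be taken to be

$$ D_{\mu\nu}(\mathbf{k}) = D^c_{\mu\nu}(\mathbf{k}) + V_{\mu\nu}(\mathbf{k}) + \sum_{\mathbf{K} \neq 0} \left[ V_{\mu\nu}(\mathbf{k} + \mathbf{K}) - V_{\mu\nu}(\mathbf{K}) \right], \qquad V_{\mu\nu}(\mathbf{q}) = \frac{4\pi n\, q_\mu q_\nu\, |\rho(\mathbf{q})|^2}{q^2 \epsilon(\mathbf{q})}. \tag{26.51} $$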


(b) Show that if the electronic screening is not taken into account ($\epsilon = 1$), Eq. (26.51) predicts
in the long-wavelength limit a longitudinal normal mode at the ionic plasma frequency (26.2).

(c)  Show  that  when  screening  is  taken  into  account  via  the  Thomas-Fermi  dielectric  function 
(26.4),  all  phonon  frequencies  vanish  linearly  with  k  at  long  wavelengths,  though  the  dispersion 
relation  is  not  of  the  simple  Bohm-Staver  form  (26.5). 

2. Electronic vs. Ionic Contributions to the Specific Heat of Metals

(a) Estimating the sound velocity in a metal by the Bohm-Staver relation (26.8), show that

(26.52)


30 The screening theory of Chapter 17 was based on the assumption that the external potential was
a weak perturbation on the electron gas. Since this is not the case for the potential of the ions, a relation
of the form $\phi^{\text{total}}(\mathbf{q}) = (1/\epsilon)\,\phi^{\text{ion}}(\mathbf{q})$ is not strictly valid. One can find a linear relation between the deviations
of the total and ionic potentials from their equilibrium values. To derive it, however, one must take the
system perturbed by the ions to be not a free electron gas, but a gas of electrons in the presence of the
full equilibrium periodic potential. The formula describing the screening is then more complex. These
extra difficulties are referred to as band structure effects. We ignore them in this problem.

31 More precisely, we should use the frequency-dependent dielectric function $\epsilon(\mathbf{q}, \omega)$, where $\omega$ is the
frequency of the normal mode being examined. When, however, $\omega$ is less than $\omega_D$, the frequency dependence
of the dielectric function (17.60) is quite negligible. This observation is the analytic justification of the
adiabatic approximation.

(b) Using this result and Eq. (23.30), show that the low-temperature electronic and ionic
contributions to the specific heat are related by


ion 

cv 


(26.53) 


(c) Estimating the mass of the ion by $AM_p$, where $A$ is the mass number and $M_p$ is the proton
mass ($M_p = 1836\,m$), show that the electronic specific heat exceeds the ionic contribution when
the temperature falls below

T0  =  5.3Z1'2  4  (yj  x  102  K.  (26.54) 


(d)  Evaluate  T0  for  sodium,  aluminum,  and  lead. 

(e)  Show  that  the  magnitude  of  the  leading  (cubic)  term  in  the  lattice  specific  heat  is  bigger 
than  the  magnitude  of  the  cubic  correction  to  the  electronic  specific  heat  (Eq.  (2.102),  evaluated  in 
the  free  electron  approximation)  by  a  factor 


i  (  m  V2 

Z  \Zl/3m) 


(26.55) 


3.  Phonon  Corrections  to  the  Electronic  Energy 

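In the limit $\omega_D \to 0$, the correction (26.27) to the electronic energy reduces to that of the
Hartree-Fock approximation as modified by Thomas-Fermi screening (Chapter 17, page 344):

$$ \Delta\mathcal{E}^{TF}_{\mathbf{k}} = -\int_{k' < k_F} \frac{d\mathbf{k}'}{(2\pi)^3}\, \frac{4\pi e^2}{|\mathbf{k} - \mathbf{k}'|^2 + k_0^2}. \tag{26.56} $$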


When the phonon frequencies are not considered vanishingly small, (26.27) differs appreciably
from (26.56) only for those values of the integration variable $\mathbf{k}'$ for which $\mathcal{E}_{\mathbf{k}'}$ is within $O(\hbar\omega_D)$ of
$\mathcal{E}_{\mathbf{k}}$. Since $\hbar\omega_D$ is small compared with $\mathcal{E}_F$, the region of $\mathbf{k}'$ for which the correction is appreciable
is a shell about the surface $\mathcal{E}_{\mathbf{k}'} = \mathcal{E}_{\mathbf{k}}$ that is thin (on the scale of the dimensions of the zone). We
may exploit this fact to simplify the correction term by writing the integral over $\mathbf{k}'$ as an integral
over energy $\mathcal{E}'$, and an integral over the constant-energy surfaces $\mathcal{E}_{\mathbf{k}'} = \mathcal{E}'$. As $\mathcal{E}'$ varies, the variation
of the term in $(\mathcal{E}_{\mathbf{k}} - \mathcal{E}')^2$ in the denominator of (26.27) is very important, since the denominator
vanishes within this range. However, the remaining $\mathcal{E}'$ dependence of the integrand (due to the
fact that $\mathbf{k}'$ is constrained to a surface of energy $\mathcal{E}'$) leads to very little variation as $\mathcal{E}'$ varies within
$O(\hbar\omega_D)$ of $\mathcal{E}_{\mathbf{k}}$. It is thus a good approximation to replace the $\mathbf{k}'$ integrations over the surfaces
$\mathcal{E}_{\mathbf{k}'} = \mathcal{E}'$ by integrations over the single surface $\mathcal{E}_{\mathbf{k}'} = \mathcal{E}_{\mathbf{k}}$. After this replacement, the only $\mathcal{E}'$
dependence left comes from the explicit term in $\mathcal{E}'$ in the denominator. The integral over $\mathcal{E}'$ is then easily
performed.

(a) Show that under this approximation,

(26.57)

(b) Show that (26.57) immediately implies that the phonon-corrected Fermi surface $\mathcal{E}_{\mathbf{k}} = \mathcal{E}_F$
is identical to the uncorrected Fermi surface, $\mathcal{E}^{TF}_{\mathbf{k}} = \mathcal{E}_F$.

(c) Show that when $\mathcal{E}_{\mathbf{k}}$ is several times $\hbar\omega_D$ from $\mathcal{E}_F$, the phonon correction is smaller than
the Thomas-Fermi correction by $O(\hbar\omega_D/\mathcal{E}_F)(\hbar\omega_D/[\mathcal{E}_{\mathbf{k}} - \mathcal{E}_F])$.

(d) Show that when $\mathcal{E}_{\mathbf{k}} - \mathcal{E}_F$ is small compared with $\hbar\omega_D$, (26.57) reduces to (26.28) and
(26.29).

Because charge cannot flow freely in insulators, applied electric fields of substantial
amplitude  can  penetrate  into  their  interiors.  There  are  at  least  three  broad  contexts 
in  which  it  is  important  to  know  how  the  internal  structure  of  an  insulator,  both 
electronic  and  ionic,  readjusts  when  an  additional  electric  field  is  superimposed  on 
the  electric  field  associated  with  the  periodic  lattice  potential: 

1.  We  may  place  a  sample  of  the  insulator  in  a  static  electric  field  such  as  that 
existing  between  the  plates  of  a  capacitor.  Many  important  consequences  of  the 
resulting  internal  distortion  can  be  deduced  if  one  knows  the  static  dielectric 
constant $\epsilon_0$ of the crystal, whose calculation is therefore an important aim of any
microscopic  theory  of  insulators. 

2.  We  may  be  interested  in  the  optical  properties  of  the  insulator — i.e.,  in  its  response 
to  the  AC  electric  field  associated  with  electromagnetic  radiation.  In  this  case  the 
important quantity to calculate is the frequency-dependent dielectric constant
$\epsilon(\omega)$, or, equivalently, the index of refraction, $n = \sqrt{\epsilon}$.

3.  In  an  ionic  crystal,  even  in  the  absence  of  externally  applied  fields,  there  may  be 
long-range  electrostatic  forces  between  the  ions  in  addition  to  the  periodic  lattice 
potential,  when  the  lattice  is  deformed  from  its  equilibrium  configuration  (as, 
for  example,  in  the  course  of  executing  a  normal  mode).  Such  forces  are  often 
best  dealt  with  by  considering  the  additional  electric  field  giving  rise  to  them, 
whose  sources  are  intrinsic  to  the  crystal. 

In  dealing  with  any  of  these  phenomena  the  theory  of  the  macroscopic  Maxwell 
equations  in  a  medium  is  a  most  valuable  tool.  We  begin  with  a  review  of  the  electro¬ 
static  aspects  of  this  theory. 

MACROSCOPIC  MAXWELL  EQUATIONS  OF  ELECTROSTATICS 

When viewed on the atomic scale, the charge density $\rho^{\text{micro}}(\mathbf{r})$ of any insulator is a very
rapidly varying function of position, reflecting the microscopic atomic structure of
the insulator. On the same atomic scale the electrostatic potential $\phi^{\text{micro}}(\mathbf{r})$ and the
electric field $\mathbf{E}^{\text{micro}}(\mathbf{r}) = -\nabla\phi^{\text{micro}}(\mathbf{r})$ also have strong and rapid variations since they
are related to $\rho^{\text{micro}}(\mathbf{r})$ by

\[ \nabla \cdot \mathbf{E}^{\text{micro}}(\mathbf{r}) = 4\pi\rho^{\text{micro}}(\mathbf{r}). \tag{27.1} \]

On the other hand, in the conventional macroscopic electromagnetic theory of an
insulator the charge density $\rho(\mathbf{r})$, potential $\phi(\mathbf{r})$, and electric field $\mathbf{E}(\mathbf{r})$ show no such
rapid variation.1 Specifically, in the case of an insulator bearing no excess charge
beyond that of its component ions (or atoms or molecules), the macroscopic electrostatic
field is determined by the macroscopic Maxwell equation:2

\[ \nabla \cdot \mathbf{D}(\mathbf{r}) = 0, \tag{27.2} \]


1 Indeed, in an insulating medium in the absence of any externally applied fields, $\phi(\mathbf{r})$ is zero (or
constant).

2 More generally, one writes $\nabla \cdot \mathbf{D} = 4\pi\rho$, where $\rho$ is the so-called free charge, i.e., that part of the
macroscopic  charge  density  due  to  excess  charges  not  intrinsic  to  the  medium.  Throughout  the  following 
discussion  we  assume  that  there  is  no  free  charge,  so  that  our  macroscopic  charge  density  is  always  the 
so-called  bound  charge  of  macroscopic  electrostatics.  The  inclusion  of  free  charge  is  straightforward,  but 
not relevant to any of the applications we wish to make here.

in conjunction with the equation giving the macroscopic electric field E in terms of
the electric displacement D and polarization density P,

\[ \mathbf{D}(\mathbf{r}) = \mathbf{E}(\mathbf{r}) + 4\pi\mathbf{P}(\mathbf{r}). \tag{27.3} \]

These imply (in the absence of free charge) that the macroscopic electric field satisfies

\[ \nabla \cdot \mathbf{E}(\mathbf{r}) = -4\pi\nabla \cdot \mathbf{P}(\mathbf{r}), \tag{27.4} \]


where  P  (to  be  defined  in  detail  below)  is  generally  a  very  slowly  varying  function  of 
position  inside  an  insulator. 

Although  it  is  very  convenient  to  work  with  the  macroscopic  Maxwell  equations, 
it  is  also  essential  to  deal  with  the  microscopic  field  acting  on  individual  ions.3  One 
must  therefore  keep  the  relation  between  macroscopic  and  microscopic  quantities 
clearly  in  mind.  The  connection,  first  derived  by  Lorentz,  can  be  made  as  follows:4 

Suppose we have an insulator (not necessarily in its equilibrium configuration)
that is described (at an instant) by a microscopic charge density $\rho^{\text{micro}}(\mathbf{r})$, which reflects
the detailed atomic arrangement of electrons and nuclei and which gives rise to the
rapidly varying microscopic field, $\mathbf{E}^{\text{micro}}(\mathbf{r})$. The macroscopic field $\mathbf{E}(\mathbf{r})$ is defined to be
an average of $\mathbf{E}^{\text{micro}}$ over a region about r that is small on the macroscopic scale, but
large compared with characteristic atomic dimensions a (Figure 27.1). We make the
averaging procedure explicit by using a positive normalized weight function $f$,
satisfying:

\[ f(\mathbf{r}) \ge 0; \quad f(\mathbf{r}) = 0,\ r > r_0; \quad \int d\mathbf{r}\, f(\mathbf{r}) = 1; \quad f(-\mathbf{r}) = f(\mathbf{r}). \tag{27.5} \]

The distance $r_0$ beyond which $f$ vanishes is large compared with atomic dimensions


Figure  27.1 

The  value  of  a  macroscopic  quantity  at 
a  point  P  is  an  average  of  the  microscopic 
quantity  over  a  region  of  dimensions  r0 
in  the  neighborhood  of  P,  where  r0  is 
large  compared  to  the  interparticle 
spacing  a. 




3  We  continue  with  our  convention  of  using  the  single  term  “ion”  to  refer  to  the  ions  in  ionic  crystals, 
but  also  the  atoms  or  molecules  making  up  molecular  crystals. 

4 The following discussion is very similar to a derivation of all the macroscopic Maxwell equations
by G. Russakoff, Am. J. Phys. 38, 1188 (1970).

a, but small on the scale over which macroscopically defined quantities vary.5 We also
require that $f$ vary slowly; i.e., $|\nabla f|/f$ should not be appreciably greater than the
minimum value, of order $1/r_0$, required by Eqs. (27.5). Beyond these assumptions,
the form of the macroscopic theory is independent of the properties of the weight
function $f$.

We can now give a precise definition of the macroscopic electric field $\mathbf{E}(\mathbf{r})$ at the
point r: it is the average of the microscopic field in a region of radius $r_0$ about r, with
points displaced by $-\mathbf{r}'$ from r receiving a weight proportional to $f(\mathbf{r}')$; i.e.,

\[ \mathbf{E}(\mathbf{r}) = \int d\mathbf{r}'\, \mathbf{E}^{\text{micro}}(\mathbf{r} - \mathbf{r}') f(\mathbf{r}'). \tag{27.6} \]


Loosely  speaking,  the  operation  specified  by  (27.6)  washes  out  those  features  of  the 
microscopic  field  that  vary  rapidly  on  the  scale  of  r0,  and  preserves  those  features 
that vary slowly on the scale of $r_0$ (Figure 27.2). Note, for example, that if $\mathbf{E}^{\text{micro}}$ should
happen to vary slowly on the scale of $r_0$ (as would be the case if the point r were in
empty space, far from the insulator), then $\mathbf{E}(\mathbf{r})$ would equal $\mathbf{E}^{\text{micro}}(\mathbf{r})$.


Figure  27.2 

The  lighter  rapidly  oscillatory  curve  illustrates  the  characteristic  spatial  variation  of  a  micro¬ 
scopic  quantity.  The  heavier  curve  is  the  corresponding  macroscopic  quantity.  Only  spatial 
variations  that  occur  on  a  scale  comparable  to  or  larger  than  r0  are  preserved  in  the  macroscopic 
quantity. 
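
As an illustrative sketch of the averaging in (27.6), the following one-dimensional Python
example smooths a model microscopic quantity with a weight function obeying the conditions
(27.5). The raised-cosine form of the weight, the model quantity, and all numerical values are
assumptions chosen for illustration; none of them comes from the text.

```python
import math

def weight(x, r0):
    """1D analogue of f: non-negative, even, zero for |x| >= r0, unit integral."""
    if abs(x) >= r0:
        return 0.0
    return (1.0 + math.cos(math.pi * x / r0)) / (2.0 * r0)

def macroscopic(micro, x, r0, n=4000):
    """Midpoint-rule estimate of the integral of micro(x - x') f(x') over |x'| < r0."""
    h = 2.0 * r0 / n
    total = 0.0
    for i in range(n):
        xp = -r0 + (i + 0.5) * h
        total += micro(x - xp) * weight(xp, r0) * h
    return total

a = 1.0            # assumed "atomic" length
r0 = 20.0 * a      # assumed averaging radius, r0 >> a
lam = 2000.0 * a   # assumed macroscopic wavelength, lam >> r0

def slow_part(x):
    return math.cos(2.0 * math.pi * x / lam)

def micro(x):
    # slowly varying part plus a large, rapidly oscillating "atomic" part
    return slow_part(x) + 5.0 * math.cos(2.0 * math.pi * x / a)

x = 100.0
avg = macroscopic(micro, x, r0)
slow_only = slow_part(x)
print("microscopic value :", micro(x))
print("averaged value    :", avg)
print("slow part alone   :", slow_only)
```

In this sketch the averaged value should track the slowly varying part closely, while the
rapidly oscillating part is washed out, which is the behavior Figure 27.2 describes.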

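Equations (27.6) and (27.1) immediately imply that

\[ \nabla \cdot \mathbf{E}(\mathbf{r}) = \int d\mathbf{r}'\, \nabla \cdot \mathbf{E}^{\text{micro}}(\mathbf{r} - \mathbf{r}') f(\mathbf{r}') = 4\pi \int d\mathbf{r}'\, \rho^{\text{micro}}(\mathbf{r} - \mathbf{r}') f(\mathbf{r}'). \tag{27.7} \]

Therefore, to establish (27.4) we must show that

\[ \int d\mathbf{r}'\, \rho^{\text{micro}}(\mathbf{r} - \mathbf{r}') f(\mathbf{r}') = -\nabla \cdot \mathbf{P}(\mathbf{r}), \tag{27.8} \]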

where  P(r)  is  a  slowly  varying  function  that  can  be  interpreted  as  a  dipole  moment 
density. 


5 More precisely, the macroscopic Maxwell equations are valid only when the variation in the
macroscopic fields is sufficiently slow that their minimum characteristic wavelength $\lambda$ allows a choice of
$r_0$ satisfying $\lambda \gg r_0 \gg a$. This condition can be satisfied by the field associated with visible light ($\lambda \sim 10^4 a$),
but not by the field associated with X rays ($\lambda \sim a$).

We shall discuss only the case in which the microscopic charge density can be
resolved into a sum of contributions from ions (or atoms or molecules) located at
positions $\mathbf{r}_j$ characterized by individual charge distributions $\rho_j(\mathbf{r} - \mathbf{r}_j)$:

\[ \rho^{\text{micro}}(\mathbf{r}) = \sum_j \rho_j(\mathbf{r} - \mathbf{r}_j). \tag{27.9} \]

Such  a  resolution  is  quite  natural  in  ionic  or  molecular  solids,  but  is  rather  more 
difficult  to  achieve  in  covalent  crystals,  where  important  parts  of  the  electronic  charge 
distribution  are  not  readily  associated  with  a  particular  site  in  the  crystal.  Our  dis¬ 
cussion  is  therefore  primarily  applicable  to  the  former  two  insulating  categories.  Quite 
a  different  approach  is  required  to  calculate  the  dielectric  properties  of  covalent 
crystals.  We  shall  return  to  this  point  below. 

We are interested in nonequilibrium configurations of the insulator in which the
ions are displaced from their equilibrium positions $\mathbf{r}_j^0$ and are deformed from their
equilibrium shapes,6 which are described by charge densities $\rho_j^0$. Thus $\rho^{\text{micro}}(\mathbf{r})$ will
not, in general, be equal to the equilibrium microscopic charge density,

\[ \rho_0^{\text{micro}}(\mathbf{r}) = \sum_j \rho_j^0(\mathbf{r} - \mathbf{r}_j^0). \tag{27.10} \]

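Using (27.9) we can write (27.7) as:

\[ \nabla \cdot \mathbf{E}(\mathbf{r}) = 4\pi \sum_j \int d\mathbf{r}'\, \rho_j(\mathbf{r} - \mathbf{r}_j - \mathbf{r}') f(\mathbf{r}') = 4\pi \sum_j \int d\bar{\mathbf{r}}\, \rho_j(\bar{\mathbf{r}}) f(\mathbf{r} - \mathbf{r}_j^0 - (\bar{\mathbf{r}} + \boldsymbol{\Delta}_j)), \tag{27.11} \]

where $\boldsymbol{\Delta}_j = \mathbf{r}_j - \mathbf{r}_j^0$. The displacement $\boldsymbol{\Delta}_j$ of the jth ion from its equilibrium position
is a microscopic distance of order a or less. Furthermore the charge density $\rho_j(\bar{\mathbf{r}})$
vanishes when $\bar{r}$ exceeds a microscopic distance of order a. Since the variation in the
weight function $f$ is very small over distances of order a, we can expand (27.11) in
what is effectively a series in powers of $a/r_0$ by using the Taylor expansion:

\[ f(\mathbf{r} - \mathbf{r}_j^0 - (\bar{\mathbf{r}} + \boldsymbol{\Delta}_j)) = \sum_{n=0}^{\infty} \frac{1}{n!} \left[-(\bar{\mathbf{r}} + \boldsymbol{\Delta}_j) \cdot \nabla\right]^n f(\mathbf{r} - \mathbf{r}_j^0). \tag{27.12} \]

If we substitute the first two terms7 from (27.12) into (27.11) we find that

\[ \nabla \cdot \mathbf{E}(\mathbf{r}) = 4\pi \left[ \sum_j e_j f(\mathbf{r} - \mathbf{r}_j^0) - \sum_j (\mathbf{p}_j + e_j \boldsymbol{\Delta}_j) \cdot \nabla f(\mathbf{r} - \mathbf{r}_j^0) \right], \tag{27.13} \]

where

\[ e_j = \int d\bar{\mathbf{r}}\, \rho_j(\bar{\mathbf{r}}), \quad \mathbf{p}_j = \int d\bar{\mathbf{r}}\, \rho_j(\bar{\mathbf{r}})\, \bar{\mathbf{r}}. \tag{27.14} \]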


6 We have in mind applications (a) to monatomic Bravais lattices (in which the $\mathbf{r}_j^0$ are just the Bravais
lattice vectors R and all of the functions $\rho_j^0$ are identical), and (b) to lattices with a basis, in which the $\mathbf{r}_j^0$
run through all vectors R, R + d, etc., and there are as many distinct functional forms for the $\rho_j^0$ as there
are distinct types of ions in the basis.

7 We shall find that the first (n = 0) term makes no contribution to (27.11), and we must therefore
retain the next (n = 1) term to get the leading contribution.

The quantities $e_j$ and $\mathbf{p}_j$ are simply the total charge and dipole moment of the jth ion.

In the case of a monatomic Bravais lattice the charge of each “ion” must be zero
(since the crystal is neutral and all “ions” are identical). In addition the equilibrium
positions $\mathbf{r}_j^0$ are the Bravais lattice sites R, so (27.13) reduces to

\[ \nabla \cdot \mathbf{E}(\mathbf{r}) = -4\pi \nabla \cdot \sum_{\mathbf{R}} f(\mathbf{r} - \mathbf{R})\, \mathbf{p}(\mathbf{R}), \tag{27.15} \]

where p(R) is the dipole moment of the atom at site R.

With a straightforward generalization of the definition of p(R), this result remains
valid (to leading order in $a/r_0$) even when we allow for ionic charge and a polyatomic
basis. To see this, suppose that the $\mathbf{r}_j^0$ now run through the sites R + d of a lattice
with a basis. We can then label $\mathbf{p}_j$ and $e_j$ by the Bravais lattice vector R and basis
vector d specifying the equilibrium position of the jth ion:8

\[ \mathbf{p}_j \to \mathbf{p}(\mathbf{R}, \mathbf{d}), \quad e_j \to e(\mathbf{d}), \quad \mathbf{r}_j^0 \to \mathbf{R} + \mathbf{d}, \quad \boldsymbol{\Delta}_j \to \mathbf{u}(\mathbf{R}, \mathbf{d}). \tag{27.16} \]

Since d is a microscopic length of order a, we can perform the further expansion:

\[ f(\mathbf{r} - \mathbf{R} - \mathbf{d}) \approx f(\mathbf{r} - \mathbf{R}) - \mathbf{d} \cdot \nabla f(\mathbf{r} - \mathbf{R}). \tag{27.17} \]

Substituting this into (27.13) and dropping terms of higher than linear order in $a/r_0$,
we again recover (27.15), where p(R) is now the dipole moment of the entire primitive
cell9 associated with R:

\[ \mathbf{p}(\mathbf{R}) = \sum_{\mathbf{d}} \left[ e(\mathbf{d})\, \mathbf{u}(\mathbf{R}, \mathbf{d}) + \mathbf{p}(\mathbf{R}, \mathbf{d}) \right]. \tag{27.18} \]

Comparing (27.15) with the macroscopic Maxwell equation (27.4), we find that
the two are consistent if the polarization density is defined by

\[ \mathbf{P}(\mathbf{r}) = \sum_{\mathbf{R}} f(\mathbf{r} - \mathbf{R})\, \mathbf{p}(\mathbf{R}). \tag{27.19} \]

If we are dealing with distortions from equilibrium whose form does not vary much
from cell to cell on the microscopic scale, then p(R) will vary only slowly from cell
to cell, and we can evaluate (27.19) as an integral:

\[ \mathbf{P}(\mathbf{r}) = \frac{1}{v} \sum_{\mathbf{R}} v\, f(\mathbf{r} - \mathbf{R})\, \mathbf{p}(\mathbf{R}) \approx \frac{1}{v} \int d\bar{\mathbf{r}}\, f(\mathbf{r} - \bar{\mathbf{r}})\, \mathbf{p}(\bar{\mathbf{r}}), \tag{27.20} \]

where $\mathbf{p}(\bar{\mathbf{r}})$ is a smooth, slowly varying continuous function equal to the polarization
of the cells in the immediate vicinity of $\bar{\mathbf{r}}$, and v is the volume of the equilibrium
primitive cell.


8 Ions separated by Bravais lattice vectors have the same total charge, so $e_j$ depends only on d, and
not on R.

9 In deriving (27.18) we have used the fact that the total charge of the primitive cell, $\sum_{\mathbf{d}} e(\mathbf{d})$, vanishes.
We have also neglected an additional term, $\sum_{\mathbf{d}} \mathbf{d}\, e(\mathbf{d})$, which is the dipole moment of the primitive cell in
the undistorted equilibrium crystal. In most crystals this term vanishes for the most natural choices of
primitive cell. If it did not vanish, the crystal would have a polarization density in equilibrium in the
absence of distorting forces or external electric fields. Such crystals do exist, and are known as pyroelectrics.
We shall discuss them later in this chapter, where we shall also make clearer what is meant by “most
natural choices of primitive cell” (see page 554).

We shall restrict our use of the macroscopic Maxwell equations to situations in
which the variation in cellular polarization is appreciable only over distances large
compared with the dimensions $r_0$ of the averaging region; this is certainly the case
for fields whose wavelengths are in the visible part of the spectrum or longer. Since
the integrand in (27.20) vanishes when $\bar{\mathbf{r}}$ is more than $r_0$ from r, then if $\mathbf{p}(\bar{\mathbf{r}})$ varies
negligibly over a distance $r_0$ from $\bar{\mathbf{r}}$, we can replace $\mathbf{p}(\bar{\mathbf{r}})$ by $\mathbf{p}(\mathbf{r})$, and bring it outside
the integral to obtain:

\[ \mathbf{P}(\mathbf{r}) = \frac{\mathbf{p}(\mathbf{r})}{v} \int d\bar{\mathbf{r}}\, f(\mathbf{r} - \bar{\mathbf{r}}). \tag{27.21} \]

Since $\int d\mathbf{r}'\, f(\mathbf{r}') = 1$, we finally have

\[ \mathbf{P}(\mathbf{r}) = \frac{1}{v}\, \mathbf{p}(\mathbf{r}); \tag{27.22} \]

i.e., provided the dipole moment of each cell varies appreciably only on the macroscopic
scale, then the macroscopic Maxwell equation (27.4) holds with the polarization
density P(r) defined to be the dipole moment of a primitive cell in the neighborhood
of r, divided by its equilibrium volume.10


THEORY  OF  THE  LOCAL  FIELD 

To  exploit  macroscopic  electrostatics,  a  theory  is  required  relating  the  polarization 
density  P  back  to  the  macroscopic  electric  field  E.  Since  each  ion  has  microscopic 
dimensions,  its  displacement  and  distortion  will  be  determined  by  the  force  due  to 
the  microscopic  field  at  the  position  of  the  ion,  diminished  by  the  contribution  to  the 
microscopic field from the ion itself. This field is frequently called the local (or effective)
field, $\mathbf{E}^{\text{loc}}(\mathbf{r})$.

We  can  exploit  macroscopic  electrostatics  to  simplify  the  evaluation  of  Eloc(r)  by 
dividing  space  into  regions  near  to  and  far  from  r.  The  far  region  is  to  contain  all 
external  sources  of  field,  all  points  outside  the  crystal,  and  only  points  inside  the 
crystal  that  are  far  from  r  compared  with  the  dimensions  r0  of  the  averaging  region 
used in Eq. (27.6). All other points are said to be in the near region (Figure 27.3).
The  reason  for  this  division  is  that  the  contribution  to  Eloc(r)  of  all  charge  in  the 
far  region  will  vary  negligibly  over  a  distance  r0  about  r,  and  would  be  unaffected 
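if we were to apply the averaging procedure specified in (27.6). Therefore the contribution
to $\mathbf{E}^{\text{loc}}(\mathbf{r})$ of all charge in the far region is just the macroscopic field, $\mathbf{E}^{\text{macro}}_{\text{far}}(\mathbf{r})$,
that would exist at r if only the charge in the far region were present:

\[ \mathbf{E}^{\text{loc}}(\mathbf{r}) = \mathbf{E}^{\text{loc}}_{\text{near}}(\mathbf{r}) + \mathbf{E}^{\text{micro}}_{\text{far}}(\mathbf{r}) = \mathbf{E}^{\text{loc}}_{\text{near}}(\mathbf{r}) + \mathbf{E}^{\text{macro}}_{\text{far}}(\mathbf{r}). \tag{27.23} \]

Now E(r), the full macroscopic field at r, is constructed by averaging the microscopic
field within $r_0$ of r due to all charges, in both the near and the far regions; i.e.,

\[ \mathbf{E}(\mathbf{r}) = \mathbf{E}^{\text{macro}}_{\text{far}}(\mathbf{r}) + \mathbf{E}^{\text{macro}}_{\text{near}}(\mathbf{r}), \tag{27.24} \]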


10 The derivation of this intuitive result permits us to estimate corrections when required.

Figure 27.3

In calculating the local field at a point r it is convenient to consider separately contributions from
the far region (i.e., all the crystal outside the sphere of radius $r_1$ about r and all external sources
of field) and from the near region (i.e., all points within the sphere about r). The far region is taken
to be far from r on the scale of the averaging length $r_0$, to ensure that the microscopic field due
to charges in the far region is equal to its macroscopic average.


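where $\mathbf{E}^{\text{macro}}_{\text{near}}(\mathbf{r})$ is the macroscopic field that would exist at r if only the charges in
the near region11 were present. We can therefore rewrite Eq. (27.23) as:

\[ \mathbf{E}^{\text{loc}}(\mathbf{r}) = \mathbf{E}(\mathbf{r}) + \mathbf{E}^{\text{loc}}_{\text{near}}(\mathbf{r}) - \mathbf{E}^{\text{macro}}_{\text{near}}(\mathbf{r}). \tag{27.25} \]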


Thus  we  have  related  the  unknown  local  field  at  r  to  the  macroscopic  electric  field12 
at  r  and  additional  terms  that  depend  only  on  the  configuration  of  charges  in  the 
near  region. 

We shall apply (27.25) only to nonequilibrium configurations of the crystal with
negligible spatial variation from cell to cell over distances of order $r_1$, the size of the
near region.13 In such cases $\mathbf{E}^{\text{macro}}_{\text{near}}(\mathbf{r})$ will be the macroscopic field due to a uniformly
polarized medium, whose shape is that of the near region. If we choose the near region


11  Including,  of  course,  the  ion  on  which  we  are  calculating  the  force. 

12 A further complication of a purely macroscopic nature is peripheral to the argument here, in which
E(r) is assumed to be given. If the internal field and polarization are produced by placing the sample in
a specified field $\mathbf{E}^{\text{ext}}$, then an additional problem in macroscopic electrostatics must be solved to determine
the macroscopic field E in the interior of the sample, since the discontinuity in the polarization density
P at the surface of the sample acts as a bound surface charge, and contributes an additional term to the
macroscopic field in the interior. For certain simply shaped samples in uniform external fields the induced
polarization P and the macroscopic field E in the interior will both be constant and parallel to $\mathbf{E}^{\text{ext}}$, and
one can write: $\mathbf{E} = \mathbf{E}^{\text{ext}} - N\mathbf{P}$, where N, the “depolarization factor,” depends on the geometry of the
sample. The most important elementary case is the sphere, for which $N = 4\pi/3$. For a general ellipsoid
(in which P need not be parallel to E) see E. C. Stoner, Phil. Mag. 36, 803 (1945).

13 Note that we are now very macroscopic indeed, requiring that $\lambda \gg r_1 \gg r_0 \gg a$.

to be a sphere, then this field is given by the following elementary result of electrostatics
(see Problem 1): The macroscopic field anywhere inside a uniformly polarized
sphere is just $\mathbf{E} = -4\pi\mathbf{P}/3$, where P is the polarization density. Therefore if the
near region is a sphere over which P has negligible spatial variation, then Eq. (27.25)
becomes

\[ \mathbf{E}^{\text{loc}}(\mathbf{r}) = \mathbf{E}(\mathbf{r}) + \mathbf{E}^{\text{loc}}_{\text{near}}(\mathbf{r}) + \frac{4\pi\mathbf{P}(\mathbf{r})}{3}. \tag{27.26} \]


We are thus left with the problem of calculating the microscopic local field $\mathbf{E}^{\text{loc}}_{\text{near}}(\mathbf{r})$
appropriate  to  a  spherical  region  whose  center  is  taken  to  be  the  ion  on  which  the 
field  acts.  Inside  this  region  the  charge  density  is  the  same  in  every  cell  (except  for 
the  removal  of  the  ion  at  the  center  on  which  we  are  calculating  the  force).  In  most 
applications  this  calculation  is  done  under  the  following  simplifying  assumptions : 

1.  The  spatial  dimensions  and  the  displacement  from  equilibrium  of  each  ion  are 
considered  to  be  so  small  that  the  polarizing  field  acting  on  it  can  be  taken  to 
be uniform over the whole ion and equal to the value of $\mathbf{E}^{\text{loc}}$ at the equilibrium
position  of  the  ion. 

2.  The  spatial  dimensions  and  the  displacement  from  equilibrium  of  each  ion  are 
considered  to  be  so  small  that  the  contribution  to  the  local  field  at  the  equilibrium 
position  of  the  given  ion,  from  the  ion  whose  equilibrium  position  is  R  +  d,  is 
accurately given by the field of a dipole of moment $e(\mathbf{d})\mathbf{u}(\mathbf{R} + \mathbf{d}) + \mathbf{p}(\mathbf{R} + \mathbf{d})$.

Since the dipole moments of ions at equivalent sites (displaced from each other
by Bravais lattice vectors R) are identical within the near region over which P has
negligible variation, the calculation of $\mathbf{E}^{\text{loc}}_{\text{near}}$ at an equilibrium site reduces to the type
of lattice sum we described in Chapter 20. Furthermore, in the special case in which
every equilibrium site in the equilibrium crystal is a center of cubic symmetry, it is
easily shown (Problem 2) that this lattice sum must vanish; i.e., $\mathbf{E}^{\text{loc}}_{\text{near}}(\mathbf{r}) = 0$ at every
equilibrium site. Since this case includes both the solid noble gases and the alkali
halides, it is the only one we consider. For these crystals we may assume that the
field polarizing each ion in the neighborhood of r is14


\[ \mathbf{E}^{\text{loc}}(\mathbf{r}) = \mathbf{E}(\mathbf{r}) + \frac{4\pi\mathbf{P}(\mathbf{r})}{3}. \tag{27.27} \]


This  result,  sometimes  known  as  the  Lorentz  relation,  is  widely  used  in  theories  of 
dielectrics.  It  is  very  important  to  remember  the  assumptions  underlying  it,  par¬ 
ticularly  that  of  cubic  symmetry  about  every  equilibrium  site. 
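
The vanishing of the near-region lattice sum under cubic symmetry can be checked numerically.
The sketch below is only an illustration under stated assumptions: it places identical point
dipoles on a simple cubic lattice (lattice constant 1, sites within an assumed cutoff radius)
and sums their fields at the origin, then repeats the sum for a hypothetical tetragonally
stretched lattice, for which no cancellation is expected. The function name, cutoff, and
`c_over_a` parameter are illustrative choices, not part of the text.

```python
def near_field(p, c_over_a=1.0, radius=6.0):
    """Field at the origin from identical dipoles p on sites (i, j, k*c_over_a),
    excluding the origin, for all sites within `radius` (Gaussian units):
    E = sum over R of [3 (p.R) R - R^2 p] / R^5."""
    px, py, pz = p
    ex = ey = ez = 0.0
    nxy = int(radius) + 1
    nz = int(radius / c_over_a) + 1
    for i in range(-nxy, nxy + 1):
        for j in range(-nxy, nxy + 1):
            for k in range(-nz, nz + 1):
                if i == 0 and j == 0 and k == 0:
                    continue  # the ion on which the field acts is excluded
                x, y, z = float(i), float(j), k * c_over_a
                r2 = x * x + y * y + z * z
                if r2 > radius * radius:
                    continue  # keep a spherical near region
                r5 = r2 ** 2.5
                p_dot_r = px * x + py * y + pz * z
                ex += (3.0 * p_dot_r * x - r2 * px) / r5
                ey += (3.0 * p_dot_r * y - r2 * py) / r5
                ez += (3.0 * p_dot_r * z - r2 * pz) / r5
    return (ex, ey, ez)

cubic = near_field((0.3, -0.5, 0.8))                     # cubic site symmetry
tetragonal = near_field((0.0, 0.0, 1.0), c_over_a=1.2)   # symmetry broken along z
print("cubic lattice sum      :", cubic)
print("tetragonal lattice sum :", tetragonal)
```

In the cubic case every component should cancel to within rounding error, whereas the
stretched lattice leaves a finite field along z, which is why (27.27) cannot be taken over
unchanged for sites lacking cubic symmetry.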

Sometimes (27.27) is written in terms of the dielectric constant $\epsilon$ of the medium,
using the constitutive relation15

\[ \mathbf{D}(\mathbf{r}) = \epsilon\mathbf{E}(\mathbf{r}), \tag{27.28} \]


14  Note  that  implicit  in  this  relation  is  the  fact  that  the  local  field  acting  on  an  ion  depends  only 
on  the  general  location  of  the  ion  but  not  (in  a  lattice  with  a  basis)  on  the  type  of  ion  (i.e.,  it  depends  on 
R  but  not  on  d).  This  convenient  simplification  is  a  consequence  of  our  assumption  that  every  ion  occupies 
a position of cubic symmetry.

15 In noncubic crystals P, and therefore D, need not be parallel to E, so $\epsilon$ is a tensor.

together with the relation (27.3) between D, E, and P, to express P(r) in terms of E(r):

\[ \mathbf{P}(\mathbf{r}) = \frac{\epsilon - 1}{4\pi}\, \mathbf{E}(\mathbf{r}). \tag{27.29} \]

Using this to eliminate P(r) from (27.27), one finds that

\[ \mathbf{E}^{\text{loc}}(\mathbf{r}) = \frac{\epsilon + 2}{3}\, \mathbf{E}(\mathbf{r}). \tag{27.30} \]

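Yet another way of expressing the same result is in terms of the polarizability, $\alpha$,
of the medium. The polarizability $\alpha(\mathbf{d})$ of the type of ion at position d in the basis
is defined to be the ratio of its induced dipole moment to the field actually acting
on it. Thus

\[ \mathbf{p}(\mathbf{R} + \mathbf{d}) + e\,\mathbf{u}(\mathbf{R} + \mathbf{d}) = \alpha(\mathbf{d})\, \mathbf{E}^{\text{loc}}(\mathbf{r})\big|_{\mathbf{r} \approx \mathbf{R}}. \tag{27.31} \]

The polarizability $\alpha$ of the medium is defined as the sum of the polarizabilities of
the ions in a primitive cell:

\[ \alpha = \sum_{\mathbf{d}} \alpha(\mathbf{d}). \tag{27.32} \]

Since (cf. (27.18) and (27.22)),

\[ \mathbf{P}(\mathbf{r}) = \frac{1}{v} \sum_{\mathbf{d}} \left[ \mathbf{p}(\mathbf{R}, \mathbf{d}) + e(\mathbf{d})\, \mathbf{u}(\mathbf{R}, \mathbf{d}) \right]_{\mathbf{R} \approx \mathbf{r}}, \tag{27.33} \]

it follows that

\[ \mathbf{P}(\mathbf{r}) = \frac{\alpha}{v}\, \mathbf{E}^{\text{loc}}(\mathbf{r}). \tag{27.34} \]

Using (27.29) and (27.30) to express both P and $\mathbf{E}^{\text{loc}}$ in terms of E, we find that (27.34)
implies that

\[ \frac{\epsilon - 1}{\epsilon + 2} = \frac{4\pi\alpha}{3v}. \tag{27.35} \]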

This  equation,  known  as  the  Clausius-Mossotti  relation,16  provides  a  valuable 
link  between  macroscopic  and  microscopic  theories.  A  microscopic  theory  is  required 
to calculate $\alpha$, which gives the response of the ions to the actual field $\mathbf{E}^{\text{loc}}$ acting on
them. The resulting $\epsilon$ can then be used, in conjunction with the macroscopic Maxwell
equations,  to  predict  the  optical  properties  of  the  insulator. 
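
As a hedged numerical sketch of how the Clausius-Mossotti relation links the two descriptions,
the Python functions below solve (27.35), $(\epsilon - 1)/(\epsilon + 2) = 4\pi\alpha/3v$, for
$\epsilon$ given $\alpha$ and the cell volume v, and invert it again. The function names are
illustrative. The polarizability used in the example is the argon entry of Table 27.1; the cell
volume is an assumed round number of roughly the right size for a solid noble gas, not a value
taken from the text.

```python
import math

def epsilon_from_alpha(alpha, v):
    """Dielectric constant from (eps - 1)/(eps + 2) = 4*pi*alpha/(3*v) (Gaussian units).
    alpha and v must be in the same volume units (e.g. cm^3)."""
    x = 4.0 * math.pi * alpha / (3.0 * v)
    if x >= 1.0:
        raise ValueError("4*pi*alpha/(3*v) must be below 1 for a finite positive eps")
    return (1.0 + 2.0 * x) / (1.0 - x)

def alpha_from_epsilon(eps, v):
    """Inverse relation: polarizability per primitive cell from a measured eps."""
    return 3.0 * v * (eps - 1.0) / (4.0 * math.pi * (eps + 2.0))

ALPHA_AR = 1.6e-24   # cm^3, argon entry of Table 27.1
V_CELL = 3.7e-23     # cm^3, assumed illustrative volume per atom

eps_ar = epsilon_from_alpha(ALPHA_AR, V_CELL)
print("estimated dielectric constant:", eps_ar)
print("recovered polarizability     :", alpha_from_epsilon(eps_ar, V_CELL))
```

Note how the denominator $1 - 4\pi\alpha/3v$ makes $\epsilon$ grow faster than linearly in
$\alpha$; this is the local-field enhancement expressed by (27.30).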

THEORY  OF  THE  POLARIZABILITY 

Two terms contribute to the polarizability $\alpha$. The contribution from p (see Eq. (27.31)),
the  “atomic  polarizability,”  arises  from  the  distortion  of  the  ionic  charge  distribution. 

16 When written in terms of the index of refraction, $n = \sqrt{\epsilon}$, the Clausius-Mossotti relation is known
as the Lorentz-Lorenz relation. (In the recent physics and chemistry literature of England and the United
States it has become the widespread practice to misspell the last name of O. F. Mossotti with a single
“s” and/or to interchange his initials.)

The contribution from $e\mathbf{u}$, the “displacement polarizability,” arises from ionic displacements.
There is no displacement polarizability in molecular crystals where the
“ions” are uncharged, but in ionic crystals it is comparable to the atomic polarizability.


Atomic  Polarizability 


We allow the local field acting on the ion in question to be frequency-dependent,
writing

\[ \mathbf{E}^{\text{loc}} = \text{Re}\,(\mathbf{E}_0 e^{-i\omega t}), \tag{27.36} \]

where $\mathbf{E}_0$ is independent of position (assumption 1, page 541). The simplest classical
theory of atomic polarizability treats the ion as an electronic shell of charge $Z_i e$
and mass $Z_i m$ tied to a heavy, immobile, undeformable ion core, by a harmonic
spring, of spring constant $K = Z_i m\omega_0^2$ (Figure 27.4). If the displacement of the shell
from its equilibrium position is given by

\[ \mathbf{r} = \text{Re}\,(\mathbf{r}_0 e^{-i\omega t}), \tag{27.37} \]

then the equation of motion of the shell,

\[ Z_i m\ddot{\mathbf{r}} = -K\mathbf{r} - Z_i e\mathbf{E}^{\text{loc}}, \tag{27.38} \]

implies that

\[ \mathbf{r}_0 = -\frac{e\mathbf{E}_0}{m(\omega_0^2 - \omega^2)}. \tag{27.39} \]

Since the induced dipole moment is $\mathbf{p} = -Z_i e\mathbf{r}$, we have

\[ \mathbf{p} = \text{Re}\,(\mathbf{p}_0 e^{-i\omega t}), \tag{27.40} \]

with

\[ \mathbf{p}_0 = \frac{Z_i e^2}{m(\omega_0^2 - \omega^2)}\, \mathbf{E}_0. \tag{27.41} \]

Defining the frequency-dependent atomic polarizability by

\[ \mathbf{p}_0 = \alpha^{\text{at}}(\omega)\mathbf{E}_0, \tag{27.42} \]

we have

\[ \alpha^{\text{at}}(\omega) = \frac{Z_i e^2}{m(\omega_0^2 - \omega^2)}. \tag{27.43} \]

Figure 27.4

Crude classical model of atomic polarizability.
The ion is represented as a charged shell of
charge $Z_i e$ and mass $Z_i m$ tied to an immobile
nucleus by a spring of force constant $K = Z_i m\omega_0^2$.


The  model  leading  to  (27.43)  is,  of  course,  very  crude.  However,  for  our  purposes 
the most important feature of the result is that if $\omega$ is small compared with $\omega_0$, the
polarizability will be independent of frequency and equal to its static value,

\[ \alpha^{\text{at}} = \frac{Z_i e^2}{m\omega_0^2}. \tag{27.44} \]

We would expect $\omega_0$, the frequency of vibration of the electronic shell, to be of the
order of an atomic excitation energy divided by $\hbar$. This suggests that, unless $\hbar\omega$ is
of the order of several electron volts, we can take the atomic polarizability to be
independent of frequency. This is confirmed by more accurate quantum-mechanical
calculations of $\alpha$.

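Note that we can also use (27.44) to estimate the frequency below which $\alpha^{\text{at}}$ will
be frequency-independent, in terms of the observed static polarizabilities:

\[ \begin{aligned} \hbar\omega_0 &= \sqrt{\frac{\hbar^2 Z_i e^2}{m\alpha^{\text{at}}}} = \sqrt{\frac{4a_0^3 Z_i}{\alpha^{\text{at}}}}\; \frac{e^2}{2a_0}, \qquad a_0 = \frac{\hbar^2}{me^2}, \\ \hbar\omega_0 &= \sqrt{Z_i\, \frac{10^{-24}\ \text{cm}^3}{\alpha^{\text{at}}}} \times 10.5\ \text{eV}. \end{aligned} \tag{27.45} \]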


Since the measured polarizabilities (see Table 27.1) are of the order of $10^{-24}$ cm$^3$, we
conclude  that  the  frequency  dependence  of  the  atomic  polarizability  will  not  come  into 
play  (in  all  but  the  most  highly  polarizable  of  ions)  until  frequencies  corresponding 
to  ultraviolet  radiation. 
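
The estimate (27.45) is easy to evaluate numerically. The following Python sketch assumes the
standard values $a_0 \approx 0.529 \times 10^{-8}$ cm and $e^2/2a_0 \approx 13.6$ eV; the function
name and the choice $Z_i = 1$ in the example are illustrative assumptions, since the text does
not specify how many electrons to assign to the shell of a given ion.

```python
import math

A0_CM = 0.529e-8     # Bohr radius in cm (standard value, assumed here)
RYDBERG_EV = 13.6    # e^2 / (2 a0) in eV (standard value, assumed here)

def hbar_omega0_ev(z_i, alpha_cm3):
    """Shell resonance energy from (27.45):
    hbar*omega0 = sqrt(4 a0^3 Z_i / alpha) * e^2/(2 a0)."""
    if alpha_cm3 <= 0.0:
        raise ValueError("polarizability must be positive")
    return math.sqrt(4.0 * A0_CM ** 3 * z_i / alpha_cm3) * RYDBERG_EV

# Reference case of (27.45): Z_i = 1 and alpha = 1e-24 cm^3
reference = hbar_omega0_ev(1, 1.0e-24)
print("hbar*omega0 for Z_i = 1, alpha = 1e-24 cm^3 (eV):", reference)

# Table 27.1 polarizabilities (units of 1e-24 cm^3), with Z_i = 1 as a placeholder
for name, alpha in [("He", 0.2), ("Ar", 1.6), ("Xe", 4.0)]:
    print(name, hbar_omega0_ev(1, alpha * 1.0e-24))
```

Because the estimate scales as $(Z_i/\alpha^{\text{at}})^{1/2}$, even the most polarizable entries of
Table 27.1 give resonance energies of several electron volts in this crude model.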


Table 27.1

ATOMIC POLARIZABILITIES OF THE HALOGEN IONS,
NOBLE GAS ATOMS, AND ALKALI METAL IONS (a)

HALOGENS       NOBLE GASES    ALKALI METALS
               He   0.2       Li⁺   0.03
F⁻    1.2      Ne   0.4       Na⁺   0.2
Cl⁻   3        Ar   1.6       K⁺    0.9
Br⁻   4.5      Kr   2.5       Rb⁺   1.7
I⁻    7        Xe   4.0       Cs⁺   2.5

(a) In units of $10^{-24}$ cm$^3$. Note that entries in the same row have the
same electronic shell structure, but increasing nuclear charge.

Source: A. Dalgarno, Advances Phys. 11, 281 (1961).

Displacement  Polarizability 

In ionic crystals we must consider the dipole moment due to the displacement of the
charged ions by the electric field, in addition to the atomic polarization resulting
from the deformation of their electronic shells by the field. We begin by ignoring
the atomic polarization (rigid-ion approximation). To simplify the discussion we also
consider only crystals with two ions per primitive cell, of charges e and −e. If the
ions are undeformable, then the dipole moment of the primitive cell is just

\[ \mathbf{p} = e\mathbf{w}, \quad \mathbf{w} = \mathbf{u}^+ - \mathbf{u}^-, \tag{27.46} \]

where $\mathbf{u}^{\pm}$ is the displacement of the positive or negative ion from its equilibrium
position.

To determine w(r) we note that the long-range electrostatic forces between ions
are already contained in the field $\mathbf{E}^{\text{loc}}$. The remaining short-range interionic forces
(e.g., higher-order electrostatic multipole moments and core-core repulsion) will fall
off rapidly with distance, and we may assume that they produce a restoring force
for an ion at r that depends only on the displacement of the ions in its vicinity. Since
we are considering disturbances that vary slowly on the atomic scale, in the vicinity
of r all ions of the same charge move as a whole with the same displacement, $\mathbf{u}^+(\mathbf{r})$
or $\mathbf{u}^-(\mathbf{r})$. Thus the short-range part of the restoring force acting on an ion at r will
simply be proportional to17 the relative displacement $\mathbf{w}(\mathbf{r}) = \mathbf{u}^+(\mathbf{r}) - \mathbf{u}^-(\mathbf{r})$ of the
two oppositely charged sublattices in the neighborhood of r.

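Consequently in a distortion of the crystal with slow spatial variation on the
microscopic scale, the displacements of the positive and negative ions satisfy equations
of the form:

$$M_+\ddot{\mathbf{u}}^+ = -k(\mathbf{u}^+ - \mathbf{u}^-) + e\mathbf{E}^{\rm loc},$$

$$M_-\ddot{\mathbf{u}}^- = -k(\mathbf{u}^- - \mathbf{u}^+) - e\mathbf{E}^{\rm loc}, \tag{27.47}$$

which can be written

$$\ddot{\mathbf{w}} = \frac{e}{M}\mathbf{E}^{\rm loc} - \frac{k}{M}\mathbf{w}, \tag{27.48}$$

where $M$ is the ionic reduced mass, $M^{-1} = (M_+)^{-1} + (M_-)^{-1}$. Letting $\mathbf{E}^{\rm loc}$ be an
AC field of the form (27.36), we find that

$$\mathbf{w} = \mathrm{Re}\,(\mathbf{w}_0 e^{-i\omega t}), \qquad \mathbf{w}_0 = \frac{e\mathbf{E}_0/M}{\bar\omega^2 - \omega^2}, \tag{27.49}$$

where

$$\bar\omega^2 = \frac{k}{M}. \tag{27.50}$$

Accordingly,

$$\alpha^{\rm dis} = \frac{p_0}{E_0} = \frac{e w_0}{E_0} = \frac{e^2}{M(\bar\omega^2 - \omega^2)}. \tag{27.51}$$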

Note that the displacement polarizability (27.51) has the same form as the atomic
polarizability (27.43). However the resonant frequency $\bar\omega$ is characteristic of lattice
vibrational frequencies, and therefore $\hbar\bar\omega \approx \hbar\omega_D \approx 10^{-1}$ to $10^{-2}$ eV. This can be
$10^2$ to $10^3$ times smaller than the atomic frequency $\omega_0$, and therefore, in contrast
to the atomic polarizability, the displacement polarizability has a significant frequency
dependence in the infrared and optical range.
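As a rough numerical illustration of this contrast, the sketch below evaluates the frequency-dependent factor $1/(1 - \omega^2/\omega_{\rm res}^2)$ that multiplies the static value of a polarizability of the form (27.51). The function name and the sample energies ($\hbar\bar\omega = 0.03$ eV for the lattice resonance, $\hbar\omega_0 = 10$ eV for the atomic resonance, and photon energies of 0.02 eV and 2 eV) are illustrative assumptions, not values taken from the text.

```python
def relative_polarizability(hbar_omega, hbar_omega_res):
    """Return alpha(omega)/alpha(0) = 1 / (1 - omega^2/omega_res^2).

    Both arguments are energies in the same units (eV here); the
    resonance energy hbar_omega_res is an assumed input.
    """
    x = (hbar_omega / hbar_omega_res) ** 2
    return 1.0 / (1.0 - x)


HBAR_OMEGA_BAR = 0.03   # eV, assumed lattice (displacement) resonance
HBAR_OMEGA_0 = 10.0     # eV, assumed atomic resonance

for label, photon in [("infrared", 0.02), ("visible", 2.0)]:
    dis = relative_polarizability(photon, HBAR_OMEGA_BAR)
    at = relative_polarizability(photon, HBAR_OMEGA_0)
    print(f"{label:8s} displacement: {dis:+.4g}   atomic: {at:+.4g}")
```

With these assumed numbers the displacement factor changes substantially (and changes sign) between the infrared and the visible, while the atomic factor stays within a few percent of unity.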


17  The  proportionality  constant  in  general  will  be  a  tensor,  but  it  reduces  to  a  constant  in  a  crystal 
of  cubic  symmetry,  the  only  case  we  consider  here. 
Note also that because the ionic mass $M$ is about $10^4$ times the electronic mass
$m$, the static ($\omega = 0$) atomic and displacement polarizabilities may well be of the
same size. This means that the rigid-ion model we have used is unjustifiable, and
(27.51) must be corrected to take into account the atomic polarizability of the ions
as well. The most naive way to do this is simply to add the two types of contribution
to the polarizability:

$$\alpha = (\alpha^+ + \alpha^-) + \frac{e^2}{M(\bar\omega^2 - \omega^2)}, \tag{27.52}$$

where $\alpha^+$ and $\alpha^-$ are the atomic polarizabilities of the positive and negative ions.
There  is  no  real  justification  for  this,  since  the  first  term  in  (27.52)  was  calculated 
on  the  assumption  that  the  ions  were  immobile  but  polarizable,  while  the  second 
was  calculated  for  ions  that  could  be  moved,  but  not  deformed.  Evidently  a  more 
reasonable  approach  would  combine  the  models  that  lead  on  the  one  hand  to  (27.43) 
and,  on  the  other,  to  (27.51),  calculating  in  one  step  the  response  to  the  local  field 
of  ions  that  can  be  both  displaced  and  deformed.  Such  theories  are  known  as  shell 
model  theories.  They  generally  lead  to  results  that  differ  considerably  in  numerical 
detail  from  those  predicted  by  the  more  naive  (27.52),  but  have  many  of  the  same 
basic  structural  features.  We  therefore  explore  the  consequences  of  (27.52)  further, 
indicating  later  how  it  should  be  modified  in  a  more  reasonable  model. 
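The statement that the two static polarizabilities may be of comparable size can be made explicit with a rough estimate. The lines below assume that the atomic polarizability (27.43) has the form $e^2/m(\omega_0^2 - \omega^2)$, as suggested by the remark that it has the same form as (27.51), and use the order-of-magnitude ratios quoted in the text.

```latex
\frac{\alpha^{\rm at}(0)}{\alpha^{\rm dis}(0)}
  = \frac{e^2/m\omega_0^2}{e^2/M\bar\omega^2}
  = \frac{M}{m}\left(\frac{\bar\omega}{\omega_0}\right)^2
  \approx 10^{4}\times\left(10^{-2}\ \text{to}\ 10^{-3}\right)^2
  = 1\ \text{to}\ 10^{-2}.
```

The large mass ratio is thus offset by the small frequency ratio, so neither contribution can be neglected at zero frequency.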

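In conjunction with the Clausius-Mossotti relation (27.35), the approximation
expressed by (27.52) leads to a dielectric constant $\epsilon(\omega)$ for an ionic crystal, given by

$$\frac{\epsilon(\omega) - 1}{\epsilon(\omega) + 2} = \frac{4\pi}{3v}\left(\alpha^+ + \alpha^- + \frac{e^2}{M(\bar\omega^2 - \omega^2)}\right). \tag{27.53}$$

In particular, the static dielectric constant is given by

$$\frac{\epsilon_0 - 1}{\epsilon_0 + 2} = \frac{4\pi}{3v}\left(\alpha^+ + \alpha^- + \frac{e^2}{M\bar\omega^2}\right) \qquad (\omega \ll \bar\omega), \tag{27.54}$$

while the high-frequency18 dielectric constant satisfies

$$\frac{\epsilon_\infty - 1}{\epsilon_\infty + 2} = \frac{4\pi}{3v}\left(\alpha^+ + \alpha^-\right) \qquad (\bar\omega \ll \omega \ll \omega_0). \tag{27.55}$$

It is convenient to write $\epsilon(\omega)$ in terms of $\epsilon_0$ and $\epsilon_\infty$, since the two limiting forms
are readily measured: $\epsilon_0$ is the static dielectric constant of the crystal, while $\epsilon_\infty$ is
the dielectric constant at optical frequencies, and is therefore related to the index of
refraction, $n$, by $n^2 = \epsilon_\infty$. We have

$$\frac{\epsilon(\omega) - 1}{\epsilon(\omega) + 2} = \frac{\epsilon_\infty - 1}{\epsilon_\infty + 2} + \frac{1}{1 - (\omega^2/\bar\omega^2)}\left(\frac{\epsilon_0 - 1}{\epsilon_0 + 2} - \frac{\epsilon_\infty - 1}{\epsilon_\infty + 2}\right), \tag{27.56}$$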


18  In  this  context,  by  “high  frequencies”  we  shall  always  mean  frequencies  high  compared  with  lattice 
vibrational  frequencies,  but  low  compared  with  atomic  excitation  frequencies.  The  frequency  of  visible 
light generally satisfies this condition.




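which can be solved for $\epsilon(\omega)$:

$$\epsilon(\omega) = \epsilon_\infty + \frac{\epsilon_\infty - \epsilon_0}{(\omega^2/\omega_T^2) - 1}, \tag{27.57}$$

where

$$\omega_T^2 = \bar\omega^2\left(\frac{\epsilon_\infty + 2}{\epsilon_0 + 2}\right) = \bar\omega^2\left(1 - \frac{\epsilon_0 - \epsilon_\infty}{\epsilon_0 + 2}\right). \tag{27.58}$$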
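The step from the Clausius-Mossotti form to the constant-plus-pole form can be checked numerically. The sketch below assumes the standard statements of these results: the interpolation $\frac{\epsilon - 1}{\epsilon + 2} = A + \frac{B - A}{1 - \omega^2/\bar\omega^2}$ with $A$ and $B$ the high-frequency and static values, the pole form $\epsilon = \epsilon_\infty + (\epsilon_\infty - \epsilon_0)/(\omega^2/\omega_T^2 - 1)$, and $\omega_T^2 = \bar\omega^2(\epsilon_\infty + 2)/(\epsilon_0 + 2)$. The function names and sample values (frequencies measured in units of $\bar\omega$) are illustrative assumptions.

```python
import math


def cm(eps):
    """Clausius-Mossotti combination (eps - 1) / (eps + 2)."""
    return (eps - 1.0) / (eps + 2.0)


def eps_from_cm(omega, eps0, eps_inf, omega_bar):
    """Solve the Clausius-Mossotti interpolation for eps(omega)."""
    y = cm(eps_inf) + (cm(eps0) - cm(eps_inf)) / (1.0 - (omega / omega_bar) ** 2)
    # Invert y = (eps - 1)/(eps + 2)  ->  eps = (1 + 2y)/(1 - y)
    return (1.0 + 2.0 * y) / (1.0 - y)


def omega_transverse(eps0, eps_inf, omega_bar):
    """Frequency at which eps(omega) has its pole."""
    return omega_bar * math.sqrt((eps_inf + 2.0) / (eps0 + 2.0))


def eps_pole_form(omega, eps0, eps_inf, omega_bar):
    """Dielectric constant written as a constant plus a simple pole."""
    w_t = omega_transverse(eps0, eps_inf, omega_bar)
    return eps_inf + (eps_inf - eps0) / ((omega / w_t) ** 2 - 1.0)


# Sample values (assumed): eps0 = 4.90, eps_inf = 2.34, omega_bar = 1.
for w in (0.0, 0.5, 0.9, 2.0, 10.0):
    a = eps_from_cm(w, 4.90, 2.34, 1.0)
    b = eps_pole_form(w, 4.90, 2.34, 1.0)
    print(f"omega = {w:5.2f}   CM form: {a:+.6f}   pole form: {b:+.6f}")
```

The two columns agree at every sampled frequency, which is the content of the algebra leading from one form to the other.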


Application  to  Long-Wavelength  Optical  Modes  of  Ionic  Crystals 

To  calculate  the  normal  mode  dispersion  relations  in  an  ionic  crystal  we  could  proceed 
by  the  general  techniques  described  in  Chapter  22.  However,  we  would  encounter 
severe  computational  difficulties  because  of  the  very  long  range  of  the  interionic 
electrostatic  interactions.  Techniques  have  been  developed  for  dealing  with  this 
problem,  similar  to  those  exploited  in  calculating  the  cohesive  energy  of  an  ionic 
crystal  (Chapter  20).  However,  for  long-wavelength  optical  modes  one  can  avoid 
such  calculations  by  stating  the  problem  as  one  in  macroscopic  electrostatics: 

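In a long-wavelength ($\mathbf{k} \approx 0$) optical mode the oppositely charged ions in each
primitive cell undergo oppositely directed displacements, giving rise to a nonvanishing
polarization density $\mathbf{P}$. Associated with this polarization density there will
in general be a macroscopic electric field $\mathbf{E}$ and an electric displacement $\mathbf{D}$, related by

$$\mathbf{D} = \epsilon\mathbf{E} = \mathbf{E} + 4\pi\mathbf{P}. \tag{27.59}$$

In the absence of free charge, we have

$$\nabla\cdot\mathbf{D} = 0. \tag{27.60}$$

Furthermore, $\mathbf{E}^{\rm micro}$ is the gradient of a potential.19 It follows from (27.6) that $\mathbf{E}$ is
also, so that

$$\nabla\times\mathbf{E} = \nabla\times(-\nabla\phi) = 0. \tag{27.61}$$

In a cubic crystal $\mathbf{D}$ is parallel to $\mathbf{E}$ (i.e., $\epsilon$ is not a tensor) and therefore, from (27.59),
both are parallel to $\mathbf{P}$. If all three have the spatial dependence

$$\begin{Bmatrix}\mathbf{D}\\ \mathbf{E}\\ \mathbf{P}\end{Bmatrix} = \mathrm{Re}\begin{Bmatrix}\mathbf{D}_0\\ \mathbf{E}_0\\ \mathbf{P}_0\end{Bmatrix} e^{i\mathbf{k}\cdot\mathbf{r}}, \tag{27.62}$$

then (27.60) reduces to $\mathbf{k}\cdot\mathbf{D}_0 = 0$, which requires that

$$\mathbf{D} = 0 \quad\text{or}\quad \mathbf{D},\ \mathbf{E},\ \text{and}\ \mathbf{P} \perp \mathbf{k}, \tag{27.63}$$

while (27.61) reduces to $\mathbf{k}\times\mathbf{E}_0 = 0$, which requires that

$$\mathbf{E} = 0 \quad\text{or}\quad \mathbf{E},\ \mathbf{D},\ \text{and}\ \mathbf{P} \parallel \mathbf{k}. \tag{27.64}$$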

In a longitudinal optical mode the (nonzero) polarization density $\mathbf{P}$ is parallel to $\mathbf{k}$,


19 At optical frequencies one might worry about keeping only electrostatic fields, since the right
side of the full Maxwell equation, $\nabla\times\mathbf{E} = -(1/c)\,\partial\mathbf{B}/\partial t$, need not be negligible. We shall see shortly,
however, that a full electrodynamic treatment leads to very much the same conclusions.
and Eq. (27.63) therefore requires that $\mathbf{D}$ must vanish. This is consistent with (27.59)
only if

$$\mathbf{E} = -4\pi\mathbf{P}, \qquad \epsilon = 0 \qquad \text{(longitudinal mode)}. \tag{27.65}$$

On  the  other  hand,  in  a  transverse  optical  mode  the  (nonzero)  polarization  density 
P  is  perpendicular  to  k,  which  is  consistent  with  (27.64)  only  if  E  vanishes.  This, 
however,  is  consistent  with  (27.59)  only  if 


$$\mathbf{E} = 0, \qquad \epsilon = \infty \qquad \text{(transverse mode)}. \tag{27.66}$$

According to (27.57), $\epsilon = \infty$ when $\omega^2 = \omega_T^2$, and therefore the result (27.66)
identifies $\omega_T$ as the frequency of the long-wavelength ($k \to 0$) transverse optical
mode. The frequency $\omega_L$ of the longitudinal optical mode is determined by the
condition $\epsilon = 0$ (Eq. (27.65)), and (27.57) therefore gives

$$\omega_L^2 = \frac{\epsilon_0}{\epsilon_\infty}\,\omega_T^2. \tag{27.67}$$


This equation, relating the longitudinal and transverse optical-mode frequencies to
the static dielectric constant and index of refraction, is known as the Lyddane-Sachs-Teller
relation. Note that it follows entirely from the interpretation that (27.65) and
(27.66) lend to the zeros and poles of $\epsilon(\omega)$, together with the functional form of
(27.57), i.e., the fact that in the frequency range of interest $\epsilon$ as a function of $\omega^2$ is
a constant plus a simple pole. As a result, the relation has a validity going well beyond
the crude approximation (27.52) of additive polarizabilities, and also applies to the
far more sophisticated shell model theories of diatomic ionic crystals.
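The relation is easy to evaluate from tabulated dielectric constants. The sketch below assumes the values of $\epsilon_0$ and $\epsilon_\infty$ listed for NaI and KBr in Table 27.2 and computes the predicted ratio $\omega_L/\omega_T = (\epsilon_0/\epsilon_\infty)^{1/2}$; the function and dictionary names are hypothetical.

```python
import math

# (eps_0, eps_inf) as listed for these compounds in Table 27.2
DIELECTRIC_DATA = {
    "NaI": (7.28, 2.93),
    "KBr": (4.90, 2.34),
}


def lst_ratio(eps0, eps_inf):
    """omega_L / omega_T predicted by the Lyddane-Sachs-Teller relation."""
    return math.sqrt(eps0 / eps_inf)


for name, (eps0, eps_inf) in DIELECTRIC_DATA.items():
    print(f"{name}: predicted omega_L/omega_T = {lst_ratio(eps0, eps_inf):.3f}")
```

Both ratios come out well above unity, reflecting the fact that the static dielectric constant exceeds the optical one.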

Since the crystal is more polarizable at low frequencies20 than at high, $\omega_L$ exceeds
$\omega_T$. It may seem surprising that $\omega_L$ should differ at all from $\omega_T$ in the limit of long
wavelengths, since in this limit the ionic displacements in any region of finite extent
are indistinguishable. However, because of the long range of electrostatic forces, their
influence can always persist over distances comparable to the wavelength, no matter
how long that wavelength may be; thus longitudinal and transverse optical modes
will always experience different electrostatic restoring forces.21 Indeed, if we use the
Lorentz relation (27.27) we find from (27.65) that the electrostatic restoring force in
a long-wavelength longitudinal optical mode is given by the local field

$$(\mathbf{E}^{\rm loc})_L = \mathbf{E} + \frac{4\pi\mathbf{P}}{3} = -\frac{8\pi\mathbf{P}}{3} \qquad \text{(longitudinal)}, \tag{27.68}$$

while (from (27.66)) it is given in a long-wavelength transverse optical mode by

$$(\mathbf{E}^{\rm loc})_T = \frac{4\pi\mathbf{P}}{3} \qquad \text{(transverse)}. \tag{27.69}$$


20  At  frequencies  well  above  the  natural  vibrational  frequencies  of  the  ions,  they  fail  to  respond  to 
an  oscillatory  force,  and  one  has  only  atomic  polarizability.  At  low  frequencies  both  mechanisms  can 
contribute. 

21 This argument, based on instantaneous action at a distance, must be reexamined when the electrostatic
approximation (27.61) is dropped (see footnotes 19 and 25).
Thus in a longitudinal mode the local field acts to reduce the polarization (i.e., it
adds to the short-range restoring force proportional to $k = M\bar\omega^2$) while in a transverse
mode it acts to support the polarization (i.e., it reduces the short-range restoring
force). This is consistent with (27.58), which predicts that $\omega_T$ is less than $\bar\omega$ (since
$\epsilon_0 - \epsilon_\infty$ is positive). It is also consistent with (27.67), which, with the aid of (27.58),
can be written:

$$\omega_L^2 = \bar\omega^2\left(1 + \frac{2}{\epsilon_\infty}\,\frac{\epsilon_0 - \epsilon_\infty}{\epsilon_0 + 2}\right), \tag{27.70}$$

which indicates that $\omega_L$ exceeds $\bar\omega$.

The Lyddane-Sachs-Teller relation (27.67) has been confirmed by comparing
measurements of $\omega_L$ and $\omega_T$ from neutron scattering, with measured values of the
dielectric constant and index of refraction. In two alkali halides (NaI and KBr),
$\omega_L/\omega_T$ and $(\epsilon_0/\epsilon_\infty)^{1/2}$ were found to agree to within the experimental uncertainty of
the measurements (a few percent).22

However, because it is merely a consequence of the analytic form of $\epsilon(\omega)$, the
validity of the Lyddane-Sachs-Teller relation does not provide a very stringent test
of a theory. A more specific prediction can be constructed from Eqs. (27.54), (27.55),
and (27.58), which combine together to give

$$\frac{9}{4\pi}\,\frac{(\epsilon_0 - \epsilon_\infty)}{(\epsilon_\infty + 2)^2}\,\omega_T^2 = \frac{e^2}{Mv}. \tag{27.71}$$

Since $e^2/Mv$ is determined entirely by the ionic charge, the ionic reduced mass, and
the lattice constant, the right side of (27.71) is known. However, measured values of
$\epsilon_0$, $\epsilon_\infty$, and $\omega_T$ in the alkali halides lead to a value for the left side of (27.71) that can
be expressed in the form $(e^*)^2/Mv$, where $e^*$ (known as the Szigeti charge) ranges
between about $0.7e$ and $0.9e$. This should not be taken as evidence that the ions are
not fully charged, but as a telling sign of the failure of the crude assumption (27.52) that
atomic and displacement polarizabilities simply add to give the total polarizability.
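To see how such a Szigeti charge is extracted, the sketch below evaluates the combination $\frac{9}{4\pi}\frac{\epsilon_0 - \epsilon_\infty}{(\epsilon_\infty + 2)^2}\omega_T^2$ for NaCl and compares it with $e^2/Mv$ for fully charged ions. The dielectric data are the NaCl entries of Table 27.2; the lattice constant (5.64 Å), the atomic masses (22.99 and 35.45 amu), and the primitive-cell volume $a^3/4$ of the rocksalt structure are assumptions supplied here, not values from the text, and the function name is hypothetical.

```python
import math

# Physical constants in CGS (Gaussian) units
E_CHARGE = 4.80320e-10   # esu
HBAR = 1.054572e-27      # erg s
K_B = 1.380649e-16       # erg / K
AMU = 1.660539e-24       # g


def szigeti_ratio(eps0, eps_inf, theta_t, m_plus, m_minus, cell_volume):
    """Return e*/e, where (e*)^2/(M v) equals the measured left-hand side.

    theta_t is hbar*omega_T/k_B in kelvin; masses are in amu;
    cell_volume is the primitive-cell volume in cm^3.
    """
    omega_t = K_B * theta_t / HBAR
    lhs = 9.0 / (4.0 * math.pi) * (eps0 - eps_inf) / (eps_inf + 2.0) ** 2 * omega_t ** 2
    reduced_mass = AMU * m_plus * m_minus / (m_plus + m_minus)
    full_charge_value = E_CHARGE ** 2 / (reduced_mass * cell_volume)
    return math.sqrt(lhs / full_charge_value)


a = 5.64e-8  # cm, assumed NaCl cubic lattice constant
ratio = szigeti_ratio(5.90, 2.34, 245.0, 22.99, 35.45, a ** 3 / 4.0)
print(f"NaCl: e*/e = {ratio:.2f}")
```

With these inputs the ratio falls inside the range quoted above, noticeably below unity.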

To remedy this defect, one must turn to a shell model theory in which atomic and
displacement polarizations are calculated together, by allowing the electronic shell
to move relative to the ion core (as done above in calculating the atomic polarizability)
at the same time as the ion cores are themselves displaced.23 The general
structural form (27.57) of $\epsilon(\omega)$ is preserved in such a theory, but the specific forms
for the constants $\epsilon_0$, $\epsilon_\infty$, and $\omega_T$ can be quite different.


Application  to  the  Optical  Properties  of  Ionic  Crystals 

The above discussion of the transverse optical mode is not completely accurate, based
as it is on the electrostatic approximation (27.61) to the Maxwell equation:24

$$\nabla\times\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t}. \tag{27.72}$$


22 A. D. B. Woods et al., Phys. Rev. 131, 1025 (1963).

23  An  early  and  particularly  simple  model  is  given  by  S.  Roberts,  Phys.  Rev.  77, 258  (1950). 

24 Our discussion of the longitudinal optical mode is founded entirely on the Maxwell equation
$\nabla\cdot\mathbf{D} = 0$ and remains valid in a fully electrodynamic analysis.
When (27.61) is replaced by the more general (27.72), the conclusion (27.66) that the
transverse optical mode frequency is determined by the condition $\epsilon(\omega) = \infty$ must be
replaced by the more general result (Eq. (1.34)) that transverse fields with angular
frequency $\omega$ and wave vector $\mathbf{k}$ can propagate only if

$$\epsilon(\omega) = \frac{k^2c^2}{\omega^2}. \tag{27.73}$$

Thus for optical modes with wave vectors satisfying $kc \gg \omega$, the approximation
$\epsilon = \infty$ is reasonable. The frequencies of optical phonons are of order $\omega_D = k_D s$,
where $s$ is the speed of sound in the crystal, so this requires that

$$\frac{k}{k_D} \gg \frac{s}{c}. \tag{27.74}$$

Since $k_D$ is comparable to the dimensions of the Brillouin zone, while $s/c$ is of order
$10^{-4}$ to $10^{-5}$, the electrostatic approximation is fully justified except for optical
modes whose wave vectors are only a small fraction of a percent of the dimensions of
the zone from $k = 0$.

We can describe the structure of the transverse modes all the way down to $k = 0$,
by plotting $\epsilon$ vs. $\omega$ (Eq. (27.57)) (Figure 27.5). Note that $\epsilon$ is negative between $\omega_T$ and
$\omega_L$, so Eq. (27.73) requires $kc$ to be imaginary. Thus no radiation can propagate in
the crystal between the transverse and longitudinal optical frequencies. Outside this
forbidden range $\omega$ is plotted vs. $k$ in Figure 27.6. The dispersion relation has two
branches, lying entirely below $\omega_T$ and entirely above $\omega_L$. The lower branch has the
form $\omega = \omega_T$ except when $k$ is so small as to be comparable to $\omega_T/c$. It describes the
electric field accompanying a transverse optical mode in the constant-frequency
region. However, when $k$ is of order $\omega_T/c$ the frequency falls below $\omega_T$, vanishing as
$kc/\sqrt{\epsilon_0}$, a relation characteristic of ordinary electromagnetic radiation in a medium
with dielectric constant $\epsilon_0$.


Figure 27.5

Frequency-dependent dielectric constant for a diatomic ionic crystal.

($\epsilon(-\omega) = \epsilon(\omega)$)
Figure 27.6

Solutions to the dispersion relation $\omega = kc/\sqrt{\epsilon(\omega)}$ for transverse electromagnetic
modes propagating in a diatomic ionic crystal. (The relation to Figure 27.5 is
most readily seen by rotating the figure through 90° and considering it to be a plot of
$k = \omega\sqrt{\epsilon(\omega)}/c$, vs. $\omega$.) In the linear regions one mode is clearly photonlike and one
clearly optical phononlike. In the curved regions both modes have a mixed nature,
and are sometimes referred to as “polaritons.”


The upper branch, on the other hand, assumes the linear form $\omega = kc/\sqrt{\epsilon_\infty}$,
characteristic of electromagnetic radiation in a medium with dielectric constant $\epsilon_\infty$
when $k$ is large compared with $\omega_T/c$, but as $k$ approaches zero, the frequency does not
vanish linearly, but levels off to $\omega_L$.25
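The two branches can be computed in closed form. Substituting the constant-plus-pole form $\epsilon(\omega) = \epsilon_\infty + (\epsilon_\infty - \epsilon_0)/(\omega^2/\omega_T^2 - 1)$ into the propagation condition $\epsilon(\omega)\,\omega^2 = k^2c^2$ gives a quadratic in $\omega^2$: $\epsilon_\infty\omega^4 - (\epsilon_0\omega_T^2 + k^2c^2)\,\omega^2 + k^2c^2\omega_T^2 = 0$. The sketch below solves it; the function name and the sample dielectric constants are illustrative assumptions, and frequencies are measured in units of $\omega_T$.

```python
import math


def polariton_branches(kc, eps0, eps_inf, omega_t):
    """Return (lower, upper) frequencies satisfying eps(omega)*omega^2 = (kc)^2.

    kc is the wave vector times the speed of light, in the same
    units as omega_t.
    """
    a = eps_inf
    b = eps0 * omega_t ** 2 + kc ** 2
    c = kc ** 2 * omega_t ** 2
    disc = math.sqrt(b * b - 4.0 * a * c)
    x_upper = (b + disc) / (2.0 * a)
    # Product of the roots is c/a; this avoids cancellation in b - disc.
    x_lower = c / (a * x_upper)
    return math.sqrt(x_lower), math.sqrt(x_upper)


EPS0, EPS_INF, OMEGA_T = 5.90, 2.34, 1.0   # assumed sample values
for kc in (0.0, 0.5, 1.0, 2.0, 5.0, 50.0):
    lower, upper = polariton_branches(kc, EPS0, EPS_INF, OMEGA_T)
    print(f"kc = {kc:6.2f}   lower = {lower:.4f}   upper = {upper:.4f}")
```

At $kc = 0$ the upper branch starts at $\omega_T(\epsilon_0/\epsilon_\infty)^{1/2}$, i.e. at $\omega_L$, while for large $kc$ the lower branch saturates at $\omega_T$ and the upper branch approaches $kc/\sqrt{\epsilon_\infty}$.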

Finally, note that if the dielectric constant is a real number, then the reflectivity of
the crystal is given by (see Eq. (K.6) in Appendix K)

$$r = \left(\frac{\sqrt{\epsilon} - 1}{\sqrt{\epsilon} + 1}\right)^2. \tag{27.75}$$

As $\epsilon \to \infty$, the reflectivity approaches unity. Thus all incident radiation should be
perfectly reflected at the frequency of the transverse optical mode. This effect can be
amplified by repeated reflections of a ray from crystal faces. Since $n$ reflections will
diminish the intensity by $r^n$, after very many reflections only the component of radiation
with frequencies very close to $\omega_T$ will survive. This surviving radiation is known
as the reststrahl (residual ray). Such repeated reflections provide a very precise way
of measuring $\omega_T$, as well as a method for producing very monochromatic radiation
in the infrared.
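The filtering effect of repeated reflections can be illustrated numerically. The sketch below uses the standard normal-incidence expression $r = |(\sqrt{\epsilon} - 1)/(\sqrt{\epsilon} + 1)|^2$ together with the undamped constant-plus-pole form of $\epsilon(\omega)$. Taking the modulus with a complex square root, so that negative $\epsilon$ can be handled, is an extension assumed here beyond the real positive case discussed in the text; the function names and sample dielectric constants are likewise illustrative.

```python
import cmath


def epsilon(omega, eps0, eps_inf, omega_t):
    """Undamped dielectric constant: a constant plus a simple pole at omega_t."""
    return eps_inf + (eps_inf - eps0) / ((omega / omega_t) ** 2 - 1.0)


def reflectivity(eps):
    """Normal-incidence reflectivity |(sqrt(eps) - 1)/(sqrt(eps) + 1)|^2."""
    n = cmath.sqrt(eps)
    return abs((n - 1.0) / (n + 1.0)) ** 2


def after_n_reflections(r, n):
    """Fraction of the intensity surviving n successive reflections."""
    return r ** n


EPS0, EPS_INF, OMEGA_T = 5.90, 2.34, 1.0   # assumed sample values
for omega in (0.5, 0.95, 1.05, 1.2, 1.5, 2.5):
    r = reflectivity(epsilon(omega, EPS0, EPS_INF, OMEGA_T))
    print(f"omega/omega_T = {omega:4.2f}   r = {r:.3f}   r^5 = {after_n_reflections(r, 5):.3f}")
```

In this undamped model the reflectivity is unity wherever $\epsilon$ is negative, and after several reflections only frequencies in or very near that band retain appreciable intensity.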


25 Thus as $k \to 0$ a transverse mode does occur at the same frequency as the longitudinal mode
(see page 548). The reason this behavior emerges in an electrodynamic, but not an electrostatic, analysis
is basically the finite velocity of signal propagation in an electrodynamic theory. Electromagnetic signals
can only propagate with the speed of light, and therefore no matter how long their spatial range, they
can be effective in distinguishing longitudinal from transverse modes only if they can travel a distance
comparable to a wavelength in a time small compared with a period (i.e., $kc \gg \omega$). The argument on
page 548, which explains why $\omega_L$ and $\omega_T$ differ, implicitly assumes that Coulomb forces act instantaneously
at a distance, and becomes invalid when this assumption fails.
To the extent that the lattice vibrations are anharmonic (and therefore damped)
$\epsilon$ will also have an imaginary part. This broadens the reststrahl resonance. The typical
behavior  of  observed  frequency-dependent  dielectric  constants  in  ionic  crystals,  as 
deduced  from  their  optical  properties,  is  shown  in  Figure  27.7.  Alkali  halide  dielectric 
properties  are  summarized  in  Table  27.2. 


Figure  27.7 

(a) Real (solid line) and imaginary (dashed line) parts of the dielectric constant of zinc sulfide.
(After F. Abeles and J. P. Mathieu, Annales de Physique 3, 5 (1958); quoted by E. Burstein, Phonons
and Phonon Interactions, T. A. Bak, ed., W. A. Benjamin, Menlo Park, California, 1964.) (b) Real
(solid line) and imaginary (dashed line) parts of the dielectric constant of potassium chloride.
(After G. R. Wilkinson and C. Smart; quoted by D. H. Martin, Advances Phys. 14, 39 (1965).)


COVALENT  INSULATORS 

The  above  analysis  of  ionic  and  molecular  crystals  has  relied  on  the  possibility  of 
resolving  the  charge  distribution  of  the  crystal  into  contributions  from  identifiable 
ions (atoms, molecules) as in (27.9). In covalent crystals, however, appreciable electronic
charge density resides between ions (forming the so-called covalent bonds).
This  part  of  the  total  charge  distribution  is  uniquely  a  property  of  the  condensed 
state  of  matter,  bearing  no  resemblance  to  the  charge  distribution  of  single  isolated 
ions  (atoms,  molecules).  Furthermore,  since  it  comes  from  the  most  loosely  bound 
atomic  electrons,  it  makes  a  very  important  contribution  to  the  polarizability  of  the 
crystal.  Therefore,  in  calculating  dielectric  properties  of  covalent  crystals  one  must 
deal  with  the  polarizability  of  the  crystal  as  a  whole,  either  invoking  band  theory 
from the start or developing a phenomenology of “bond polarizabilities.”

We  shall  not  pursue  this  subject  here,  except  to  point  out  that  covalent  crystals 
can  have  quite  large  dielectric  constants,  reflecting  the  relatively  delocalized  structure 
of  their  electronic  charge  distributions.  Static  dielectric  constants  for  selected  covalent 
crystals are listed in Table 27.3. As we shall see (Chapter 28), the fact that the dielectric
Table 27.2

STATIC DIELECTRIC CONSTANT, OPTICAL DIELECTRIC
CONSTANT, AND TRANSVERSE OPTICAL PHONON
FREQUENCY FOR ALKALI HALIDE CRYSTALS

COMPOUND    $\epsilon_0$    $\epsilon_\infty$    $\hbar\omega_T/k_B$ (a)

LiF          9.01     1.96     442
NaF          5.05     1.74     354
KF           5.46     1.85     274
RbF          6.48     1.96     224
CsF          —        2.16     125
LiCl        11.95     2.78     276
NaCl         5.90     2.34     245
KCl          4.84     2.19     215
RbCl         4.92     2.19     183
CsCl         7.20     2.62     151
LiBr        13.25     3.17     229
NaBr         6.28     2.59     195
KBr          4.90     2.34     166
RbBr         4.86     2.34     139
CsBr         6.67     2.42     114
LiI         16.85     3.80     —
NaI          7.28     2.93     167
KI           5.10     2.62     156
RbI          4.91     2.59     117.5
CsI          6.59     2.62     94.6

a From the reststrahl peak; in degrees Kelvin.

Source: R. S. Knox and K. J. Teegarden, Physics of Color Centers, W. B.
Fowler, ed., Academic Press, New York, 1968, page 625.

Table 27.3

STATIC DIELECTRIC CONSTANTS FOR SELECTED COVALENT AND
COVALENT-IONIC CRYSTALS OF THE DIAMOND, ZINCBLENDE,
AND WURTZITE STRUCTURES (a)

CRYSTAL  STRUCTURE  $\epsilon_0$      CRYSTAL  STRUCTURE  $\epsilon_0$

C        d           5.7     ZnO      w          4.6
Si       d          12.0     ZnS      w          5.1
Ge       d          16.0     ZnSe     z          5.8
Sn       d          23.8     ZnTe     z          8.3
SiC      z           6.7     CdS      w          5.2
GaP      z           8.4     CdSe     w          7.0
GaAs     z          10.9     CdTe     z          7.1
GaSb     z          14.4     BeO      w          3.0
InP      z           9.6     MgO      z          3.0
InAs     z          12.2
InSb     z          15.7

a Quoted by J. C. Phillips, Phys. Rev. Lett. 20, 550 (1968).


constants  can  be  quite  substantial  is  a  point  of  considerable  importance  in  the  theory 
of  impurity  levels  in  semiconductors. 

PYROELECTRICITY 

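In deriving the macroscopic equation

$$\nabla\cdot\mathbf{E} = -4\pi\,\nabla\cdot\mathbf{P} \tag{27.76}$$

for ionic crystals, we assumed (see footnote 9) that the equilibrium dipole moment of
the primitive cell,

$$\mathbf{p}_0 = \sum_{\mathbf{d}} \mathbf{d}\,e(\mathbf{d}), \tag{27.77}$$

vanished, and therefore ignored a term

$$\Delta\mathbf{P} = \frac{\mathbf{p}_0}{v} \tag{27.78}$$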

in  the  polarization  density  P.  As  Figure  27.8  demonstrates,  the  value  of  the  dipole 
moment  p0  is  not  independent  of  the  choice  of  primitive  cell.  However,  since  only  the 
divergence  of  P  has  physical  significance,  an  additive  constant  vector  AP  does  not 
affect  the  physics  implied  by  the  macroscopic  Maxwell  equations. 
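The dependence of the cell dipole moment on the choice of cell is easy to verify for a one-dimensional chain. The sketch below assumes an idealized chain of alternating charges $+e$ and $-e$ with equal spacing $a/2$ (the geometry and names are illustrative assumptions) and evaluates $\sum_d d\,e(d)$ for two different neutral two-ion cells.

```python
def cell_dipole(ions):
    """Dipole moment sum of position * charge over the ions of a neutral cell.

    ions is a list of (position, charge) pairs; for a neutral cell the
    result does not depend on the origin from which positions are measured.
    """
    total_charge = sum(q for _, q in ions)
    assert abs(total_charge) < 1e-12, "cell must be neutral"
    return sum(x * q for x, q in ions)


a = 1.0   # lattice constant (arbitrary units, assumed)
e = 1.0   # ionic charge (arbitrary units, assumed)

cell_1 = [(0.0, +e), (0.5 * a, -e)]   # positive ion at the left edge
cell_2 = [(0.5 * a, -e), (a, +e)]     # negative ion at the left edge

print("cell 1 dipole:", cell_dipole(cell_1))
print("cell 2 dipole:", cell_dipole(cell_2))
```

The two choices give dipole moments of equal magnitude and opposite sign, differing by $ea$, so the corresponding polarization densities differ by a constant vector.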


Figure 27.8

The dipole moment of the primitive cell depends on the choice of primitive cell. This is illustrated for a
one-dimensional ionic crystal.


There would be nothing more to say if all crystals were infinite in extent. However,
real crystals have surfaces, at which the macroscopic polarization density $\mathbf{P}$ drops
discontinuously to zero, thereby contributing a singular term on the right side of
(27.76). This term is conventionally interpreted as a bound surface charge per unit
area, whose magnitude is the normal component of $\mathbf{P}$ at the surface, $P_n$. Thus an
additive constant in $\mathbf{P}$ is far from inconsequential in a finite crystal.

In a finite crystal, however, we must reexamine our assumption that each primitive
cell has zero total charge:

$$\sum_{\mathbf{d}} e(\mathbf{d}) = 0. \tag{27.79}$$

In an infinite crystal of identical cells this is merely the statement that the crystal as
a whole is neutral, but in a crystal with surfaces, only the interior cells are identically
occupied, and charge neutrality is perfectly consistent with partially filled, and therefore
charged, surface cells (Figure 27.9). Should one’s choice of cell lead to surface
cells containing net charge, an additional term would have to be added to (27.76) to
represent this bound surface charge, $\rho_s$. When the choice of cell is changed, both $P_n$
and $\rho_s$ will change, in such a way that the total net macroscopic surface charge density,
$P_n + \rho_s$, is unchanged.
Figure 27.9

The “natural” choice of primitive cell is one that leads to uncharged cells at the surface. The
cells chosen in (a) and (b) violate this criterion, and their contribution to the polarization density
is cancelled by the contribution from the charged surface cells. The cell in (c) (which is nonprimitive)
leads to uncharged surface cells and has no dipole moment.


Thus  the  “natural”  choice  of  cell  for  which  (27.76)  is  valid  without  an  additional 
term  representing  the  unbalanced  charge  in  surface  cells,  is  a  cell  whose  neutrality  is 
maintained  even  at  the  surfaces  of  actual  physical  specimens.26 

Crystals whose natural primitive cells have a nonvanishing dipole moment $\mathbf{p}_0$ are
called pyroelectric.27 In equilibrium a perfect specimen of a pyroelectric crystal has
a total dipole moment of $\mathbf{p}_0$ times the number of cells in the crystal,28 and thus a
polarization density $\mathbf{P} = \mathbf{p}_0/v$ throughout the crystal, even in the absence of an
external field. This immediately implies some severe restrictions on the point-group
symmetries of a pyroelectric crystal, for a symmetry operation must preserve all
crystalline properties and, in particular, the direction of $\mathbf{P}$. Thus the only possible
rotation axis is one parallel to $\mathbf{P}$, and furthermore there cannot be mirror planes
perpendicular to that axis. This excludes all point groups except (Table 7.3) $C_n$ and
$C_{nv}$ ($n = 2, 3, 4, 6$) and $C_1$ and $C_{1h}$. A glance at Table 7.3 reveals that these are the
only point groups compatible with the location of a directed object (an arrow, for
example) at each site.29


26  This  often  requires  a  cell  that  is  not  primitive  (see  Figure  27.9),  but  it  is  easily  verified  that  the 
earlier  analysis  in  this  chapter  is  in  no  way  affected  by  using  a  larger  microscopic  cell. 

27  The  name  (pyro  =  fire)  reflects  the  fact  that  under  ordinary  conditions  the  moment  of  a  pyroelectric 
crystal  will  be  masked  by  neutralizing  layers  of  ions  from  the  atmosphere  that  collect  on  the  faces  of  the 
crystal.  If,  however,  the  crystal  is  heated,  then  the  masking  will  no  longer  be  complete,  since  the  polarization 
will  change  due  to  thermal  expansion  of  the  crystal,  neutralizing  ions  will  be  evaporated,  etc.  Thus  the 
effect was first thought to be the production of an electric moment by heat. (Sometimes the term “polar
crystal” is used instead of “pyroelectric crystal.” However, “polar crystal” is also widely used as a synonym
for “ionic crystal” (whether pyroelectric or not), and the term is therefore best avoided.) The net polarization
can  also  be  masked  by  a  domain  structure,  as  in  ferromagnets  (see  Chapter  33). 

28  The  dipole  moment  of  the  surface  cells  need  not  be  p0,  but  in  the  limit  of  a  large  crystal  this  will 
have a negligible effect on the total dipole moment, since the overwhelming majority of cells will be in
the  interior. 

29 Some crystals, though nonpyroelectric in the absence of external stresses, can develop a spontaneous
dipole moment when mechanically strained; i.e., by suitable squeezing, their crystal structures can be
distorted  to  ones  that  can  sustain  a  dipole  moment.  Such  crystals  are  called  piezoelectric.  The  point  group 
of  a  piezoelectric  crystal  (when  unstrained)  cannot  contain  the  inversion. 
FERROELECTRICITY

The most stable structure of some crystals is nonpyroelectric above a certain temperature
Tc (known as the Curie temperature) and pyroelectric below it.30 Such
crystals  (examples  are  given  in  Table  27.4)  are  called  ferroelectrics.31  The  transition 
from  the  unpolarized  to  the  pyroelectric  state  is  called  first  order  if  it  is  discontinuous 
(i.e.,  if  P  acquires  a  nonzero  value  immediately  below  Tc)  and  second  or  higher  order, 
if  it  is  continuous  (i.e.,  if  P  grows  continuously  from  zero  as  T  drops  below  Tc).32 

Just  below  the  Curie  temperature  (for  a  continuous  ferroelectric  transition)  the 
distortion  of  the  primitive  cell  from  the  unpolarized  configuration  will  be  very  small, 
and it is therefore possible, by applying an electric field opposite to this small polarization,
to diminish and even reverse it. As T drops farther below Tc, the distortion of
the  cell  increases,  and  very  much  stronger  fields  are  required  to  reverse  the  direction 
of  P.  This  is  sometimes  taken  as  the  essential  attribute  of  ferroelectrics,  which  are 
then  defined  as  pyroelectric  crystals  whose  polarization  can  be  reversed  by  applying 
a  strong  electric  field.  This  is  done  to  include  those  crystals  one  feels  would  satisfy 
the  first  definition  (existence  of  a  Curie  temperature),  except  that  they  melt  before 
the  conjectured  Curie  temperature  can  be  reached.  Well  below  the  Curie  temperature, 
however,  the  reversal  of  polarization  may  require  so  drastic  a  restructuring  of  the 
crystal  as  to  be  impossible  even  in  the  strongest  attainable  fields. 

Immediately  below  the  Curie  temperature  of  a  continuous  ferroelectric  transition, 
the  crystal  spontaneously  and  continuously  distorts  to  a  polarized  state.  One  would 
therefore  expect  the  dielectric  constant  to  be  anomalously  large  in  the  neighborhood 
of  Tc,  reflecting  the  fact  that  it  requires  very  little  applied  field  to  alter  substantially 
the displacement polarization of the crystal. Dielectric constants as large as $10^5$ have
been  observed  near  ferroelectric  transition  points.  In  an  ideal  experiment  the  dielectric 
constant  should  actually  become  infinite  precisely  at  Tc.  For  a  continuous  transition 
this  simply  expresses  the  fact  that  as  Tc  is  approached  from  above,  the  net  restoring 
force  opposing  a  lattice  distortion  from  the  unpolarized  to  the  polarized  phase 
vanishes. 

If  the  restoring  force  opposing  a  particular  lattice  distortion  vanishes,  there  should 
be  a  zero-frequency  normal  mode  whose  polarization  vectors  describe  precisely  this 
distortion.  Since  the  distortion  leads  to  a  net  dipole  moment  and  therefore  involves 
a  relative  displacement  between  ions  of  opposite  charge,  the  mode  will  be  an  optical 
mode.  In  the  vicinity  of  the  transition,  relative  displacements  will  be  large,  anharmonic 
terms  will  be  substantial,  and  this  “soft”  mode  should  be  rather  strongly  damped. 

These  two  observations  (infinite  static  dielectric  constant  and  a  zero-frequency 
optical  mode)  are  not  independent.  One  implies  the  other  by  the  Lyddane-Sachs- 
Teller  relation  (27.67),  which  requires  the  transverse  optical-mode  frequency  to  vanish 
whenever  the  static  dielectric  constant  is  infinite. 


30  Transitions  back  and  forth  are  also  known:  e.g.,  there  can  be  a  range  of  temperatures  for  the 
pyroelectric  phase,  above  and  below  which  the  crystal  is  unpolarized. 

31  The  name  stresses  the  analogy  with  ferromagnetic  materials,  which  have  a  net  magnetic  moment. 
It  is  not  meant  to  suggest  that  iron  has  any  special  relation  to  the  phenomenon. 

32  Sometimes  the  term  “ferroelectric”  is  reserved  for  crystals  in  which  the  transition  is  second  order. 
Perhaps the simplest type of ferroelectric crystal (and the one most widely studied)
is  the  perovskite  structure,  shown  in  Figure  27.10.  Other  ferroelectrics  tend  to  be 
substantially  more  complex.  Some  characteristic  examples  are  given  in  Table  27.4. 

Figure 27.10

The perovskite structure, characteristic of the barium titanate (BaTiO3) class of
ferroelectrics, in the unpolarized phase. The crystal is cubic, with Ba++ ions at the cube
corners, O-- ions at the centers of the cube faces, and Ti4+ ions at the cube centers. The
first transition is to a tetragonal structure, the positive ions being displaced relative to
the negative ones along a [100] direction. The perovskite structure is an example of a cubic
crystal in which not every ion is at a point of full cubic symmetry. (The Ba++ and Ti4+ ions
are, but the O-- ions are not.) Therefore the local field acting on the oxygen ions is more
complicated than that given by the simple Lorentz formula. This is important in understanding
the mechanism for the ferroelectricity.

Table 27.4

SELECTED FERROELECTRIC CRYSTALS

NAME                              FORMULA               Tc (K)   P (μC/cm²)   at T (K)
Potassium dihydrogen phosphate    KH2PO4                123      4.75         96
Potassium dideuterium phosphate   KD2PO4                213      4.83         180
Rubidium dihydrogen phosphate     RbH2PO4               147      5.6          90
Rubidium dideuterium phosphate    RbD2PO4               218      -            -
Barium titanate                   BaTiO3                393      26.0         300
Lead titanate                     PbTiO3                763      >50          300
Cadmium titanate                  CdTiO3                55       -            -
Potassium niobate                 KNbO3                 708      30.0         523
Rochelle salt                     NaKC4H4O6 · 4H2O      (a)      0.25         278
Deuterated Rochelle salt          NaKC4H2D2O6 · 4D2O    (a)      0.35         279

(a) Has upper and lower Tc.

Source: F. Jona and G. Shirane, Ferroelectric Crystals, Pergamon, New York, 1962, p. 389.

PROBLEMS 


1.  Electric  Field  of  a  Neutral  Uniformly  Polarized  Sphere  of  Radius  a 

Far from the sphere, the potential $\phi$ will be that of a point dipole of moment $p = 4\pi P a^3/3$:

$$\phi = \frac{p\cos\theta}{r^2} \tag{27.80}$$

(where the polar axis is along P). Using the fact that the general solution to $\nabla^2\phi = 0$
proportional to $\cos\theta$ is

$$\frac{A\cos\theta}{r^2} + Br\cos\theta, \tag{27.81}$$

use the boundary conditions at the surface of the sphere to show that the potential inside the
sphere leads to a uniform field $\mathbf{E} = -4\pi\mathbf{P}/3$.

2.  Electric  Field  of  an  Array  of  Identical  Dipoles  with  Identical  Orientations, 
at  a  Point  with  Respect  to  Which  the  Array  Has  Cubic  Symmetry 

The potential at r due to the dipole at r' is

$$\phi = -\mathbf{p}\cdot\nabla\frac{1}{|\mathbf{r}-\mathbf{r}'|}. \tag{27.82}$$

By applying the restrictions of cubic symmetry to the tensor

$$\sum_{\mathbf{r}'}\nabla_\mu\nabla_\nu\frac{1}{|\mathbf{r}-\mathbf{r}'|}, \tag{27.83}$$

and noting that $\nabla^2(1/r) = 0$, $r \neq 0$, show that E(r) must vanish when the positions r'
of the dipoles have cubic symmetry about r.

3.  Polarizability  of  a  Single  Hydrogen  Atom 

Suppose an electric field E is applied (along the x-axis) to a hydrogen atom in its ground state
with wave function

$$\psi_0 \propto e^{-r/a_0}. \tag{27.84}$$

(a) Assume a trial function for the atom in the field of the form

$$\psi \propto \psi_0(1+\gamma x) = \psi_0 + \delta\psi, \tag{27.85}$$

and determine $\gamma$ by minimizing the total energy.

(b) Calculate the polarization

$$p = \int d\mathbf{r}\,(-e)\,x\,(\psi_0\,\delta\psi^* + \psi_0^*\,\delta\psi), \tag{27.86}$$

using the best trial function, and show that this leads to a polarizability $\alpha = 4a_0^3$.
(The exact answer is $4.5a_0^3$.)

4. Orientational Polarization

The following situation sometimes arises in pure solids and liquids whose molecules have
permanent dipole moments (such as water or ammonia) and also in solids such as ionic crystals
with some ions replaced by others with permanent moments (such as OH- in KCl).

(a) An electric field tends to align such molecules; thermal disorder favors misalignment.
Using equilibrium statistical mechanics, write down the probability that the dipole makes an
angle in the range from $\theta$ to $\theta + d\theta$ with the applied field. If there are N such
dipoles of moment p, show that their total dipole moment in thermal equilibrium is

$$Np\langle\cos\theta\rangle = NpL\!\left(\frac{pE}{k_BT}\right), \tag{27.87}$$
where L(x), the “Langevin function,” is given by

$$L(x) = \coth x - \frac{1}{x}. \tag{27.88}$$

(b) Typical dipole moments are of order 1 Debye unit ($10^{-18}$ in esu). Show that for an electric
field of order $10^4$ volts/cm the polarizability at room temperature can be written as

$$\alpha = \frac{p^2}{3k_BT}. \tag{27.89}$$
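The size of the argument $pE/k_BT$ in part (b) can be checked numerically. The following is a minimal Python sketch in Gaussian units; the conversion 1 statvolt ≈ 300 volts and the choice T = 300 K for room temperature are assumptions made here, and the function name is illustrative only.

```python
import math

K_B = 1.381e-16  # Boltzmann constant in erg/K


def langevin(x):
    """Langevin function L(x) = coth x - 1/x, as in (27.88)."""
    return 1.0 / math.tanh(x) - 1.0 / x


p = 1.0e-18          # dipole moment of 1 Debye, in esu
E = 1.0e4 / 300.0    # 10^4 volts/cm expressed in statvolts/cm (1 statvolt ~ 300 V, assumed)
T = 300.0            # room temperature in K (assumed)

x = p * E / (K_B * T)                 # dimensionless argument of L
ratio = langevin(x) / (x / 3.0)       # compares L(x) with its small-x form x/3

print(f"pE/k_BT = {x:.1e}")
print(f"L(x) / (x/3) = {ratio:.6f}")
```

The argument comes out far below unity, so only the leading small-x behavior of L(x) matters at such fields, which is the regime in which a field-independent polarizability can be defined.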


5.  Generalized  Lyddane-Sachs-Teller  Relation 

Suppose that the dielectric constant $\epsilon(\omega)$ does not have a single pole as a function of
$\omega^2$ (as in (27.57)) but has the more general structure:

$$\epsilon(\omega) = A + \sum_{i=1}^{n}\frac{B_i}{\omega^2-\omega_i^2}. \tag{27.90}$$

Show directly from (27.90) that the Lyddane-Sachs-Teller relation (27.67) is generalized to

$$\frac{\epsilon_0}{\epsilon_\infty} = \prod_i\left(\frac{\omega_i^0}{\omega_i}\right)^2, \tag{27.91}$$

where the $\omega_i^0$ are the frequencies at which $\epsilon$ vanishes. (Hint: Write the condition
$\epsilon = 0$ as an nth-degree polynomial in $\omega^2$, and note that the product of the roots is
simply related to the value of the polynomial at $\omega = 0$.) What is the significance of the
frequencies $\omega_i$ and $\omega_i^0$?
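A numerical spot check of the relation in Problem 5 can be made for n = 2. The Python sketch below assumes the form $\epsilon(\omega) = A + \sum_i B_i/(\omega^2-\omega_i^2)$ for (27.90), works with $x = \omega^2$, and uses arbitrary illustrative values of A, $B_i$, and $\omega_i^2$; it is a check on one example, not a proof.

```python
import math

# Illustrative (assumed) parameters for n = 2 poles; x denotes omega^2.
A = 2.0
poles = [1.0, 4.0]    # omega_i^2
B = [-1.0, -4.0]      # residues B_i


def epsilon(x):
    """Dielectric constant A + sum_i B_i / (x - x_i) as a function of x = omega^2."""
    return A + sum(b / (x - xi) for b, xi in zip(B, poles))


# Setting epsilon = 0 and clearing denominators gives a2*x^2 + a1*x + a0 = 0.
(x1, x2), (b1, b2) = poles, B
a2 = A
a1 = -A * (x1 + x2) + b1 + b2
a0 = A * x1 * x2 - b1 * x2 - b2 * x1
disc = math.sqrt(a1 * a1 - 4.0 * a2 * a0)
zeros = [(-a1 - disc) / (2.0 * a2), (-a1 + disc) / (2.0 * a2)]  # values of (omega_i^0)^2

lhs = epsilon(0.0) / A                      # epsilon_0 / epsilon_infinity
rhs = (zeros[0] * zeros[1]) / (x1 * x2)     # product of (omega_i^0 / omega_i)^2

print(lhs, rhs)
```

Here $\epsilon_\infty = A$, since every pole term vanishes as $\omega \to \infty$, and $\epsilon_0$ is the value at $\omega = 0$.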
In Chapter 12 we observed that electrons in a completely filled band can carry no
current. Within the independent electron model this result is the basis for the distinction
between insulators and metals: In the ground state of an insulator all bands
are  either  completely  filled  or  completely  empty;  in  the  ground  state  of  a  metal  at 
least  one  band  is  partially  filled. 

We can characterize insulators by the energy gap, $E_g$, between the top of the highest
filled band(s) and the bottom of the lowest empty band(s) (see Figure 28.1). A solid
with an energy gap will be nonconducting at T = 0 (unless the DC electric field is
so strong and the energy gap so minute that electric breakdown can occur (Eq. (12.8)),
or unless the AC field is of such high frequency that $\hbar\omega$ exceeds the energy gap).


Figure  28.1 

(a) In an insulator there is a region of forbidden energies separating the highest occupied
and lowest unoccupied levels. (b) In a metal the boundary occurs in a region of allowed
levels. This is indicated schematically by plotting the density of levels (horizontally)
vs. energy (vertically).


However,  when  the  temperature  is  not  zero  there  is  a  nonvanishing  probability 
that  some  electrons  will  be  thermally  excited  across  the  energy  gap  into  the  lowest 
unoccupied  bands,  which  are  called,  in  this  context,  the  conduction  bands,  leaving 
behind  unoccupied  levels  in  the  highest  occupied  bands,  called  valence  bands.  The 
thermally  excited  electrons  are  capable  of  conducting,  and  hole-type  conduction  can 
occur  in  the  band  out  of  which  they  have  been  excited. 

Whether such thermal excitation leads to appreciable conductivity depends critically
on the size of the energy gap, for the fraction of electrons excited across the gap
at temperature T is, as we shall see, roughly of order $e^{-E_g/2k_BT}$. With an energy gap
of 4 eV at room temperature ($k_BT \approx 0.025$ eV) this factor is $e^{-80} \sim 10^{-35}$, and
essentially no electrons are excited across the gap. If, however, $E_g$ is 0.25 eV, then the
factor at room temperature is $e^{-5} \approx 10^{-2}$, and observable conduction will occur.
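The two estimates above are simple arithmetic on the factor $e^{-E_g/2k_BT}$. A minimal Python sketch, taking $k_BT = 0.025$ eV at room temperature as in the text (the function name is illustrative only):

```python
import math

K_B_T_ROOM_EV = 0.025  # k_B T at room temperature in eV, as quoted in the text


def excitation_factor(gap_ev, kt_ev=K_B_T_ROOM_EV):
    """Rough fraction of electrons excited across a gap: exp(-Eg / 2 k_B T)."""
    return math.exp(-gap_ev / (2.0 * kt_ev))


for gap in (4.0, 0.25):
    print(f"Eg = {gap} eV: factor ~ {excitation_factor(gap):.1e}")
```

The sixteen-fold change in the gap changes the factor by more than thirty orders of magnitude, which is why the gap size decides whether thermal excitation matters at all.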

Solids  that  are  insulators  at  T  =  0,  but  whose  energy  gaps  are  of  such  a  size  that 
thermal  excitation  can  lead  to  observable  conductivity  at  temperatures  below  the 
melting  point,  are  known  as  semiconductors.  Evidently  the  distinction  between  a 
semiconductor  and  an  insulator  is  not  a  sharp  one,  but  roughly  speaking  the  energy 
gap  in  most  important  semiconductors  is  less  than  2  eV  and  frequently  as  low  as  a 
few tenths of an electron volt. Typical room temperature resistivities of semiconductors
are between $10^{-3}$ and $10^9$ ohm-cm (in contrast to metals, where $\rho \approx 10^{-6}$
ohm-cm, and good insulators, where $\rho$ can be as large as $10^{22}$ ohm-cm).

Since the number of electrons excited thermally into the conduction band (and
therefore the number of holes they leave behind in the valence band) varies exponentially
with 1/T, the electrical conductivity of a semiconductor should be a very rapidly
increasing function of temperature. This is in striking contrast to the case of metals.
The conductivity of a metal (Eq. (1.6)),

$$\sigma = \frac{ne^2\tau}{m}, \tag{28.1}$$

declines with increasing temperature, for the density of carriers n is independent of
temperature, and all temperature dependence comes from the relaxation time $\tau$,
which generally decreases with increasing temperature because of the increase in
electron-phonon scattering. The relaxation time in a semiconductor will also decrease
with increasing temperature, but this effect (typically described by a power law) is
quite overwhelmed by the very much more rapid increase in the density of carriers
with increasing temperature.1

Thus the most striking feature of semiconductors is that, unlike metals, their
electrical resistance declines with rising temperature; i.e., they have a “negative coefficient
of resistance.” It was this property that first brought them to the attention of
physicists in the early nineteenth century.2 By the end of the nineteenth century a
considerable  body  of  semiconducting  lore  had  been  amassed;  it  was  observed  that 
the  thermopowers  of  semiconductors  were  anomalously  large  compared  with  those 
of  metals  (by  a  factor  of  100  or  so),  that  semiconductors  exhibited  the  phenomenon 
of  photoconductivity,  and  that  rectifying  effects  could  be  obtained  at  the  junction  of 
two unlike semiconductors. Early in the twentieth century, measurements of the Hall
effect3 were made confirming the fact that the temperature dependence of the conductivity
was dominated by that of the number of carriers, and indicating that in
many  substances  the  sign  of  the  dominant  carrier  was  positive  rather  than  negative. 

Phenomena  such  as  these  were  a  source  of  considerable  mystery  until  the  full 
development  of  band  theory  many  years  later.  Within  the  band  theory  they  find 
simple  explanations.  For  example,  photoconductivity  (the  increase  in  conductivity 
produced  by  shining  light  on  a  material)  is  a  consequence  of  the  fact  that  if  the  band 


1 Thus the conductivity of a semiconductor is not a good measure of the collision rate, as it is in a
metal. It is often advantageous to separate from the conductivity a term whose temperature dependence
reflects only that of the collision rate. This is done by defining the mobility, $\mu$, of a carrier, as being the
ratio of the drift velocity it achieves in a field E, to the field strength: $v_d = \mu E$. If the carriers have density
n and charge q, the current density will be $j = nqv_d$, and therefore the conductivity is related to the mobility
by $\sigma = nq\mu$. The concept of mobility has little independent use in discussions of metals, since it is related
to the conductivity by a temperature-independent constant. However, it plays an important role in
descriptions of semiconductors (and any other conductors where the carrier density can vary, such as ionic
solutions), enabling one to disentangle two distinct sources of temperature dependence in the conductivity.
The usefulness of the mobility will be illustrated in our discussion of inhomogeneous semiconductors in
Chapter 29.

2 M. Faraday, Experimental Researches on Electricity, 1839, Facsimile Reprint by Taylor and Francis,
London. R. A. Smith, Semiconductors, Cambridge University Press, 1964, provides one of the most pleasant
introductions  to  the  subject  available.  Most  of  the  information  in  our  brief  historical  survey  is  drawn 
from  it. 

3  One  might  expect  that  the  number  of  excited  electrons  would  equal  the  number  of  holes  left  behind, 
so  that  the  Hall  effect  would  yield  little  direct  information  on  the  number  of  carriers.  However,  as  we 
shall  see,  the  number  of  electrons  need  not  equal  the  number  of  holes  in  an  impure  semiconductor,  and 
these  were  the  only  ones  available  at  the  time  of  the  early  experiments. 


564  Chapter  28  Homogeneous  Semiconductors 


gap  is  small,  then  visible  light  can  excite  electrons  across  the  gap  into  the  conduction 
band,  resulting  in  conduction  by  those  electrons  and  by  the  holes  left  behind.  The 
thermopower, to take another example, is roughly a hundred times larger in a semiconductor
than in a metal. This is because the density of carriers is so low in a
semiconductor that they are properly described by Maxwell-Boltzmann statistics (as
we shall see below). Thus the factor of 100 is the same factor by which the early
theories of metals (prior to Sommerfeld’s introduction of Fermi-Dirac statistics)
overestimated the thermopower (page 25).

The  band  theoretic  explanations  of  these  and  other  characteristic  semiconducting 
properties  will  be  the  subject  of  this  chapter  and  the  next. 

Compilation  of  reliable  information  on  semiconductors  in  the  early  days  was 
substantially  impeded  by  the  fact  that  data  are  enormously  sensitive  to  the  purity 
of  the  sample.  An  example  of  this  is  shown  in  Figure  28.2,  where  the  resistivity  of 
germanium is plotted vs. T for a variety of impurity concentrations. Note that concentrations
as low as parts in $10^8$ can lead to observable effects, and that the resistivity
can vary at a given temperature by a factor of $10^{12}$, as the impurity concentration
changes by only a factor of $10^3$. Note also that, for a given impurity concentration,
the resistivity eventually falls onto a common curve as the temperature increases.
This latter resistivity, which is evidently the resistivity of an ideal perfectly pure
sample, is known as the intrinsic resistivity, while the data for the various samples,
except  at  temperatures  so  high  that  they  agree  with  the  intrinsic  curve,  are  referred 
to as extrinsic properties. Quite generally, a semiconductor is intrinsic if its electronic
properties are dominated by electrons thermally excited from the valence to the conduction
band, and extrinsic if its electronic properties are dominated by electrons
contributed  to  the  conduction  band  by  impurities  (or  captured  from  the  valence  band 
by  impurities)  in  a  manner  to  be  described  below.  We  shall  return  shortly  to  the 
question  of  why  semiconducting  properties  are  so  very  sensitive  to  the  purity  of  the 
specimen. 

EXAMPLES  OF  SEMICONDUCTORS 

Semiconducting  crystals  come  primarily  from  the  covalent  class  of  insulators.4  The 
simple  semiconducting  elements  are  from  column  IV  of  the  periodic  table,  silicon 
and  germanium  being  the  two  most  important  elemental  semiconductors.  Carbon, 
in  the  form  of  diamond,  is  more  properly  classified  as  an  insulator,  since  its  energy 
gap  is  of  order  5.5  eV.  Tin,  in  the  allotropic  form  of  grey  tin,  is  semiconducting,  with 
a very small energy gap. (Lead, of course, is metallic.) The other semiconducting
elements, red phosphorus, boron, selenium, and tellurium, tend to have highly complex
crystal structures, characterized, however, by covalent bonding.

In  addition  to  the  semiconducting  elements  there  is  a  variety  of  semiconducting 
compounds. One broad class, the III-V semiconductors, consists of crystals of the
zincblende  structure  (page  81)  composed  of  elements  from  columns  III  and  V  of  the 


4  Among  the  various  categories  of  insulating  crystals,  the  covalent  crystals  have  a  spatial  distribution 
of  electronic  charge  most  similar  to  metals.  (See  Chapter  19.) 
Figure 28.2

The resistivity of antimony-doped germanium as a function of 1/T for several impurity
concentrations. (From H. J. Fritzsche, J. Phys. Chem. Solids 6, 69 (1958).)


periodic  table.  As  described  in  Chapter  19,  the  bonding  in  such  compounds  is  also 
predominantly  covalent.  Semiconducting  crystals  made  up  of  elements  from  columns 
II and VI begin to have a strong ionic as well as a covalent character. These are
known as polar semiconductors, and can have either the zincblende structure or, as
in the case of lead selenide, telluride, or sulfide, the sodium chloride structure more
characteristic of ionic bonding. There are also many far more complicated semiconducting
compounds.

Some  examples  of  the  more  important  semiconductors  are  given  in  Table  28.1. 
The  energy  gaps  quoted  for  each  are  reliable  to  within  about  5  percent.  Note  that 
the  energy  gaps  are  all  temperature-dependent,  varying  by  about  10  percent  between 
0 K and room temperature. There are two main sources of this temperature dependence.
Because of thermal expansion the periodic potential experienced by the
electrons  (and  hence  the  band  structure  and  the  energy  gap)  can  vary  with  temperature. 
In  addition,  the  effect  of  lattice  vibrations  on  the  band  structure  and  energy  gap5  will 
also  vary  with  temperature,  reflecting  the  temperature  dependence  of  the  phonon 
distribution.  In  general  these  two  effects  are  of  comparable  importance,  and  lead  to 
an  energy  gap  that  is  linear  in  T  at  room  temperature  and  quadratic  at  very  low 
temperatures  (Figure  28.3). 

Table 28.1

ENERGY GAPS OF SELECTED SEMICONDUCTORS

MATERIAL      Eg (T = 300 K)   Eg (T = 0 K)   E0 (LINEAR EXTRAPOLATION TO T = 0)   LINEAR DOWN TO
Si            1.12 eV          1.17           1.2                                  200 K
Ge            0.67             0.75           0.78                                 150
PbS           0.37             0.29           0.25                                 -
PbSe          0.26             0.17           0.14                                 20
PbTe          0.29             0.19           0.17                                 -
InSb          0.16             0.23           0.25                                 100
GaSb          0.69             0.79           0.80                                 75
AlSb          1.5              1.6            1.7                                  80
InAs          0.35             0.43           0.44                                 80
InP           1.3              -              1.4                                  80
GaAs          1.4              -              1.5                                  -
GaP           2.2              -              2.4                                  -
Grey Sn       0.1              -              -                                    -
Grey Se       1.8              -              -                                    -
Te            0.35             -              -                                    -
B             1.5              -              -                                    -
C (diamond)   5.5              -              -                                    -

Sources: C. A. Hogarth, ed., Materials Used in Semiconductor Devices, Interscience, New York,
1965; O. Madelung, Physics of III-V Compounds, Wiley, New York, 1964; R. A. Smith,
Semiconductors, Cambridge University Press, 1964.

The  energy  gap  can  be  measured  in  several  ways.  The  optical  properties  of  the 
crystal  are  one  of  the  most  important  sources  of  information.  When  the  frequency 
of an incident photon becomes large enough for $\hbar\omega$ to exceed the energy gap, then,
just as in metals (see pages 293, 294), there will be an abrupt increase in the absorption


5  Via,  for  example,  the  kinds  of  effects  described  in  Chapter  26. 
Figure 28.3

Typical temperature dependence of the energy gap of a semiconductor. Values of $E_0$,
$E_g(0)$, and $E_g(300\ \mathrm{K})$ for several materials are listed in Table 28.1.


of incident radiation. If the conduction band minimum occurs at the same point in
k-space as the valence band maximum, then the energy gap can be directly determined
from the optical threshold. If, as is often the case, the minima and maxima occur at
different points in k-space, then for crystal momentum to be conserved a phonon
must also participate in the process,6 which is then known as an “indirect transition”
(Figure 28.4). Since the phonon will supply not only the missing crystal momentum


Figure 28.4

Photon absorption via (a) direct and (b) indirect transitions. In (a) the optical threshold is at
$\omega = E_g/\hbar$; in (b) it occurs at $E_g/\hbar - \omega(\mathbf{q})$, since the phonon of wave vector q
that must be absorbed to supply the missing crystal momentum also supplies an energy $\hbar\omega(\mathbf{q})$.


6  At  optical  frequencies  the  crystal  momentum  supplied  by  the  photon  itself  is  negligibly  small. 
$\hbar\mathbf{k}$, but also an energy $\hbar\omega(\mathbf{k})$, the photon energy at the optical threshold will be less
than $E_g$ by an amount of order $\hbar\omega_D$. This is typically a few hundredths of an electron
volt, and therefore of little consequence except in semiconductors with very small
energy gaps.7

The energy gap may also be deduced from the temperature dependence of the
intrinsic conductivity, which is predominantly a reflection of the very strong temperature
dependence of the carrier densities. These vary (as we shall see below)
essentially as $e^{-E_g/2k_BT}$, so that if $-\ln(\sigma)$ is plotted against $1/2k_BT$, the slope8 should
be very nearly the energy gap, $E_g$.
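The slope extraction just described can be sketched numerically. The Python example below builds synthetic conductivities from an assumed gap $E_g = E_0 - AT$ and a temperature-independent prefactor (the values of E0, A, and SIGMA0 are illustrative, not data, and the power-law temperature dependence of the prefactor is ignored), then fits $-\ln\sigma$ against $1/2k_BT$ by least squares.

```python
import math

K_B = 8.617e-5  # Boltzmann constant in eV/K


def fit_slope(xs, ys):
    """Least-squares slope of ys against xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


# Illustrative (assumed) parameters: gap Eg(T) = E0 - A*T and a constant prefactor.
E0 = 0.78        # eV
A = 4.0e-4       # eV/K
SIGMA0 = 1.0e4   # arbitrary units

temperatures = [300.0 + 10.0 * i for i in range(21)]   # 300 K to 500 K
x = [1.0 / (2.0 * K_B * t) for t in temperatures]
sigma = [SIGMA0 * math.exp(-(E0 - A * t) / (2.0 * K_B * t)) for t in temperatures]
y = [-math.log(s) for s in sigma]

slope = fit_slope(x, y)
print(f"fitted slope = {slope:.3f} eV")
```

In this idealized model the fitted slope reproduces E0 rather than the room temperature gap, because a term linear in T in the gap contributes only a constant to $-\ln\sigma$.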


TYPICAL  SEMICONDUCTOR  BAND  STRUCTURES 


The electronic properties of semiconductors are completely determined by the comparatively
small numbers of electrons excited into the conduction band and holes left
behind in the valence band. The electrons will be found almost exclusively in levels
near the conduction band minima, while the holes will be confined to the neighborhood
of the valence band maxima. Therefore the energy vs. wave vector relations for
the  carriers  can  generally  be  approximated  by  the  quadratic  forms  they  assume  in 


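the neighborhood of such extrema:9

$$\begin{aligned}
\mathcal{E}(\mathbf{k}) &= \mathcal{E}_c + \frac{\hbar^2}{2}\sum_{\mu\nu} k_\mu (\mathbf{M}^{-1})_{\mu\nu} k_\nu \quad \text{(electrons)},\\
\mathcal{E}(\mathbf{k}) &= \mathcal{E}_v - \frac{\hbar^2}{2}\sum_{\mu\nu} k_\mu (\mathbf{M}^{-1})_{\mu\nu} k_\nu \quad \text{(holes)}.
\end{aligned} \tag{28.2}$$

Here $\mathcal{E}_c$ is the energy at the bottom of the conduction band, $\mathcal{E}_v$ is the energy at the top
of the valence band, and we have taken the origin of k-space to lie at the band maximum
or minimum. If there is more than one maximum or minimum, there will be
one such term for each point. Since the tensor $\mathbf{M}^{-1}$ is real and symmetric, one can
find a set of orthogonal principal axes for each such point, in terms of which the
energies have the diagonal forms

$$\begin{aligned}
\mathcal{E}(\mathbf{k}) &= \mathcal{E}_c + \hbar^2\left(\frac{k_1^2}{2m_1} + \frac{k_2^2}{2m_2} + \frac{k_3^2}{2m_3}\right) \quad \text{(electrons)},\\
\mathcal{E}(\mathbf{k}) &= \mathcal{E}_v - \hbar^2\left(\frac{k_1^2}{2m_1} + \frac{k_2^2}{2m_2} + \frac{k_3^2}{2m_3}\right) \quad \text{(holes)}.
\end{aligned} \tag{28.3}$$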


7  To  extract  a  really  accurate  band  gap  from  the  optical  absorption  data,  however,  it  is  necessary 
to  determine  the  phonon  spectrum  and  use  it  to  analyze  the  indirect  transitions. 

8 In deducing the energy gap in this way, however, one must remember that at room temperature
the gaps of most semiconductors have a linear variation with temperature. If $E_g = E_0 - AT$, then the slope
of the graph will be not $E_g$ but $E_0$, the linear extrapolation of the room temperature gap to zero temperature
(Figure 28.3). Values of $E_0$ extracted from this linear extrapolation procedure are also given in Table 28.1.

9  The  inverse  of  the  matrix  of  coefficients  in  (28.2)  is  called  M  because  it  is  a  special  case  of  the  general 
effective  mass  tensor  introduced  on  page  228.  The  electron  mass  tensor  will  not,  of  course,  be  the  same 
as  the  hole  mass  tensor,  but  to  avoid  a  multiplicity  of  subscripts  we  use  the  single  generic  symbol  M  for 
both. 
Thus the constant energy surfaces about the extrema are ellipsoidal in shape, and
are generally specified by giving the principal axes of the ellipsoids, the three “effective
masses,” and the location in k-space of the ellipsoids. Some important examples are:

Silicon The crystal has the diamond structure, so the first Brillouin zone is the
truncated octahedron appropriate to a face-centered cubic Bravais lattice. The conduction
band has six symmetry-related minima at points in the <100> directions,
about 80 percent of the way to the zone boundary (Figure 28.5). By symmetry each


Figure  28.5 

Constant-energy surfaces near the conduction band minima in
silicon. There are six symmetry-related ellipsoidal pockets. The
long axes are directed along <100> directions.


of the six ellipsoids must be an ellipsoid of revolution about a cube axis. They are
quite cigar-shaped, being elongated along the cube axis. In terms of the free electron
mass m, the effective mass along the axis (the longitudinal effective mass) is $m_L \approx 1.0m$,
while the effective masses perpendicular to the axis (the transverse effective mass)
are $m_T \approx 0.2m$. There are two degenerate valence band maxima, both located at
k = 0, which are spherically symmetric to the extent that the ellipsoidal expansion
is valid, with masses of 0.49m and 0.16m (Figure 28.6).


Figure  28.6 

Energy  bands  in  silicon.  Note  the  conduction  band  minimum  along 
[100]  that  gives  rise  to  the  ellipsoids  of  Figure  28.5.  The  valence 
band  maximum  occurs  at  k  =  0,  where  two  degenerate  bands  with 
different  curvatures  meet,  giving  rise  to  “light  holes”  and  “heavy 
holes.”  Note  also,  the  third  band,  only  0.044  eV  below  the  valence 
band  maximum.  This  band  is  separated  from  the  other  two  only 
by  spin-orbit  coupling.  At  temperatures  on  the  order  of  room 
temperature ($k_BT = 0.025$ eV) it too may be a significant source of
carriers.  (From  C.  A.  Hogarth,  ed.  Materials  Used  in  Semiconductor 
Devices,  Interscience,  New  York,  1965.) 


Germanium  The  crystal  structure  and  Brillouin  zone  are  as  in  silicon.  However,  the 
conduction band minima now occur at the zone boundaries in the <111> directions.
Minima on parallel hexagonal faces of the zone represent the same physical levels,
so there are four symmetry-related conduction band minima. The ellipsoidal constant
energy surfaces are ellipsoids of revolution elongated along the <111> directions,
with effective masses $m_L \approx 1.6m$ and $m_T \approx 0.08m$ (Figure 28.7). There are again two
Figure 28.7

Constant-energy surfaces near the conduction band minima in germanium.
There are eight symmetry-related half ellipsoids with long axes
along <111> directions centered on the midpoints of the hexagonal
zone faces. With a suitable choice of primitive cell in k-space these can
be represented as four ellipsoids, the half ellipsoids on opposite faces
being joined together by translations through suitable reciprocal
lattice vectors.


degenerate valence bands, both with maxima at k = 0, which are spherically symmetric
in the quadratic approximation with effective masses of 0.28m and 0.044m
(Figure 28.8).


Figure  28.8 

Energy  bands  in  germanium.  Note  the  conduction  band 
minimum  along  [111]  at  the  zone  boundary  that  gives  rise  to 
the  four  ellipsoidal  pockets  of  Figure  28.7.  The  valence  band 
maximum,  as  in  silicon,  is  at  k  =  0,  where  two  degenerate 
bands  with  different  curvatures  meet,  giving  rise  to  two  pockets 
of holes with distinct effective masses. (From C. A. Hogarth,
ed., Materials Used in Semiconductor Devices, Interscience,
New York, 1965.)


Indium  antimonide  This  compound,  which  has  the  zincblende  structure,  is  interesting 
because all valence band maxima and conduction band minima are at k = 0. The energy
surfaces are therefore spherical. The conduction band effective mass is very small,
$m^* \approx 0.015m$. Information on the valence band masses is more ambiguous, but there
appear to be two spherical pockets about k = 0, one with an effective mass of about
0.2m (heavy holes) and another with effective mass of about 0.015m (light holes).


CYCLOTRON  RESONANCE 

The  effective  masses  discussed  above  are  measured  by  the  technique  of  cyclotron 
resonance.  Consider  an  electron  close  enough  to  the  bottom  of  the  conduction  band 
(or  top  of  the  valence  band)  for  the  quadratic  expansion  (28.2)  to  be  valid.  In  the 
presence  of  a  magnetic  field  H  the  semiclassical  equations  of  motion  (12.32)  and 
(12.33)  imply  that  the  velocity  v(k)  obeys  the  single  set  of  equations 


M  —  =  +-  vxH. 
dt  c 

(28.4) 

In a constant uniform field (taken along the z-axis) it is not difficult to show (Problem 1)
that (28.4) has an oscillatory solution

$$\mathbf{v} = \operatorname{Re}\,\mathbf{v}_0 e^{-i\omega t}, \tag{28.5}$$

provided that

$$\omega = \frac{eH}{m^*c}, \tag{28.6}$$

where m*, the “cyclotron effective mass,” is given by

$$m^* = \left(\frac{\det\mathbf{M}}{M_{zz}}\right)^{1/2}. \tag{28.7}$$

This result can also be written in terms of the eigenvalues and principal axes of the
mass tensor as (Problem 1):

$$m^* = \sqrt{\frac{m_1 m_2 m_3}{\hat{H}_1^2 m_1 + \hat{H}_2^2 m_2 + \hat{H}_3^2 m_3}}, \tag{28.8}$$

where the $\hat{H}_i$ are the components along the three principal axes of a unit vector
parallel to the field.
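Formula (28.8) is easy to evaluate numerically. The following is a minimal sketch, not part of the original text: the function name `cyclotron_mass` and the sample masses are illustrative assumptions, chosen only to show how the cyclotron mass of an ellipsoid of revolution changes with field direction.

```python
import math

def cyclotron_mass(m1, m2, m3, field_direction):
    """Cyclotron effective mass from Eq. (28.8).

    m1, m2, m3: principal values of the mass tensor (any common unit).
    field_direction: field components along the principal axes;
    the vector is normalized here, so its length does not matter.
    """
    hx, hy, hz = field_direction
    norm = math.sqrt(hx * hx + hy * hy + hz * hz)
    h1, h2, h3 = hx / norm, hy / norm, hz / norm
    denom = h1 * h1 * m1 + h2 * h2 * m2 + h3 * h3 * m3
    return math.sqrt(m1 * m2 * m3 / denom)

# Hypothetical ellipsoid of revolution (masses in units of m), long axis = axis 3.
m_t, m_l = 0.1, 1.0
along_axis = cyclotron_mass(m_t, m_t, m_l, (0, 0, 1))   # field along the long axis
across_axis = cyclotron_mass(m_t, m_t, m_l, (1, 0, 0))  # field perpendicular to it
```

With the field along the long axis the result is the transverse mass, and with the field perpendicular to it the result is the geometric mean of the two masses, in agreement with (28.7) for a diagonal mass tensor.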

Note  that  the  cyclotron  frequency  depends,  for  a  given  ellipsoid,  on  the  orientation 
of  the  magnetic  field  with  respect  to  that  ellipsoid,  but  not  on  the  initial  wave  vector 
or  energy  of  the  electron.  Thus  for  a  given  orientation  of  the  crystal  with  respect  to 
the  field,  all  electrons  in  a  given  ellipsoidal  pocket  of  conduction  band  electrons 
(and,  by  the  same  token,  all  holes  in  a  given  ellipsoidal  pocket  of  valence  band  holes) 
precess  at  a  frequency  entirely  determined  by  the  effective  mass  tensor  describing 
that  pocket.  There  will  therefore  be  a  small  number  of  distinct  cyclotron  frequencies. 
By  noting  how  these  resonant  frequencies  shift  as  the  orientation  of  the  magnetic 
field  is  varied,  one  can  extract  from  (28.8)  the  kind  of  information  we  quoted  above. 

To  observe  cyclotron  resonance  it  is  essential  that  the  cyclotron  frequency  (28.6) 
be  larger  than  or  comparable  to  the  collision  frequency.  As  in  the  case  of  metals, 
this  generally  requires  working  with  very  pure  samples  at  very  low  temperatures, 
to  reduce  both  impurity  scattering  and  phonon  scattering  to  a  minimum.  Under 
such  conditions  the  electrical  conductivity  of  a  semiconductor  will  be  so  small  that 
(in  contrast  to  the  case  of  a  metal  (page  278))  the  driving  electromagnetic  field  can 
penetrate  far  enough  into  the  sample  to  excite  the  resonance  without  any  difficulties 
associated  with  a  skin  depth.  On  the  other  hand,  under  such  conditions  of  low 
temperatures  and  purity  the  number  of  carriers  available  in  thermal  equilibrium 
to  participate  in  the  resonance  may  well  be  so  small  that  carriers  will  have  to  be 
created  by  other  means — such  as  photoexcitation.  Some  typical  cyclotron  resonance 
data  are  shown  in  Figure  28.9. 
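The requirement that the cyclotron frequency be at least comparable to the collision frequency can be made concrete with a small estimate. This sketch works in Gaussian units, as (28.6) does; the function names, the sample field, the mass ratio, the relaxation times, and the sharp threshold of 1 are all illustrative assumptions rather than values from the text.

```python
E_CHARGE = 4.803e-10    # electron charge, esu
M_ELECTRON = 9.109e-28  # free electron mass, g
C_LIGHT = 2.998e10      # speed of light, cm/s

def cyclotron_frequency(field_gauss, mass_ratio):
    """Angular frequency eH/(m* c) of Eq. (28.6) in rad/s; mass_ratio is m*/m."""
    return E_CHARGE * field_gauss / (mass_ratio * M_ELECTRON * C_LIGHT)

def resonance_resolvable(field_gauss, mass_ratio, relaxation_time):
    """Crude criterion: omega * tau >= 1 (the threshold 1 is an assumption)."""
    return cyclotron_frequency(field_gauss, mass_ratio) * relaxation_time >= 1.0

omega = cyclotron_frequency(1000.0, 0.1)                 # 1 kG field, m* = 0.1 m
clean_sample = resonance_resolvable(1000.0, 0.1, 1e-10)  # long relaxation time
dirty_sample = resonance_resolvable(1000.0, 0.1, 1e-13)  # short relaxation time
```

For these assumed numbers the frequency falls in the microwave range, and only the sample with the longer relaxation time satisfies the criterion.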


[Figure 28.9: absorption (arbitrary units) plotted against magnetic field (gauss); panels (a) and (b).]


Figure  28.9 

Typical cyclotron resonance signals in (a) germanium and (b) silicon. The field lies in a (110)
plane and makes an angle with the [001] axis of 60° (Ge) and 30° (Si). (From G. Dresselhaus et al.,
Phys. Rev. 98, 368 (1955).)


NUMBER  OF  CARRIERS  IN  THERMAL  EQUILIBRIUM 


The most important property of any semiconductor at temperature T is the number
of electrons per unit volume in the conduction band, $n_c$, and the number of holes10
per unit volume in the valence band, $p_v$. The determination of these as a function
of  temperature  is  a  straightforward,  though  sometimes  algebraically  complicated, 
exercise  in  the  application  of  Fermi-Dirac  statistics  to  the  appropriate  set  of  one- 
electron  levels. 

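The values of $n_c(T)$ and $p_v(T)$ depend critically, as we shall see, on the presence of
impurities. However, there are certain general relations that hold regardless of the
purity of the sample, and we consider these first. Suppose the density of levels
(page 143) is $g_c(\mathcal{E})$ in the conduction band and $g_v(\mathcal{E})$ in the valence band. The effect
of impurities, as we shall see below, is to introduce additional levels at energies
between the top of the valence band, $\mathcal{E}_v$, and the bottom of the conduction band, $\mathcal{E}_c$,
without, however, appreciably altering the form of $g_c(\mathcal{E})$ and $g_v(\mathcal{E})$. Since conduction
is entirely due to electrons in conduction band levels or holes in valence band levels,
regardless of the concentration of impurities the numbers of carriers present at
temperature T will be given by

$$
\begin{aligned}
n_c(T) &= \int_{\mathcal{E}_c}^{\infty} d\mathcal{E}\, g_c(\mathcal{E})\, \frac{1}{e^{(\mathcal{E}-\mu)/k_BT} + 1}, \\
p_v(T) &= \int_{-\infty}^{\mathcal{E}_v} d\mathcal{E}\, g_v(\mathcal{E}) \left(1 - \frac{1}{e^{(\mathcal{E}-\mu)/k_BT} + 1}\right)
        = \int_{-\infty}^{\mathcal{E}_v} d\mathcal{E}\, g_v(\mathcal{E})\, \frac{1}{e^{(\mu-\mathcal{E})/k_BT} + 1}.
\end{aligned}
\tag{28.9}
$$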


10  Hole  densities  are  conventionally  denoted  by  the  letter  p  (for  positive).  This  widely  used  notation 
exploits  the  coincidence  that  the  n  denoting  the  number  density  of  electrons  can  also  be  regarded  as  standing 
for  “negative.” 




Impurities affect the determination of $n_c$ and $p_v$ only through the value of the
chemical potential11 $\mu$ to be used in Eq. (28.9). To determine $\mu$ one must know
something about the impurity levels. However, one can extract some useful information
from (28.9) which is independent of the precise value of the chemical potential,
provided only that it satisfies the conditions:

$$
\begin{aligned}
\mathcal{E}_c - \mu &\gg k_BT, \\
\mu - \mathcal{E}_v &\gg k_BT.
\end{aligned}
\tag{28.10}
$$

There will be a range of values of $\mu$ for which (28.10) holds even for energy gaps
$E_g = \mathcal{E}_c - \mathcal{E}_v$ as small as a few tenths of an electron volt and temperatures as high
as room temperature. Our procedure will be to assume the validity of (28.10), use it
to simplify (28.9), and then, from the values of $n_c$ and $p_v$ so obtained and the appropriate
information about possible impurity levels, compute the actual value of the
chemical potential to check whether it does indeed lie in the range given by (28.10).
If  it  does,  the  semiconductor  is  described  as  “nondegenerate,”  and  the  procedure  is 
a  valid  one.  If  it  does  not,  one  is  dealing  with  a  “degenerate  semiconductor”  and  must 
work  directly  with  Eq.  (28.9)  without  making  the  simplifications  implied  by  (28.10). 
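How much is lost by replacing the Fermi factor with a simple exponential when a level lies several $k_BT$ from the chemical potential can be checked directly. The sketch below is illustrative only (the function names are not from the text); it uses the dimensionless variable x = (E − μ)/k_BT and shows that the relative error of the exponential form is itself e^(−x).

```python
import math

def fermi_factor(x):
    """Fermi function 1/(e^x + 1), with x = (E - mu)/(k_B T)."""
    return 1.0 / (math.exp(x) + 1.0)

def boltzmann_factor(x):
    """Exponential approximation e^(-x), good when x >> 1."""
    return math.exp(-x)

def relative_error(x):
    """Fractional overestimate made by using the exponential form."""
    return (boltzmann_factor(x) - fermi_factor(x)) / fermi_factor(x)

# Error of the approximation 3, 5, and 10 k_B T away from the chemical potential.
errors = {x: relative_error(x) for x in (3.0, 5.0, 10.0)}
```

At x = 3 the error is about 5 percent, and at x = 10 it is below one part in ten thousand, which is why the nondegenerate approximation is so accurate once (28.10) holds.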

11 It is the widespread practice to refer to the chemical potential of a semiconductor as “the Fermi
level,” a somewhat unfortunate terminology. Since the chemical potential almost always lies in the energy
gap, there is no one-electron level whose energy is actually at “the Fermi level” (in contrast to the case
of a metal). Thus the usual definition of the Fermi level (that energy below which the one-electron levels
are occupied and above which they are unoccupied in the ground state of a metal) does not specify a unique
energy in the case of a semiconductor: Any energy in the gap separates occupied from unoccupied levels
at T = 0. The term “Fermi level” should be regarded as nothing more than a synonym for “chemical
potential,” in the context of semiconductors.


Given Eq. (28.10), then since every conduction band level exceeds $\mathcal{E}_c$ and every
valence band level is less than $\mathcal{E}_v$, we may simplify the statistical factors in (28.9):

$$
\begin{aligned}
\frac{1}{e^{(\mathcal{E}-\mu)/k_BT} + 1} &\approx e^{-(\mathcal{E}-\mu)/k_BT}, & \mathcal{E} &> \mathcal{E}_c; \\
\frac{1}{e^{(\mu-\mathcal{E})/k_BT} + 1} &\approx e^{-(\mu-\mathcal{E})/k_BT}, & \mathcal{E} &< \mathcal{E}_v.
\end{aligned}
\tag{28.11}
$$

Equations (28.9) thereby reduce to

$$
\begin{aligned}
n_c(T) &= N_c(T)\, e^{-(\mathcal{E}_c-\mu)/k_BT}, \\
p_v(T) &= P_v(T)\, e^{-(\mu-\mathcal{E}_v)/k_BT},
\end{aligned}
\tag{28.12}
$$

where

$$
\begin{aligned}
N_c(T) &= \int_{\mathcal{E}_c}^{\infty} d\mathcal{E}\, g_c(\mathcal{E})\, e^{-(\mathcal{E}-\mathcal{E}_c)/k_BT}, \\
P_v(T) &= \int_{-\infty}^{\mathcal{E}_v} d\mathcal{E}\, g_v(\mathcal{E})\, e^{-(\mathcal{E}_v-\mathcal{E})/k_BT}.
\end{aligned}
\tag{28.13}
$$

Because the ranges of integration in (28.13) include the points where the arguments
of the exponentials vanish, $N_c(T)$ and $P_v(T)$ are relatively slowly varying functions
of temperature, compared with the exponential factors they multiply in (28.12). This
is their most important feature. Usually, however, one can evaluate them explicitly.
Because of the exponential factors in the integrands of (28.13) only energies within
$k_BT$ of the band edges contribute appreciably, and in this range the quadratic approximation,
(28.2) or (28.3), is generally excellent. The level densities can then be taken
to be (Problem 3):

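$$g_{c,v}(\mathcal{E}) = \sqrt{2\,|\mathcal{E}-\mathcal{E}_{c,v}|}\;\frac{m_{c,v}^{3/2}}{\hbar^3\pi^2}, \tag{28.14}$$

and the integrals (28.13) then give

$$
\begin{aligned}
N_c(T) &= \frac{1}{4}\left(\frac{2m_ck_BT}{\pi\hbar^2}\right)^{3/2}, \\
P_v(T) &= \frac{1}{4}\left(\frac{2m_vk_BT}{\pi\hbar^2}\right)^{3/2}.
\end{aligned}
\tag{28.15}
$$

Here $m_c^3$ is the product of the principal values of the conduction band effective mass
tensor (i.e., its determinant),12 and $m_v^3$ is similarly defined.

Equation (28.15) can be cast in the numerically convenient forms:

$$
\begin{aligned}
N_c(T) &= 2.5\left(\frac{m_c}{m}\right)^{3/2}\left(\frac{T}{300\ \mathrm{K}}\right)^{3/2} \times 10^{19}/\mathrm{cm}^3, \\
P_v(T) &= 2.5\left(\frac{m_v}{m}\right)^{3/2}\left(\frac{T}{300\ \mathrm{K}}\right)^{3/2} \times 10^{19}/\mathrm{cm}^3,
\end{aligned}
\tag{28.16}
$$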


where T is to be measured in degrees Kelvin. Since the exponential factors in (28.12)
are less than unity by at least an order of magnitude, and since $m_c/m$ and $m_v/m$ are
typically of the order of unity, Eq. (28.16) indicates that $10^{18}$ or $10^{19}$ carriers/cm³ is an
absolute upper limit to the carrier concentration in a nondegenerate semiconductor.
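The numerical coefficient in (28.16) can be checked against (28.15) with a few lines of arithmetic. This is a sketch in CGS units; the constant values are standard rounded figures and the function name `band_edge_density` is an illustrative assumption.

```python
import math

HBAR = 1.0546e-27       # Planck's constant / 2 pi, erg s
K_B = 1.3807e-16        # Boltzmann constant, erg/K
M_ELECTRON = 9.109e-28  # free electron mass, g

def band_edge_density(mass_ratio, T):
    """N_c(T) or P_v(T) of Eq. (28.15) in cm^-3.

    mass_ratio is m_c/m (or m_v/m); T is in kelvin.
    """
    x = 2.0 * mass_ratio * M_ELECTRON * K_B * T / (math.pi * HBAR ** 2)
    return 0.25 * x ** 1.5

n_300 = band_edge_density(1.0, 300.0)          # unit mass ratio, room temperature
ratio = band_edge_density(1.0, 600.0) / n_300  # T^(3/2) scaling
```

For a unit mass ratio at 300 K this gives about 2.5 × 10¹⁹ cm⁻³, and doubling the temperature multiplies the result by 2^(3/2).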

We still cannot infer $n_c(T)$ and $p_v(T)$ from (28.12) until we know the value of the
chemical potential $\mu$. However, the $\mu$ dependence disappears from the product of the
two densities:

$$n_c p_v = N_c P_v\, e^{-(\mathcal{E}_c-\mathcal{E}_v)/k_BT} = N_c P_v\, e^{-E_g/k_BT}. \tag{28.17}$$


This  result  (sometimes  called  the  “law  of  mass  action”13)  means  that  at  a  given 
temperature  it  suffices  to  know  the  density  of  one  carrier  type  to  determine  that  of 
the  other.  How  this  determination  is  made  depends  on  how  important  the  impurities 
are  as  a  source  of  carriers. 


12 If there is more than one conduction band minimum one must add together terms of the form
(28.14) and (28.15) for each minimum. These sums will continue to have the same forms as (28.14) and
(28.15), provided that the definition of $m_c$ is altered to $m_c^{3/2} \to \sum m_c^{3/2}$.

13 The analogy with chemical reactions is quite precise: A carrier is provided by the dissociation
of a combined electron and hole.


Intrinsic Case

If  the  crystal  is  so  pure  that  impurities  contribute  negligibly  to  the  carrier  densities, 
one  speaks  of  an  “intrinsic  semiconductor.”  In  the  intrinsic  case,  conduction  band 
electrons  can  only  have  come  from  formerly  occupied  valence  band  levels,  leaving 
holes  behind  them.  The  number  of  conduction  band  electrons  is  therefore  equal  to 
the  number  of  valence  band  holes: 

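$$n_c(T) = p_v(T) \equiv n_i(T). \tag{28.18}$$

Since $n_c = p_v$, we may write their common value $n_i$ as $(n_c p_v)^{1/2}$. Equation (28.17)
then gives

$$n_i(T) = \left[N_c(T)P_v(T)\right]^{1/2} e^{-E_g/2k_BT}, \tag{28.19}$$

or, from (28.15) and (28.16):

$$
\begin{aligned}
n_i(T) &= \frac{1}{4}\left(\frac{2k_BT}{\pi\hbar^2}\right)^{3/2} (m_c m_v)^{3/4}\, e^{-E_g/2k_BT} \\
       &= 2.5\left(\frac{m_c}{m}\right)^{3/4}\left(\frac{m_v}{m}\right)^{3/4}\left(\frac{T}{300\ \mathrm{K}}\right)^{3/2} e^{-E_g/2k_BT} \times 10^{19}/\mathrm{cm}^3.
\end{aligned}
\tag{28.20}
$$

We may now establish in the intrinsic case the condition for the validity of assumption
(28.10) on which our analysis has been based. Defining $\mu_i$ to be the value of the
chemical potential in the intrinsic case, we find that Eqs. (28.12) give values of $n_c$ and
$p_v$ equal to $n_i$ (Eq. (28.19)), provided that

$$\mu = \mu_i = \mathcal{E}_v + \tfrac{1}{2}E_g + \tfrac{1}{2}k_BT \ln\frac{P_v}{N_c}, \tag{28.21}$$

or, from Eq. (28.15),

$$\mu_i = \mathcal{E}_v + \tfrac{1}{2}E_g + \tfrac{3}{4}k_BT \ln\frac{m_v}{m_c}. \tag{28.22}$$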

This asserts that as T → 0, the chemical potential $\mu_i$ lies precisely in the middle
of the energy gap. Furthermore, since $\ln(m_v/m_c)$ is a number of order unity, $\mu_i$ will
not wander from the center of the energy gap by more than order $k_BT$. Consequently,
at temperatures $k_BT$ small compared with $E_g$, the chemical potential will be found
far from the boundaries of the forbidden region, $\mathcal{E}_c$ and $\mathcal{E}_v$, compared with $k_BT$ (Figure
28.10), and the condition for nondegeneracy (28.10) will be satisfied. Therefore (28.20)
is a valid evaluation of the common value of $n_c$ and $p_v$ in the intrinsic case, provided
only that $E_g$ is large compared with $k_BT$, a condition that is satisfied in almost all
semiconductors at room temperature and below.
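To get a feeling for the magnitudes involved, the numerical form of (28.20) and the result (28.22) can be evaluated directly. In this sketch the room temperature gaps are those of Table 28.2, but the effective mass ratios are set to unity purely for illustration (an assumption, not measured values), so the densities are order-of-magnitude estimates only. The function names are hypothetical.

```python
import math

K_B_EV = 8.617e-5  # Boltzmann constant, eV/K

def intrinsic_density(gap_ev, mc_ratio, mv_ratio, T):
    """n_i in cm^-3 from the numerical form of Eq. (28.20)."""
    prefactor = 2.5e19 * (mc_ratio * mv_ratio) ** 0.75 * (T / 300.0) ** 1.5
    return prefactor * math.exp(-gap_ev / (2.0 * K_B_EV * T))

def intrinsic_mu_above_valence_edge(gap_ev, mc_ratio, mv_ratio, T):
    """mu_i - E_v in eV from Eq. (28.22)."""
    return 0.5 * gap_ev + 0.75 * K_B_EV * T * math.log(mv_ratio / mc_ratio)

# Gaps from Table 28.2; unit mass ratios are an illustrative assumption.
n_i_si = intrinsic_density(1.12, 1.0, 1.0, 300.0)
n_i_ge = intrinsic_density(0.67, 1.0, 1.0, 300.0)
mu_mid = intrinsic_mu_above_valence_edge(1.12, 1.0, 1.0, 300.0)
mu_shifted = intrinsic_mu_above_valence_edge(1.12, 1.0, math.e, 300.0)
```

Under these assumptions the intrinsic density is of order 10¹⁰ cm⁻³ for the silicon gap and several times 10¹³ cm⁻³ for the germanium gap. Even a mass ratio as large as e moves the chemical potential off midgap by only (3/4)$k_BT$.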

Figure 28.10

In an intrinsic semiconductor with an energy gap $E_g$ large compared with $k_BT$, the
chemical potential $\mu$ lies within order $k_BT$ of the center of the energy gap, and is
therefore far compared with $k_BT$ from both boundaries of the gap at $\mathcal{E}_c$ and $\mathcal{E}_v$.


Extrinsic Case: Some General Features

If impurities contribute a significant fraction of the conduction band electrons and/or
valence band holes, one speaks of an “extrinsic semiconductor.” Because of these
added sources of carriers the density of conduction band electrons need no longer
be equal to the density of valence band holes:

$$n_c - p_v = \Delta n \neq 0. \tag{28.23}$$

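Since the law of mass action Eq. (28.17) holds regardless of the importance of
impurities, we can use the definition (28.19) of $n_i(T)$ to write quite generally,

$$n_c p_v = n_i^2. \tag{28.24}$$

Equations (28.24) and (28.23) permit one to express the carrier densities in the extrinsic
case in terms of their intrinsic values $n_i$ and the deviation $\Delta n$ from intrinsic behavior:

$$\left.\begin{matrix} n_c \\ p_v \end{matrix}\right\} = \tfrac{1}{2}\left[(\Delta n)^2 + 4n_i^2\right]^{1/2} \pm \tfrac{1}{2}\Delta n. \tag{28.25}$$

The quantity $\Delta n/n_i$, which measures the importance of the impurities as a source
of carriers, can be given a particularly simple expression as a function of chemical
potential $\mu$, if we note that Eqs. (28.12) have the form14 (with $\beta = 1/k_BT$)

$$n_c = e^{\beta(\mu-\mu_i)}\, n_i; \qquad p_v = e^{-\beta(\mu-\mu_i)}\, n_i. \tag{28.26}$$

Therefore

$$\frac{\Delta n}{n_i} = 2\sinh\beta(\mu-\mu_i). \tag{28.27}$$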


14 To verify these relations one need not substitute the explicit definitions of $n_i$ and $\mu_i$; it is enough
to note that $n_c$ and $p_v$ are proportional to $\exp(\beta\mu)$ and $\exp(-\beta\mu)$, respectively, and that both reduce to
$n_i$ when $\mu = \mu_i$.

We have noted that if the energy gap $E_g$ is large compared with $k_BT$, then the
intrinsic chemical potential $\mu_i$ will satisfy the assumption (28.10) of nondegeneracy.
But Eq. (28.27) requires that if $\mu_i$ is far from $\mathcal{E}_c$ or $\mathcal{E}_v$ on the scale of $k_BT$, then $\mu$ must be
as well, unless $\Delta n$ is many orders of magnitude larger than the intrinsic carrier density
$n_i$. Thus the nondegeneracy assumption underlying the derivation of (28.27) is valid
when $E_g \gg k_BT$, unless we are in a region of extreme extrinsic behavior.

Note also that when $\Delta n$ is large compared with $n_i$, then Eq. (28.25) asserts that
the density of one carrier type is essentially equal to $\Delta n$, while that of the other type
is smaller by a factor of order $(n_i/\Delta n)^2$. Thus when impurities do provide the major
source of carriers, one of the two carrier types will be dominant. An extrinsic semiconductor
is called “n-type” or “p-type” according to whether the dominant carriers
are electrons or holes.

To complete the specification of the carrier densities in extrinsic semiconductors
one must determine $\Delta n$ or $\mu$. To do this we must examine the nature of the electronic
levels introduced by the impurities and the statistical mechanics of the occupation
of these levels in thermal equilibrium.
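Relations (28.24), (28.25), and (28.27) can be collected into a small calculation. The sketch below is illustrative: the function names and the sample densities are assumptions. To avoid subtracting two nearly equal numbers when Δn is much larger than the intrinsic density, the minority density is obtained from the law of mass action rather than from the minus sign in (28.25); the two routes are algebraically equivalent.

```python
import math

K_B_EV = 8.617e-5  # Boltzmann constant, eV/K

def extrinsic_densities(delta_n, n_i):
    """Return (n_c, p_v) from Eq. (28.25), given delta_n = n_c - p_v."""
    root = math.sqrt(delta_n ** 2 + 4.0 * n_i ** 2)
    majority = 0.5 * root + 0.5 * abs(delta_n)
    minority = n_i * n_i / majority  # law of mass action, Eq. (28.24)
    return (majority, minority) if delta_n >= 0 else (minority, majority)

def mu_shift_ev(delta_n, n_i, T):
    """mu - mu_i in eV, from inverting Eq. (28.27)."""
    return K_B_EV * T * math.asinh(delta_n / (2.0 * n_i))

# Hypothetical n-type sample: delta_n exceeds n_i by six orders of magnitude.
n_i, delta_n, T = 1.0e10, 1.0e16, 300.0
n_c, p_v = extrinsic_densities(delta_n, n_i)
shift = mu_shift_ev(delta_n, n_i, T)
```

For these numbers the electron density is essentially Δn, the hole density is smaller than that by a factor of about $(n_i/\Delta n)^2$, and the chemical potential sits roughly 14 $k_BT$ above its intrinsic value.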


IMPURITY  LEVELS 

Impurities  that  contribute  to  the  carrier  density  of  a  semiconductor  are  called  donors 
if  they  supply  additional  electrons  to  the  conduction  band,  and  acceptors  if  they 
supply  additional  holes  to  (i.e.,  capture  electrons  from)  the  valence  band.  Donor 
impurities  are  atoms  that  have  a  higher  chemical  valence  than  the  atoms  making 
up  the  pure  (host)  material,  while  acceptors  have  a  lower  chemical  valence. 

Consider, for example, the case of substitutional impurities in a group IV semiconductor.
Suppose that we take a crystal of pure germanium, and replace an
occasional  germanium  atom  by  its  neighbor  to  the  right  in  the  periodic  table,  arsenic 
(Figure  28.11).  The  germanium  ion  has  charge  4e  and  contributes  four  valence 
electrons,  while  the  arsenic  ion  has  charge  5e  and  contributes  five  valence  electrons. 
If, to a first approximation, we ignore the difference in structure between the arsenic
and germanium ion cores, we can represent the substitution of an arsenic atom for
a germanium atom by a slightly less drastic modification, in which the germanium
atom is not removed, but an additional fixed positive charge of e is placed at its
site, along with an additional electron.


Figure 28.11

(a) Schematic representation of a substitutional arsenic (valence 5) donor impurity in a
germanium (valence 4) crystal. (b) The arsenic (As) can be represented as a germanium
atom plus an additional unit of positive charge fixed at the site of the atom (circled dot). (c)
In the semiclassical approximation, in which the pure semiconductor is treated as a homogeneous
medium, the arsenic impurity is represented as a fixed point charge +e (dot).

This is the general model for a semiconductor doped with donor impurities.
Distributed irregularly15 throughout the perfect pure crystal are $N_d$ fixed attractive
centers of charge +e, per unit volume, along with the same number of additional
electrons. As expected, each such center of charge +e can bind16 one of the additional
electrons of charge −e. If the impurity were not embedded in the semiconductor,
but in empty space, the binding energy of the electron would just be the first ionization
potential of the impurity atom, 9.81 eV for arsenic. However (and this is of crucial
importance in the theory of semiconductors), since the impurity is embedded in the
medium of the pure semiconductor, this binding energy is enormously reduced
(to 0.013 eV for the case of arsenic in germanium). This happens for two reasons:

1. The field of the charge representing the impurity must be reduced by the static
dielectric constant $\epsilon$ of the semiconductor.17 Static dielectric constants of
semiconductors are quite large ($\epsilon \approx 16$ in germanium), being typically between
about 10 and 20 but ranging in some cases
as high as 100 or more. The large dielectric constants are consequences of the
small energy gaps. If there were no overall energy gap, the crystal would be a
metal instead of a semiconductor, and the static dielectric constant would be
infinite, reflecting the fact that a static electric field can induce a current in which
electrons move arbitrarily far from their original positions. If the energy gap is
not zero, but small, then the dielectric constant will not be infinite, but can be
quite large, reflecting the relative ease with which the spatial distribution of
electrons can be deformed.18

2. An electron moving in the medium of the semiconductor should be described
not by the free space energy-momentum relation, but by the semiclassical relation
(Chapter 12) $\mathcal{E}(\mathbf{k}) = \mathcal{E}_c(\mathbf{k})$, where $\hbar\mathbf{k}$ is the electronic crystal momentum, and $\mathcal{E}_c(\mathbf{k})$
is the conduction band energy-momentum relation; i.e., the additional electron
introduced by the impurity should be thought of as being in a superposition of
conduction band levels of the pure host material, which is appropriately altered
by the additional localized charge +e representing the impurity. The electron
can minimize its energy by using only levels near the bottom of the conduction
band, for which the quadratic approximation (28.2) is valid. Should the conduction
band minimum be at a point of cubic symmetry, the electron would then behave
very much like a free electron, but with an effective mass that differs from the free
electron mass m. More generally, the energy wave vector relation will be some
anisotropic quadratic function of k. In either case, however, to a first approximation,
we may represent the electron as moving in free space but with a mass
given by some appropriately defined effective mass m*, rather than the free
electron mass. In general, this mass will be smaller than the free electron mass,
often by a factor of 0.1 or even less.


15 Under very special circumstances it may be possible for the impurities themselves to be regularly
arranged in space. We shall not consider this possibility here.

16 As we shall see, the binding is quite weak, and the electrons bound to the center are readily liberated
by thermal excitation.

17 This use of macroscopic electrostatics in describing the binding of a single electron is justified by
the fact (established below) that the wave function of the bound electron extends over many hundreds
of angstroms.

18 The connection between small energy gaps and large dielectric constants can also be understood
from the point of view of perturbation theory: The size of the dielectric constant is a measure of the extent
to which a weak electric field distorts the electronic wave function. But a small energy gap means there
will be small energy denominators, and hence large changes, in the first-order wave functions.

These two observations suggest that we may represent an electron in the presence
of a donor impurity of charge e within the medium of the semiconductor, as a particle
of charge −e and mass m*, moving in free space in the presence of an attractive
center of charge $e/\epsilon$. This is precisely the problem of a hydrogen atom, except that
the product $-e^2$ of the nuclear and electronic charges must be replaced by $-e^2/\epsilon$, and
the free electron mass m, by m*. Thus the radius of the first Bohr orbit, $a_0 = \hbar^2/me^2$,
becomes

$$r_0 = \frac{m}{m^*}\,\epsilon\, a_0, \tag{28.28}$$

and the ground-state binding energy, $me^4/2\hbar^2 = 13.6$ eV, becomes

$$E = \frac{m^*}{m}\,\frac{1}{\epsilon^2} \times 13.6\ \mathrm{eV}. \tag{28.29}$$

For reasonable values of $m^*/m$ and $\epsilon$, the radius $r_0$ can be 100 Å or more. This
is very important for the consistency of the entire argument, for both the use of the
semiclassical model and the use of the macroscopic dielectric constant are predicated
on the assumption that the fields being described vary slowly on the scale of a lattice
constant.

Furthermore, typical values of $m^*/m$ and $\epsilon$ can lead to a binding energy E smaller
than 13.6 eV by a factor of a thousand or more. Indeed, since small energy gaps are
generally associated with large dielectric constants, it is almost always the case
that the binding energy of an electron to a donor impurity is small compared with the
energy gap of the semiconductor. Since this binding energy is measured relative to
the energy of the conduction band levels from which the bound impurity level is
formed, we conclude that donor impurities introduce additional electronic levels
at energies $\mathcal{E}_d$ which are lower than the energy $\mathcal{E}_c$ at the bottom of the conduction
band by an amount that is small compared with the energy gap $E_g$ (Figure 28.12).
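The scaling in (28.28) and (28.29) is simple enough to tabulate. In this sketch the dielectric constant 16 is the germanium value quoted above and the mass ratio 0.1 is the representative figure mentioned in the text; the function names and the rounded Bohr radius are illustrative assumptions.

```python
BOHR_RADIUS_ANGSTROM = 0.529  # a_0, rounded
RYDBERG_EV = 13.6             # hydrogen ground-state binding energy

def donor_radius(mass_ratio, dielectric_constant):
    """Radius r_0 of Eq. (28.28) in angstroms; mass_ratio is m*/m."""
    return dielectric_constant / mass_ratio * BOHR_RADIUS_ANGSTROM

def donor_binding_energy(mass_ratio, dielectric_constant):
    """Binding energy E of Eq. (28.29) in eV."""
    return mass_ratio / dielectric_constant ** 2 * RYDBERG_EV

r0 = donor_radius(0.1, 16.0)                  # about 85 angstroms
binding = donor_binding_energy(0.1, 16.0)     # about 0.005 eV
reduction = RYDBERG_EV / binding              # factor by which 13.6 eV is reduced
```

With these inputs the orbit spans many lattice constants and the binding energy is reduced by a factor of 2560, the same order as the measured donor levels in germanium listed in Table 28.2.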


Figure 28.12

Level density $g(\mathcal{E})$ for a semiconductor containing both
donor and acceptor impurities. The donor levels $\mathcal{E}_d$ are
generally close to the bottom of the conduction band,
$\mathcal{E}_c$, compared with $E_g$, and the acceptor levels, $\mathcal{E}_a$, are
generally close to the top of the valence band, $\mathcal{E}_v$.


A similar argument can be applied to acceptor impurities, whose valence is one
less than that of the host atoms (e.g., gallium in germanium). Such an impurity can
be represented by the superimposition of a fixed charge −e on top of a host atom,
along with the presence of one less electron in the crystal. The missing electron can
be represented as a bound hole, attracted by the excess negative charge representing
the impurity, with a binding energy that is again small19 on the scale of the energy gap,
$E_g$. In terms of the electron picture this bound hole will be manifested as an additional
electronic level at an energy $\mathcal{E}_a$ lying slightly above the top of the valence band
(Figure 28.12). The hole is bound when the level is empty. The binding energy of the
hole is just the energy $\mathcal{E}_a - \mathcal{E}_v$ necessary to excite an electron from the top of the
valence band into the acceptor level, thereby filling the hole in the vicinity of the
acceptor and creating a free hole in the valence band.

Table 28.2

LEVELS OF GROUP V (DONORS) AND GROUP III (ACCEPTORS)
IMPURITIES IN SILICON AND GERMANIUM

GROUP III ACCEPTORS (TABLE ENTRY IS $\mathcal{E}_a - \mathcal{E}_v$, IN eV)

      B        Al       Ga       In       Tl
Si    0.046    0.057    0.065    0.16     0.26
Ge    0.0104   0.0102   0.0108   0.0112   0.01

GROUP V DONORS (TABLE ENTRY IS $\mathcal{E}_c - \mathcal{E}_d$, IN eV)

      P        As       Sb       Bi
Si    0.044    0.049    0.039    0.069
Ge    0.0120   0.0127   0.0096   -

ROOM TEMPERATURE ENERGY GAPS ($E_g = \mathcal{E}_c - \mathcal{E}_v$)

Si    1.12 eV
Ge    0.67 eV

Source: P. Aigrain and M. Balkanski, Selected Constants Relative to Semiconductors,
Pergamon, New York, 1961.


The  single  most  important  fact  about  these  donor  and  acceptor  levels  is  that  they 
lie  very  close  to  the  boundaries  of  the  forbidden  energy  region.20  It  is  far  easier 
thermally  to  excite  an  electron  into  the  conduction  band  from  a  donor  level,  or  a 
hole  into  the  valence  band  from  an  acceptor  level,  than  it  is  to  excite  an  electron 
across the entire energy gap from valence to conduction band. Unless the concentration
of donor and acceptor impurities is very small, they will therefore be a far more
important source of carriers than the intrinsic mechanism of exciting carriers across
the full gap.


19 For the same reasons as in the case of donor impurities, the binding energy of the hole is quite
weak; i.e., valence band electrons are readily lifted into the acceptor level by thermal excitation.

20 Some measured donor and acceptor levels are given in Table 28.2.


POPULATION OF IMPURITY LEVELS IN THERMAL EQUILIBRIUM

To  assess  the  extent  to  which  carriers  can  be  thermally  excited  from  impurity  levels, 
we  must  compute  the  mean  number  of  electrons  in  the  levels  at  a  given  temperature 
and  chemical  potential.  We  assume  that  the  density  of  impurities  is  low  enough 
that  the  interaction  of  electrons  (or  holes)  bound  at  different  impurity  sites  is  negligible. 
We may then calculate the number density of electrons $n_d$ (or holes $p_a$) bound to
donor (or acceptor) sites by simply multiplying by the density of donors $N_d$ (or
acceptors $N_a$) the mean number of electrons (or holes) there would be if there were
only a single impurity. For simplicity we assume that the impurity introduces only
a single one-electron orbital level.21 We calculate its mean occupancy as follows:

Donor  Level  If  we  ignored  electron-electron  interactions  the  level  could  either  be 
empty,  could  contain  one  electron  of  either  spin,  or  two  electrons  of  opposite  spins. 
However,  the  Coulomb  repulsion  of  two  localized  electrons  raises  the  energy  of  the 
doubly  occupied  level  so  high  that  double  occupation  is  essentially  prohibited. 
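Quite generally, the mean number of electrons in a system in thermal equilibrium
is given by:

$$\langle n \rangle = \frac{\sum_j N_j\, e^{-\beta(E_j - \mu N_j)}}{\sum_j e^{-\beta(E_j - \mu N_j)}}, \tag{28.30}$$

where the sum is over all states of the system, $E_j$ and $N_j$ are the energy and number
of electrons in state j, and $\mu$ is the chemical potential. In the present case the system
is a single impurity with just three states: one with no electrons present which makes
no contribution to the energy, and two with a single electron present of energy $\mathcal{E}_d$.
Therefore (28.30) gives

$$\langle n \rangle = \frac{2e^{-\beta(\mathcal{E}_d-\mu)}}{1 + 2e^{-\beta(\mathcal{E}_d-\mu)}} = \frac{1}{\tfrac{1}{2}e^{\beta(\mathcal{E}_d-\mu)} + 1}, \tag{28.31}$$

so that22

$$n_d = \frac{N_d}{\tfrac{1}{2}e^{\beta(\mathcal{E}_d-\mu)} + 1}. \tag{28.32}$$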


Acceptor Level In contrast to a donor level, an acceptor level, when viewed as an
electronic level, can be singly or doubly occupied, but not empty. This is easily seen
from the hole point of view. An acceptor impurity can be regarded as a fixed, negatively
charged attractive center superimposed on an unaltered host atom. This additional
charge −e can weakly bind one hole (corresponding to one electron being in the
acceptor level). The binding energy of the hole is $\mathcal{E}_a - \mathcal{E}_v$, and when the hole is
“ionized” an additional electron moves into the acceptor level. However, the configuration
in which no electrons are in the acceptor level corresponds to two holes
being localized in the presence of the acceptor impurity, which has a very high
energy due to the mutual Coulomb repulsion of the holes.23


21 There is no general reason why a donor site cannot have more than one bound level, and we assume
a single one only to simplify our discussion. Our qualitative conclusions, however, are quite general
(see Problem 4c).

22 Some insight into the curious factor of ½ that emerges in (28.32) in contrast to the more familiar
distribution function of Fermi-Dirac statistics can be gained by examining what happens as the energy
of the doubly occupied level drops from +∞ down to $2\mathcal{E}_d$. See Problem 4.

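Bearing this in mind, we can calculate the mean number of electrons at an acceptor
level from (28.30) by noting that the state with no electrons is now prohibited, while
the two-electron state has an energy that is $\mathcal{E}_a$ higher than the two one-electron states.
Therefore

$$\langle n \rangle = \frac{2e^{\beta\mu} + 2e^{-\beta(\mathcal{E}_a - 2\mu)}}{2e^{\beta\mu} + e^{-\beta(\mathcal{E}_a - 2\mu)}} = \frac{e^{\beta(\mu-\mathcal{E}_a)} + 1}{\tfrac{1}{2}e^{\beta(\mu-\mathcal{E}_a)} + 1}. \tag{28.33}$$

The mean number of holes in the acceptor level is the difference between the
maximum number of electrons the level can hold (two) and the actual mean number
of electrons in the level ($\langle n \rangle$): $\langle p \rangle = 2 - \langle n \rangle$, and therefore $p_a = N_a\langle p \rangle$ is given by

$$p_a = \frac{N_a}{\tfrac{1}{2}e^{\beta(\mu-\mathcal{E}_a)} + 1}. \tag{28.34}$$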
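Both occupancy formulas follow from the same sum (28.30) over a short list of allowed states, which makes them easy to verify numerically. The sketch below is illustrative (the function names and sample energies are assumptions); it enumerates the states described in the text for each kind of impurity and compares the result with the closed forms in (28.32) and (28.34), taken per impurity.

```python
import math

def mean_electron_number(states, mu, beta):
    """Eq. (28.30); states is a list of (energy, electron_number) pairs."""
    weights = [math.exp(-beta * (energy - mu * n)) for energy, n in states]
    total = sum(weights)
    return sum(n * w for (_, n), w in zip(states, weights)) / total

def donor_mean_electrons(eps_d, mu, beta):
    # Empty level, or one electron of either spin; double occupancy excluded.
    return mean_electron_number([(0.0, 0), (eps_d, 1), (eps_d, 1)], mu, beta)

def acceptor_mean_holes(eps_a, mu, beta):
    # One electron of either spin (energy taken as zero), or two electrons
    # with energy eps_a; the state with no electrons is excluded.
    n = mean_electron_number([(0.0, 1), (0.0, 1), (eps_a, 2)], mu, beta)
    return 2.0 - n

def closed_form(x):
    """1 / (e^x / 2 + 1), the common shape of (28.32) and (28.34)."""
    return 1.0 / (0.5 * math.exp(x) + 1.0)

beta = 40.0  # 1/(k_B T) in 1/eV, roughly room temperature
donor = donor_mean_electrons(0.30, 0.25, beta)
acceptor = acceptor_mean_holes(0.01, 0.05, beta)
```

When the chemical potential coincides with the impurity level, either formula gives an occupancy of 2/3 rather than the 1/2 of an ordinary Fermi-Dirac level, which is the effect of the spin degeneracy of the singly occupied state.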
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SEMICONDUCTORS 


Consider  a  semiconductor  doped  with  Nd  donor  impurities  and  N„  acceptor  impurities 
per  unit  volume.  To  determine  the  carrier  densities  we  must  generalize  the  constraint 
nc  =  pv  (Eq.  (28.18))  that  enabled  us  to  find  these  densities  in  the  intrinsic  (pure) 
case.  We  can  do  this  by  first  considering  the  electronic  configuration  at  T  =  0. 
Suppose  Nd  >  Na.  (The  case  Nd  <  Na  is  equally  straightforward  and  leads  to  the 
same  result  (28.35).)  Then  in  a  unit  volume  of  semiconductor  Na  of  the  Nd  electrons 
supplied  by  the  donor  impurities  can  drop  from  the  donor  levels  into  the  acceptor 
levels.24  This  gives  a  ground-state  electronic  configuration  in  which  the  valence 
band  and  acceptor  levels  are  filled,  Nd  —  Na  of  the  donor  levels  are  filled,  and  the 
conduction  band  levels  are  empty.  In  thermal  equilibrium  at  temperature  T  the 
electrons  will  be  redistributed  among  these  levels,  but  since  their  total  number  remains 
the  same,  the  number  of  electrons  in  conduction  band  or  donor  levels,  nc  +  nd,  must 
exceed  its  value  at  T  =  0,  Nd  —  Na,  by  precisely  the  number  of  empty  levels  (i.e., 
holes), $p_v + p_a$, in the valence band and acceptor levels:

$$n_c + n_d = N_d - N_a + p_v + p_a. \tag{28.35}$$


23  When  describing  acceptor  levels  as  electronic  levels  one  usually  ignores  the  electron  that  must  be 
in  the  level,  considering  only  the  presence  or  absence  of  the  second  electron.  One  describes  the  level  as 
empty  or  filled  according  to  whether  the  second  electron  is  absent  or  present. 

24 Since $\mathcal{E}_d$ is just below the conduction band minimum $\mathcal{E}_c$, and $\mathcal{E}_a$ is just above the valence band
maximum, $\mathcal{E}_v$, we have $\mathcal{E}_d > \mathcal{E}_a$ (see Figure 28.12).
This equation, together with the explicit forms we have found for $n_c$, $p_v$, $n_d$, and $p_a$
as functions of $\mu$ and $T$, permits one to find $\mu$ as a function of $T$, and therefore to
find  the  thermal  equilibrium  carrier  densities  at  any  temperature.  A  general  analysis 
is  rather  complicated,  and  we  consider  here  only  a  particularly  simple  and  important 
case: 

Suppose that

$$\mathcal{E}_d - \mu \gg k_B T, \qquad \mu - \mathcal{E}_a \gg k_B T. \tag{28.36}$$

Since $\mathcal{E}_d$ and $\mathcal{E}_a$ are close to the edges of the gap, this is only slightly more restrictive
than the nondegeneracy assumption (28.10). Condition (28.36) and the expressions
(28.32) and (28.34) for $n_d$ and $p_a$ insure that thermal excitation fully “ionizes” the
impurities, leaving only a negligible fraction with bound electrons or holes: $n_d \ll N_d$,
$p_a \ll N_a$. Equation (28.35) therefore becomes

$$\Delta n = n_c - p_v = N_d - N_a, \tag{28.37}$$


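so Eqs. (28.25) and (28.27) now give the carrier densities and chemical potential as
explicit functions of the temperature alone:

$$\left.\begin{matrix} n_c \\ p_v \end{matrix}\right\} = \frac{1}{2}\left[(N_d - N_a)^2 + 4n_i^2\right]^{1/2} \pm \frac{1}{2}\left[N_d - N_a\right], \tag{28.38}$$

$$\frac{N_d - N_a}{n_i} = 2\sinh\beta(\mu - \mu_i). \tag{28.39}$$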

If the gap is large compared with $k_B T$, the assumption (28.36) we began with
should remain valid unless $\mu$ is quite far from $\mu_i$ on the scale of $k_B T$. According to
Eq. (28.39), this will only happen when $|N_d - N_a|$ is several orders of magnitude
greater than the intrinsic carrier density $n_i$. Therefore Eq. (28.38) correctly describes
the transition from predominantly intrinsic behavior ($n_i \gg |N_d - N_a|$) well into the
region of predominantly extrinsic behavior ($n_i \ll |N_d - N_a|$). Expanding (28.38),
we find that at low impurity concentrations the corrections to the purely intrinsic
carrier densities are

$$\left.\begin{matrix} n_c \\ p_v \end{matrix}\right\} \approx n_i \pm \frac{1}{2}(N_d - N_a), \tag{28.40}$$

while for a considerable range of carrier concentrations in the extrinsic regime,

$$\begin{aligned}
n_c &\approx N_d - N_a, & p_v &\approx \frac{n_i^2}{N_d - N_a}, & N_d &> N_a; \\
p_v &\approx N_a - N_d, & n_c &\approx \frac{n_i^2}{N_a - N_d}, & N_a &> N_d.
\end{aligned} \tag{28.41}$$
Equation (28.41) is quite important in the theory of semiconducting devices
(Chapter 29). It asserts that the net excess of electrons (or holes) $N_d - N_a$ introduced
by the impurities is almost entirely donated to the conduction (or valence) band;
the other band has the very much smaller carrier density $n_i^2/(N_d - N_a)$, as required
by the law of mass action, (28.24).
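The passage from intrinsic to extrinsic behavior described by (28.38) through (28.41) can be explored numerically. The following Python sketch is one way to do so, assuming full ionization as in the text; the function names and the sample value of the intrinsic density are hypothetical, chosen only for illustration. The minority density is computed from the mass-action product rather than from the minus branch of (28.38), which is algebraically equivalent but avoids subtracting two nearly equal numbers.

```python
import math

def carrier_densities(n_i, delta_n):
    """Return (n_c, p_v) from (28.38), with delta_n = N_d - N_a.

    The minority density is obtained from the mass-action product
    n_c * p_v = n_i**2, which is algebraically the same as the minus
    branch of (28.38) but is better conditioned numerically.
    """
    root = math.sqrt(delta_n * delta_n + 4.0 * n_i * n_i)
    majority = 0.5 * (root + abs(delta_n))
    minority = n_i * n_i / majority
    return (majority, minority) if delta_n >= 0 else (minority, majority)

def chemical_potential_shift(n_i, delta_n):
    """(mu - mu_i) / (k_B T), obtained by inverting (28.39)."""
    return math.asinh(delta_n / (2.0 * n_i))

if __name__ == "__main__":
    n_i = 1.0e10  # assumed intrinsic density (illustrative value only)
    for delta_n in (1.0e6, 1.0e10, 1.0e16):
        n_c, p_v = carrier_densities(n_i, delta_n)
        print(delta_n, n_c, p_v, chemical_potential_shift(n_i, delta_n))
```

With these sample numbers, an impurity excess a million times larger than $n_i$ moves the chemical potential by only about 14 $k_B T$ from its intrinsic value, which suggests why condition (28.36) survives well into the extrinsic regime when the gap is large.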

If the temperature is too low (or the impurity concentration too high), condition
(28.36) eventually fails to hold, and either $n_d/N_d$ or $p_a/N_a$ (but not both) ceases to be
negligible, i.e., one of the impurity types is no longer fully ionized by thermal excitation.
As a result, the dominant carrier density declines with decreasing temperature
(Figure 28.13).25


Figure 28.13

Temperature dependence of the majority carrier density (for the case $N_d > N_a$). The
two high-temperature regimes are discussed in the text; the very low-temperature
behavior is described in Problem 6.


IMPURITY  BAND  CONDUCTION 

As  the  temperature  approaches  zero,  so  does  the  fraction  of  ionized  impurities,  and 
therefore  also  the  density  of  carriers  in  the  conduction  or  valence  bands.  Nevertheless, 
some  small  residual  conductivity  is  observed  even  at  the  lowest  temperatures.  This 
is  because  the  wave  function  of  an  electron  (or  hole)  bound  to  an  impurity  site  has 
considerable  spatial  extent,  and  therefore  the  overlap  of  wave  functions  at  different 
impurity  sites  is  possible  even  at  fairly  low  concentrations.  When  this  overlap  is 
not  negligible,  it  is  possible  for  an  electron  to  tunnel  from  one  site  to  another.  The 
resulting  transport  of  charge  is  known  as  “impurity  band  conduction.” 

The  use  of  the  term  “band”  in  this  context  is  based  on  an  analogy  with  the  tight- 
binding  method  (Chapter  10),  which  shows  that  a  set  of  atomic  levels  with  a  single 
energy  can  broaden  into  a  band  of  energies,  when  wave  function  overlap  is  taken 
into  account.  The  impurities,  however,  are  usually  not  situated  at  the  sites  of  a  Bravais 
lattice,  and  one  must  therefore  be  cautious  in  attributing  to  the  impurity  “bands” 
features  associated  with  electronic  bands  in  periodic  potentials.26 


25  This  behavior  is  described  more  quantitatively  in  Problem  6. 

26  The  problem  of  electronic  behavior  in  aperiodic  potentials  (which  arises  not  only  in  connection 
with  impurity  bands,  but  also,  for  example,  in  the  case  of  disordered  alloys)  is  still  in  its  infancy,  and  is 
one  of  the  very  lively  areas  of  current  research  in  solid  state  physics. 
THE THEORY OF TRANSPORT IN NONDEGENERATE
SEMICONDUCTORS

It  is  a  straightforward  consequence  (Problem  7)  of  Fermi-Dirac  statistics  and  the 
nondegeneracy  assumption  (28.10)  that  the  thermal  equilibrium  velocity  distribution 
for  electrons  near  a  particular  conduction  band  minimum  (or  holes  near  a  particular 
valence  band  maximum)  has  the  form: 

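$$f(\mathbf{v}) = n\,\frac{|\det \mathbf{M}|^{1/2}}{(2\pi k_B T)^{3/2}} \exp\left\{-\frac{\beta}{2}\sum_{\mu\nu} v_\mu M_{\mu\nu} v_\nu\right\}, \tag{28.42}$$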

where  n  is  their  contribution  to  the  total  carrier  density. 

This  is  just  the  form  assumed  by  the  thermal  equilibrium  molecular  velocity 
distribution  in  a  classical  gas,  with  two  exceptions: 

1.  In  a  classical  gas,  the  density  of  molecules  n  is  specified;  in  a  semiconductor, 
n  is  an  extremely  sensitive  function  of  temperature. 

2.  In  a  classical  gas  the  mass  tensor  M  is  diagonal. 

As  a  result,  the  theory  of  transport  in  a  nondegenerate  semiconductor  is  similar 
to  the  theory  of  transport  in  a  classical  gas  of  several  charged  components,27  and  many 
results  of  the  classical  theory  can  be  applied  directly  to  semiconductors,  when 
allowance  is  made  for  the  temperature  dependence  of  the  carrier  densities  and  tensor 
character of the mass. For example, the anomalously high thermopower of a semiconductor
(page 563) is only anomalous in comparison with the thermopower of
metals;  it  is  quite  in  accord  with  the  properties  of  a  classical  charged  gas.  Indeed, 
the  thermopower  of  metals  was  considered  anomalously  low  in  the  early  days  of 
electron  theory,  before  it  was  realized  that  metallic  electrons  must  be  described  by 
Fermi-Dirac,  rather  than  classical,  statistics. 


PROBLEMS 


1.  Cyclotron  Resonance  in  Semiconductors 

(a)  Show  that  the  formulas  (28.6)  and  (28.7)  for  the  cyclotron  resonance  frequency  follow 
from  substituting  the  oscillatory  velocity  (28.5)  into  the  semiclassical  equation  of  motion  (28.4), 
and  requiring  that  the  resulting  homogeneous  equation  have  a  nonzero  solution. 

(b)  Show  that  (28.7)  and  (28.8)  are  equivalent  representations  of  the  cyclotron  effective  mass 
by  evaluating  (28.7)  in  the  coordinate  system  in  which  the  mass  tensor  M  is  diagonal. 

2.  Interpretation  of  Cyclotron  Resonance  Data 

(a)  Compare  the  cyclotron  resonance  signal  from  silicon  in  Figure  28.9b  with  the  geometry 
of  the  conduction  band  ellipsoids  shown  in  Figure  28.5,  and  explain  why  there  are  only  two 
electron  peaks  although  there  are  six  pockets  of  electrons. 

27  Such  a  theory  was  extensively  developed  by  Lorentz,  as  an  attempt  at  refining  the  Drude  model 
of  metals.  Although  Lorentz’s  theory  requires  substantial  modification  to  be  applicable  to  metals  (i.e.,  the 
introduction  of  degenerate  Fermi-Dirac  statistics  and  band  structure),  many  of  his  results  can  be  applied 
to  the  description  of  nondegenerate  semiconductors  with  very  little  alteration. 
(b) Verify that the positions of the electron resonances in Figure 28.9b are consistent with
the  electron  effective  masses  given  for  silicon  on  page  569  and  the  formulas,  (28.6)  and  (28.8), 
for  the  resonance  frequency. 

(c)  Repeat  (a)  for  the  resonance  in  germanium  (Figure  28.9a),  noting  that  Figure  28.7  shows 
four  electron  pockets. 

(d)  Verify  that  the  positions  of  the  electron  resonances  in  Figure  28.9a  are  consistent  with 
the  electron  effective  masses  given  for  germanium  on  page  569. 

3.  Level  Density  for  Ellipsoidal  Pockets 

(a) Show that the contribution of an ellipsoidal pocket of electrons to the conduction band
density of levels $g_c(\mathcal{E})$ is given by $(d/d\mathcal{E})h(\mathcal{E})$, where $h(\mathcal{E})$ is the number of levels per unit volume in
the pocket with energies less than $\mathcal{E}$.

(b) Show, similarly, that the contribution of an ellipsoidal pocket of holes to the valence
band density of levels $g_v(\mathcal{E})$ is given by $(d/d\mathcal{E})h(\mathcal{E})$, where $h(\mathcal{E})$ is the number of electronic levels per
unit volume in the pocket with energies greater than $\mathcal{E}$.

(c) Using the fact that a volume $\Omega$ of k-space contains $\Omega/4\pi^3$ electronic levels per cubic centimeter
and the formula $V = (4\pi/3)abc$ for the volume of the ellipsoid $x^2/a^2 + y^2/b^2 + z^2/c^2 = 1$,
show that formulas (28.14) follow directly from (a) and (b), when the conduction (or valence) band
has a single ellipsoidal pocket.

4.  Statistics  of  Donor  Levels 

(a) Show that if the energy of a doubly occupied donor level is taken to be $2\mathcal{E}_d + \Delta$, then
Eq. (28.32) must be replaced by

$$n_d = N_d\,\frac{1 + e^{-\beta(\mathcal{E}_d - \mu + \Delta)}}{\tfrac{1}{2}e^{\beta(\mathcal{E}_d - \mu)} + 1 + \tfrac{1}{2}e^{-\beta(\mathcal{E}_d - \mu + \Delta)}}. \tag{28.43}$$

(b) Verify that Eq. (28.43) reduces to (28.32) as $\Delta \to \infty$, and that it reduces to the expected
result for independent electrons as $\Delta \to 0$.

(c) Consider a donor impurity with many bound electronic orbital levels, with energies $\mathcal{E}_i$.
Assuming that the electron-electron Coulomb repulsion prohibits more than a single electron
from being bound to the impurity, show that the appropriate generalization of (28.32) is

$$n_d = \frac{N_d}{1 + \tfrac{1}{2}\left[\sum_i e^{-\beta(\mathcal{E}_i - \mu)}\right]^{-1}}. \tag{28.44}$$

Indicate  how  (if  at  all)  this  alters  the  results  described  on  pages  582-584. 

5.  Constraint  on  Carrier  Densities  in  p-Type  Semiconductors 

Describe the electronic configuration of a doped semiconductor as $T \to 0$, when $N_a > N_d$.
Explain  why  (28.35)  (derived  in  the  text  when  Nd  >  Na)  continues  to  give  a  correct  constraint  on 
the  electron  and  hole  densities  at  nonzero  temperatures,  when  Na  >  Nd. 

6.  Carrier  Statistics  in  Doped  Semiconductors  at  Low  Temperatures 

Consider a doped semiconductor with $N_d > N_a$. Assume that the nondegeneracy condition
(28.10) holds, but that $(N_d - N_a)/n_i$ is so large that (28.39) does not necessarily yield a value of $\mu$
compatible with (28.36).

(a) Show under these conditions that $p_v$ is negligible compared with $n_c$, and $p_a$ is negligible
compared with $N_a$, so that the chemical potential is given by the quadratic equation

$$N_c e^{-\beta(\mathcal{E}_c - \mu)} = N_d - N_a - \frac{N_d}{\tfrac{1}{2}e^{\beta(\mathcal{E}_d - \mu)} + 1}. \tag{28.45}$$
(b) Deduce from this that if the temperature drops so low that $n_c$ ceases to be given by
$N_d - N_a$ (Eq. (28.41)), then there is a transition to a regime in which

$$n_c = \sqrt{\frac{N_c(N_d - N_a)}{2}}\; e^{-\beta(\mathcal{E}_c - \mathcal{E}_d)/2}. \tag{28.46}$$

(c) Show that as the temperature drops still lower, there is another transition to a regime
in which

$$n_c = \frac{N_c(N_d - N_a)}{2N_a}\, e^{-\beta(\mathcal{E}_c - \mathcal{E}_d)}. \tag{28.47}$$

(d) Derive the results analogous to (28.45)-(28.47) when $N_a > N_d$.

7. Velocity Distribution for Carriers in an Ellipsoidal Pocket

Derive the velocity distribution (28.42) from the k-space distribution function

$$f(\mathbf{k}) \propto \frac{1}{\exp\left\{\beta\left[\mathcal{E}_c + \dfrac{\hbar^2}{2}\sum_{\mu\nu} k_\mu (\mathbf{M}^{-1})_{\mu\nu} k_\nu - \mu\right]\right\} + 1} \tag{28.48}$$

by assuming the nondegeneracy condition (28.10), changing from the variable $\mathbf{k}$ to the variable
$\mathbf{v}$, and noting that the contribution of the pocket to the carrier density is just $n = \int d\mathbf{v}\, f(\mathbf{v})$.
As used by music lovers, the term “solid state physics” refers only to the subject of
inhomogeneous semiconductors, and it would be more accurate if it were this latter
phrase that festooned the brows of countless tuners and amplifiers. The prevailing
usage reflects the fact that modern solid state physics has had its most dramatic
and extensive technological consequences through the electronic properties of semiconducting
devices. These devices use semiconducting crystals in which the concentrations
of donor and acceptor impurities have been made nonuniform in a carefully
controlled manner. We shall not attempt to survey here the great variety of semiconducting
devices, but will only describe the broad physical principles that underlie
their operation. These principles come into play in determining how the densities and
currents of electrons and holes are distributed in an inhomogeneous semiconductor,
both in the absence and in the presence of an applied electrostatic potential.

The  inhomogeneous  semiconductors  of  interest  are,  ideally,  single  crystals  in  which 
the  local  concentration  of  donor  and  acceptor  impurities  varies  with  position.  One 
way  to  make  such  crystals  is  to  vary  the  concentration  of  impurities  in  the  “melt” 
as  the  growing  crystal  is  slowly  extracted,  thus  producing  a  variation  in  impurity 
concentration  along  one  spatial  direction.  Delicate  methods  of  fabrication  are  needed 
because  it  is  generally  important,  for  efficient  operation,  that  there  be  no  great  increase 
in  electronic  scattering  associated  with  the  variation  in  impurity  concentration. 

We  shall  illustrate  the  physics  of  inhomogeneous  semiconductors  by  considering 
the  simplest  example,  the  p-n  junction.  This  is  a  semiconducting  crystal  in  which  the 
impurity  concentration  varies  only  along  a  given  direction  (taken  to  be  the  x-axis) 
and  only  in  a  small  region  (taken  to  be  around  x  =  0).  For  negative  x  the  crystal  has 
a  preponderance  of  acceptor  impurities  (i.e.,  it  is  p-type)  while  for  positive  x  it  has  a 
preponderance  of  donor  impurities  (i.e.,  it  is  n-type)  (Figure  29.1).  The  manner  in 
which the densities of donors and acceptors $N_d(x)$ and $N_a(x)$ vary with position is
called the “doping profile.” The term “junction” is used to refer both to the device
as a whole and, more specifically, to the transition region about x = 0 in which the
doping profile is nonuniform.

Figure 29.1

The impurity densities along a p-n junction in the case of an “abrupt junction,” for
which donor impurities dominate at positive x, and acceptor impurities at negative x.
The donors are represented by (+) to indicate their charge when ionized, and the
acceptors by (-). For a junction to be abrupt, the region about x = 0 where the
impurity concentrations change must be narrow compared with the “depletion layer”
in which the carrier densities are nonuniform. (Typical plots of the carrier densities
are superimposed on this figure in Figure 29.3.)

As  we  shall  see  below,  the  nonuniformity  in  impurity  concentrations  induces  a 
nonuniformity  in  the  densities  nc(x)  and  pv(x)  of  conduction  band  electrons  and 
valence band holes, which in turn gives rise to a potential $\phi(x)$. The region in which
these carrier densities are nonuniform is known as the “depletion layer” (or “space-charge
region”). The depletion layer can extend for a range of about $10^2$ to $10^4$ Å
around  the  (generally  more  narrow)  transition  region  in  which  the  doping  profile 
varies,  as  we  shall  see  below.  Within  the  depletion  layer,  except  near  its  boundaries, 
the  total  density  of  carriers  is  very  much  less  than  it  is  in  the  homogeneous  regions 
farther  away  from  the  transition  region.  The  existence  of  a  depletion  layer  is  one  of 
the  crucial  properties  of  the  p-n  junction.  One  of  our  main  concerns  will  be  to  explain 
why  such  a  layer  is  induced  by  the  variation  in  impurity  concentrations,  and  how  its 
structure  changes  with  the  application  of  an  external  potential  V. 

For simplicity we shall consider here only “abrupt junctions,” in which the transition
region is so sharp that variation in impurity concentrations1 can be represented
by a single discontinuous change at x = 0:

$$N_d(x) = \begin{cases} N_d, & x > 0, \\ 0, & x < 0, \end{cases} \qquad N_a(x) = \begin{cases} 0, & x > 0, \\ N_a, & x < 0. \end{cases} \tag{29.1}$$


Abrupt  junctions  are  not  only  conceptually  the  simplest,  but  also  the  type  of  greatest 
practical  interest.  How  sharp  the  actual  transition  region  must  be  made  for  (29.1)  to 
give  a  reasonable  model  of  a  physical  junction  will  emerge  in  the  analysis  below.  We 
shall  find  that  a  junction  may  be  regarded  as  abrupt  if  the  transition  region  in  the 
actual  doping  profile  is  small  in  extent  compared  with  the  depletion  layer.  In  most 
cases this permits the transition region to extend for 100 Å or more. A junction that
cannot  be  treated  as  abrupt  is  called  a  “graded  junction.” 


THE  SEMICLASSICAL  MODEL 

To calculate the response of an inhomogeneous semiconductor to an applied electrostatic
potential, or even to compute the distribution of electric charge in the absence
of an applied potential, one almost always uses the semiclassical model of Chapter 12.
When a potential $\phi(x)$ is superimposed on the periodic potential of the crystal, the
semiclassical model treats the electrons in the nth band as classical particles (i.e., as
wave packets) governed by the Hamiltonian

$$H_n = \mathcal{E}_n\!\left(\frac{\mathbf{p}}{\hbar}\right) - e\phi(x). \tag{29.2}$$


1 It is not essential that there be only donor impurities in the n-type region and only acceptor impurities
in the p-type region. It suffices for each impurity type to be the dominant one in its own region.
In  what  follows  Nd  may  be  viewed  as  the  excess  density  of  donors  over  acceptors  and  Na  as  the  excess 
density  of  acceptors  over  donors. 
Such a treatment is valid provided that the potential $\phi(x)$ varies sufficiently slowly.
How slow this variation must be is, in general, a very difficult question to answer.
At the very least, one requires that the change in electrostatic energy $e\Delta\phi$ over a
distance of the order of the lattice constant be small compared with the band gap
$E_g$, but the condition may well be even more stringent than this.2 In the case of the
p-n junction the potential $\phi$ has almost all of its spatial variation within the depletion
layer. There, as we shall see, the energy $e\phi$ changes by about $E_g$, over a distance that
is typically a few hundred angstroms or more (so that the field in the depletion layer
can be as large as $10^6$ volts per meter). Although this satisfies the minimum necessary
condition for the validity of the semiclassical model (the change in $e\phi$ over a lattice
constant is no more than a fraction of a percent of $E_g$), the variation is strong enough
that one cannot exclude the possibility that the semiclassical description may break
down in the depletion layer. Thus one should bear in mind the possibility that the
field in the depletion layer may be strong enough to induce tunneling of electrons
from valence band to conduction band levels, leading to a conductivity considerably
in excess of the semiclassical prediction.

Having  issued  this  warning,  however,  we  shall  follow  the  general  practice  of 
assuming  the  validity  of  the  semiclassical  description,  so  that  we  may  explore  its 
consequences.  Before  describing  the  semiclassical  theory  of  the  currents  that  flow  in 
a  p-n  junction  in  the  presence  of  an  applied  potential,  we  first  examine  the  case  of  the 
p-n  junction  in  thermal  equilibrium,  in  the  absence  of  applied  potentials  and  current 
flow. 

THE  p-n  JUNCTION  IN  EQUILIBRIUM 

We wish to determine the carrier densities and the electrostatic potential $\phi(x)$ induced
by the nonuniform doping. We assume that nondegenerate conditions hold throughout
the material, so that the carrier densities at each position x have the “Maxwellian”
forms analogous to the densities (28.12) we found in the uniform case. In the nonuniform
case, to derive the carrier densities at position x along the junction in the
presence of a potential $\phi(x)$, the semiclassical procedure is simply to repeat the
analysis for the uniform case, but using the semiclassical one-electron energy (29.2),
in which each level is shifted by $-e\phi(x)$. Using the forms (28.3) of the $\mathcal{E}(\mathbf{k})$ appropriate
to levels near the conduction band minimum or valence band maximum, we see that
the effect of this is simply to shift the constants $\mathcal{E}_c$ and $\mathcal{E}_v$ by $-e\phi(x)$. Thus Eq. (28.12)
for the equilibrium carrier densities is generalized to

$$\begin{aligned}
n_c(x) &= N_c(T)\exp\left\{-\frac{\mathcal{E}_c - e\phi(x) - \mu}{k_B T}\right\}, \\
p_v(x) &= P_v(T)\exp\left\{-\frac{\mu - \mathcal{E}_v + e\phi(x)}{k_B T}\right\}.
\end{aligned} \tag{29.3}$$

The potential $\phi(x)$ must be determined self-consistently, as that potential arising
(via  Poisson’s  equation)  when  the  carrier  densities  have  the  forms  (29.3).  We  examine 


2 A crude argument appropriate to metals is given in Appendix J. Analogous arguments (of comparable
crudity) can be developed for semiconductors.
this problem in the special case (again, the case of major practical interest) in which
far from the transition region on either side extrinsic conditions prevail, in which
the impurities are fully “ionized” (see pages 583-584). Thus far away on the n-side
the density of conduction band electrons is very nearly equal to the density $N_d$ of
donors, while far away on the p-side the density of valence band holes is very nearly
equal to the density $N_a$ of acceptors:

$$\begin{aligned}
N_d &= n_c(\infty) = N_c(T)\exp\left\{-\frac{\mathcal{E}_c - e\phi(\infty) - \mu}{k_B T}\right\}, \\
N_a &= p_v(-\infty) = P_v(T)\exp\left\{-\frac{\mu - \mathcal{E}_v + e\phi(-\infty)}{k_B T}\right\}.
\end{aligned} \tag{29.4}$$


Since the entire crystal is in thermal equilibrium, the chemical potential does not
vary with position. In particular, the same value of $\mu$ appears in either of Eqs. (29.4);
this immediately requires that the total potential drop across the junction be given by3

$$e\phi(\infty) - e\phi(-\infty) = \mathcal{E}_c - \mathcal{E}_v + k_B T \ln\left[\frac{N_d N_a}{N_c P_v}\right], \tag{29.5}$$

or

$$e\,\Delta\phi = E_g + k_B T \ln\left[\frac{N_d N_a}{N_c P_v}\right]. \tag{29.6}$$
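To get a feeling for the size of the potential drop in (29.6), one can evaluate it for some sample numbers. The Python sketch below does this; the function name, the argument names, and all numerical inputs (gap, temperature, doping levels, and the values standing in for $N_c(T)$ and $P_v(T)$) are illustrative assumptions, not data from the text.

```python
import math

K_B_EV_PER_K = 8.617333e-5  # Boltzmann constant in eV/K

def built_in_potential_energy(e_gap, temperature, n_d, n_a, n_c_eff, p_v_eff):
    """Return e * delta_phi in eV from Eq. (29.6).

    e_gap is in eV and temperature in K; the four densities must share
    the same units. n_c_eff and p_v_eff stand for N_c(T) and P_v(T).
    """
    k_t = K_B_EV_PER_K * temperature
    return e_gap + k_t * math.log((n_d * n_a) / (n_c_eff * p_v_eff))

if __name__ == "__main__":
    # Illustrative round numbers only (assumed, not taken from the text).
    print(built_in_potential_energy(1.1, 300.0, 1e16, 1e16, 1e19, 1e19))
```

Under nondegenerate conditions $N_d$ is smaller than $N_c$ and $N_a$ smaller than $P_v$, so the logarithm is negative and $e\,\Delta\phi$ comes out somewhat below $E_g$, although still of the same order.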


An alternative way of representing the information in (29.3) and (29.6) is sometimes
helpful. If we define a position-dependent “electrochemical potential” $\mu_e(x)$ by

$$\mu_e(x) = \mu + e\phi(x), \tag{29.7}$$

then we may write the carrier densities (29.3) as

$$\begin{aligned}
n_c(x) &= N_c(T)\exp\left\{-\frac{\mathcal{E}_c - \mu_e(x)}{k_B T}\right\}, \\
p_v(x) &= P_v(T)\exp\left\{-\frac{\mu_e(x) - \mathcal{E}_v}{k_B T}\right\}.
\end{aligned} \tag{29.8}$$

These have precisely the form of the relations (28.12) for a homogeneous semiconductor,
except that the constant chemical potential $\mu$ is replaced by the electrochemical
potential $\mu_e(x)$. Thus $\mu_e(\infty)$ is the chemical potential of a homogeneous
n-type crystal whose properties are identical to those of the inhomogeneous crystal
far on the n-side of the transition region, while $\mu_e(-\infty)$ is the chemical potential of
a homogeneous p-type crystal identical to the inhomogeneous crystal far on the
p-side. The relation (29.6) can equally well be written as4

$$e\,\Delta\phi = \mu_e(\infty) - \mu_e(-\infty). \tag{29.9}$$


3 The derivation of (29.5) requires the validity of (29.3) only far from the depletion layer, where $\phi$ is
indeed  slowly  varying.  It  therefore  holds  even  when  the  semiclassical  model  fails  in  the  transition  region. 

4  This  follows  directly  from  (29.7).  Equation  (29.9)  is  sometimes  summarized  in  the  rule  that  the  total 
potential  drop  is  such  as  to  bring  the  “Fermi  levels  at  the  two  ends  of  the  junction”  into  coincidence.  This 
point  of  view  is  evidently  inspired  by  the  representation  of  Figure  29.2b. 
Figure 29.2a shows the electrochemical potential plotted as a function of position
along the p-n junction. We have assumed (as will be demonstrated below) that $\phi$
varies monotonically from one end to the other. Figure 29.2b shows an alternative
representation of the same information in which the potential $\phi$ giving the position
dependence in (29.3) is regarded as shifting $\mathcal{E}_c$ (or $\mathcal{E}_v$) rather than $\mu$. In either case,
the significance of the diagrams is that at any particular position x along the junction,
the carrier densities are those that would be found in a piece of homogeneous material
with the impurity concentrations prevailing at x, and with a chemical potential that
is positioned with respect to the band edges as shown in the vertical section of the
diagrams at x.

Figure 29.2

Two equivalent ways of representing the effect of the internal potential $\phi(x)$ on the
electron and hole densities of a p-n junction.

(a) The electrochemical potential $\mu_e(x) = \mu + e\phi(x)$ is plotted along the p-n junction.
The carrier densities at any point x are those that would be found in a uniform
semiconductor characterized by the fixed band and impurity energies $\mathcal{E}_c$, $\mathcal{E}_v$, $\mathcal{E}_d$, and
$\mathcal{E}_a$, at a chemical potential equal to $\mu_e(x)$.

(b) Here $\mathcal{E}_c(x) = \mathcal{E}_c - e\phi(x)$ is the energy of an electron wave packet localized about x
formed from levels very near the conduction band minimum, and similarly for $\mathcal{E}_v(x)$.
The energies of the local impurity levels are $\mathcal{E}_d(x) = \mathcal{E}_d - e\phi(x)$ and
$\mathcal{E}_a(x) = \mathcal{E}_a - e\phi(x)$. The (constant) chemical potential is also shown. The carrier
densities at any point x are those that would be found in a uniform semiconductor
characterized by band and impurity energies equal to $\mathcal{E}_c(x)$, $\mathcal{E}_d(x)$, $\mathcal{E}_a(x)$, and $\mathcal{E}_v(x)$ at
the fixed chemical potential $\mu$.

Equation (29.6) (or its equivalent form, (29.9)) serves as the boundary condition
in a differential equation determining the potential $\phi(x)$. The differential equation
is simply Poisson’s equation,5

$$-\nabla^2\phi = -\frac{d^2\phi}{dx^2} = \frac{4\pi\rho(x)}{\epsilon}, \tag{29.10}$$


5 Here $\epsilon$ is the static dielectric constant of the semiconductor. The use of the macroscopic equation
is possible because $\phi$ varies over the depletion layer, which is large on the interatomic scale.




relating the potential $\phi(x)$ to the charge distribution $\rho(x)$ giving rise to it. To express
$\rho(x)$ in terms of $\phi$ and get a closed equation, we first note that if (as we have assumed)
the impurities are fully ionized far from the junction, then they will remain fully
ionized6 at all x. Consequently the charge density due to the impurities and the
carriers is7

$$\rho(x) = e\left[N_d(x) - N_a(x) - n_c(x) + p_v(x)\right]. \tag{29.11}$$

When the carrier and impurity densities (29.3) and (29.1) are substituted into the
form (29.11) for the charge density, and the result is substituted into Poisson’s equation
(29.10), one finds a nonlinear differential equation for $\phi(x)$ whose exact solution
usually requires numerical techniques.8 However, a quite reasonable description of
$\phi(x)$ may be had by exploiting the fact that the total change in $e\phi$ is of order $E_g \gg k_B T$.
The relevance of this fact emerges when we combine (29.3) and (29.4) to write

$$\begin{aligned}
n_c(x) &= N_d\, e^{-e[\phi(\infty) - \phi(x)]/k_B T}, \\
p_v(x) &= N_a\, e^{-e[\phi(x) - \phi(-\infty)]/k_B T}.
\end{aligned} \tag{29.12}$$

Suppose that the change in $\phi$ occurs within a region $-d_p \le x \le d_n$. Outside of this
region $\phi$ has its asymptotic value, and therefore $n_c = N_d$ on the n side, $p_v = N_a$ on
the p side, and $\rho = 0$. Within the region, except quite near the boundaries, $e\phi$ differs
by many $k_B T$ from its asymptotic value, so $n_c \ll N_d$, $p_v \ll N_a$. Thus, except in the
vicinity of $x = -d_p$ and $x = d_n$, the charge density (29.11) between $-d_p$ and $d_n$ is
quite accurately given by $\rho(x) = e[N_d(x) - N_a(x)]$, there being no appreciable carrier
charge to cancel the charges of the “ionized” impurities. The points $x = -d_p$ and
$x = d_n$ therefore mark the boundaries of the depletion layer.

Combining these observations, and using the form (29.1) for the impurity densities,
we find that except for $x$ just greater than $-d_p$ or just less than $d_n$, Poisson's equation
is well approximated by

$$\phi''(x) = \begin{cases} 0, & x > d_n, \\ -\dfrac{4\pi e N_d}{\epsilon}, & d_n > x > 0, \\ \dfrac{4\pi e N_a}{\epsilon}, & 0 > x > -d_p, \\ 0, & -d_p > x. \end{cases} \tag{29.13}$$


6 If $\phi$ is monotonic (as we shall find below) this follows from the fact that the degree of ionization of
an impurity increases, the farther the chemical potential is from the impurity level. See Figure 29.2 and
Eqs. (28.32) and (28.34).

7 The density of holes on the far n-side has the very small value $p_v(\infty) = n_i^2/N_d$ required by the law
of mass action. However, the density of electrons on the far n-side actually exceeds $N_d$ by this same small
amount so as to insure that $n_c(\infty) - p_v(\infty) = N_d$. In computing the total charge density, if we ignore this
small correction in $n_c$ (as we have done in writing (29.4)), then we should also ignore the small compensating
density of holes on the far n-side. Similar remarks apply to the small concentration of electrons on the far
p-side. These "minority carrier densities" have negligible effect on the total balance of charge. We shall see
below, however, that they play an important role in determining the flow of currents in the presence of an
applied potential.

8 Some aspects of that equation are investigated in Problem 1.
This immediately integrates to give

$$\phi(x) = \begin{cases} \phi(\infty), & x > d_n, \\ \phi(\infty) - \left(\dfrac{2\pi e N_d}{\epsilon}\right)(x - d_n)^2, & d_n > x > 0, \\ \phi(-\infty) + \left(\dfrac{2\pi e N_a}{\epsilon}\right)(x + d_p)^2, & 0 > x > -d_p, \\ \phi(-\infty), & x < -d_p. \end{cases} \tag{29.14}$$


The boundary conditions (continuity of $\phi$ and its first derivative) are explicitly obeyed
by the solution (29.14) at $x = -d_p$ and $x = d_n$. Requiring them to hold at $x = 0$
gives two additional equations that determine the lengths $d_n$ and $d_p$. Continuity of
$\phi'$ at $x = 0$ implies that

$$N_d d_n = N_a d_p, \tag{29.15}$$


which is just the condition that the excess of positive charge on the n-side of the
junction be equal to the excess of negative charge on the p-side. Continuity of $\phi$ at
$x = 0$ requires that

$$\left(\frac{2\pi e}{\epsilon}\right)\left(N_d d_n^2 + N_a d_p^2\right) = \phi(\infty) - \phi(-\infty) = \Delta\phi. \tag{29.16}$$


Together with (29.15) this determines the lengths $d_n$ and $d_p$:

$$d_{n,p} = \left[\frac{(N_a/N_d)^{\pm 1}}{(N_d + N_a)}\,\frac{\epsilon\,\Delta\phi}{2\pi e}\right]^{1/2}. \tag{29.17}$$
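As an illustration of how (29.14) through (29.17) fit together, the following minimal sketch evaluates the piecewise potential in reduced units and lets one check numerically that it joins continuously at $x = 0$. The function names and the constant `k` (standing for $2\pi e/\epsilon$, set to 1 by default) are assumptions introduced for this example only.

```python
import math

def widths(n_a, n_d, dphi, k=1.0):
    """Return (d_n, d_p) from Eq. (29.17); k stands for 2*pi*e/epsilon."""
    base = dphi / (k * (n_d + n_a))
    return math.sqrt(n_a / n_d * base), math.sqrt(n_d / n_a * base)

def phi(x, n_a, n_d, phi_minus, phi_plus, k=1.0):
    """Piecewise potential of Eq. (29.14) in the same reduced units."""
    d_n, d_p = widths(n_a, n_d, phi_plus - phi_minus, k)
    if x >= d_n:
        return phi_plus                               # homogeneous n-side
    if x >= 0.0:
        return phi_plus - k * n_d * (x - d_n) ** 2    # n-side of the layer
    if x >= -d_p:
        return phi_minus + k * n_a * (x + d_p) ** 2   # p-side of the layer
    return phi_minus                                  # homogeneous p-side

# Illustrative (hypothetical) densities in arbitrary units.
d_n, d_p = widths(n_a=2.0, n_d=5.0, dphi=1.0)
print("d_n, d_p =", d_n, d_p)
print("charge balance N_d d_n - N_a d_p =", 5.0 * d_n - 2.0 * d_p)
```

The more heavily doped side ends up with the thinner share of the depletion layer, as (29.15) requires.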


To estimate the sizes of these lengths we may write Eq. (29.17) in the numerically
more convenient form

$$d_{n,p} = 105\left[\frac{(N_a/N_d)^{\pm 1}}{10^{-18}(N_d + N_a)}\,[\epsilon e\,\Delta\phi]_{\text{eV}}\right]^{1/2}\ \text{Å}. \tag{29.18}$$


The quantity $\epsilon e\,\Delta\phi$ is typically of order 1 eV, and since typical impurity concentrations
are in the range from $10^{14}$ to $10^{18}$ per cubic centimeter, the lengths $d_n$ and $d_p$,
which give the extent of the depletion layer, will generally be from $10^4$ to $10^2$ Å. The
field within the depletion layer is of order $\Delta\phi/(d_n + d_p)$, and for $d$'s of this size is
therefore in the range from $10^5$ to $10^7$ volts per meter, for an energy gap of 0.1 eV.
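The estimate can be reproduced with a short calculation. The sketch below simply evaluates the practical form (29.18), taking the densities in cm$^{-3}$ and the product $\epsilon e\,\Delta\phi$ in eV; the function name and the choice of equal doping on both sides are illustrative assumptions.

```python
import math

def depletion_widths_angstrom(n_a, n_d, eps_e_dphi_ev):
    """Return (d_n, d_p) in angstroms from Eq. (29.18).

    n_a, n_d: acceptor and donor densities in cm^-3.
    eps_e_dphi_ev: dielectric constant times e*delta_phi, in eV.
    """
    common = eps_e_dphi_ev / (1e-18 * (n_d + n_a))
    d_n = 105.0 * math.sqrt((n_a / n_d) * common)
    d_p = 105.0 * math.sqrt((n_d / n_a) * common)
    return d_n, d_p

# Symmetric doping across the range of concentrations quoted in the text.
for n in (1e14, 1e16, 1e18):
    d_n, d_p = depletion_widths_angstrom(n, n, 1.0)
    print(f"N = {n:.0e} cm^-3: d_n = d_p = {d_n:.3g} angstrom")
```

For these inputs the widths run from somewhat under $10^4$ Å at the lowest doping to somewhat under $10^2$ Å at the highest, in line with the range stated above.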

The resulting picture of the depletion layer is shown in Figure 29.3. The potential
$\phi$ varies monotonically through the layer, as asserted above. Except at the boundaries
of  the  layer,  the  carrier  concentrations  are  negligible  compared  with  the  impurity 
concentrations,  so  the  charge  density  is  that  of  the  ionized  impurities.  Outside  of  the 
depletion  layer  the  carrier  concentrations  balance  the  impurity  concentrations,  and 
the  charge  density  is  zero. 

The  mechanism  establishing  such  a  region  of  sharply  reduced  carrier  densities  is 
relatively  simple.  Suppose  that  one  initially  were  able  to  impose  carrier  concentrations 
that  gave  charge  neutrality  at  every  point  in  the  crystal.  Such  a  configuration  could 
not  be  maintained,  for  electrons  would  begin  to  diffuse  from  the  n-side  (where  their 
concentration  was  high)  to  the  p-side  (where  their  concentration  was  very  low),  and 
Figure 29.3

(a) Carrier densities, (b) charge density, and (c) potential $\phi(x)$ plotted
vs. position across an abrupt p-n junction. In the analysis in the text
the approximation was made that the carrier densities and charge density
are constants except for discontinuous changes at $x = -d_p$ and
$x = d_n$. More precisely (see Problem 1), these quantities undergo rapid
change over regions just within the depletion layer whose extent is a
fraction of order $(k_BT/E_g)^{1/2}$ of the total extent of the depletion layer.
The extent of the depletion layer is typically from $10^2$ to $10^4$ Å.


holes  would  diffuse  in  the  opposite  direction.  As  this  diffusion  continued,  the  result¬ 
ing  transfer  of  charge  would  build  up  an  electric  field  opposing  further  diffusive 
currents,  until  an  equilibrium  configuration  was  reached  in  which  the  effect  of  the 
field  on  the  currents  precisely  canceled  the  effect  of  diffusion.  Because  the  carriers 
are  highly  mobile,  in  this  equilibrium  configuration  the  carrier  densities  are  very  low 
wherever  the  field  has  an  appreciable  value.  This  is  precisely  the  state  of  affairs 
depicted  in  Figure  29.3. 

ELEMENTARY PICTURE OF RECTIFICATION BY A p-n JUNCTION

We now consider the behavior of a p-n junction when an external voltage $V$ is applied.
We shall take $V$ to be positive if its application raises the potential of the p-side with
respect to the n-side. When $V = 0$ we found above that there is a depletion layer
some $10^2$ to $10^4$ Å in extent about the transition point where the doping changes
from p-type to n-type, in which the density of carriers is reduced greatly below its
value in the homogeneous regions farther away. Because of its greatly reduced carrier
density, the depletion layer will have a much higher electrical resistance than the
homogeneous regions, and the whole device can therefore be viewed as a series circuit
in which a relatively high resistance is sandwiched between two relatively low resistances.
When a potential $V$ is applied across such a circuit, almost all of the potential
drop will occur across the region of high resistance. Thus even in the presence of an
applied potential $V$, we expect that the potential $\phi(x)$ along the device will vary
appreciably only within the depletion layer. When $V = 0$ we found that $\phi(x)$ rose
from the p-side of the depletion layer to the n-side by the amount (which we now
denote by $(\Delta\phi)_0$) given by Eq. (29.6), so we conclude that when $V \ne 0$ the change in
potential across the depletion layer is modified to

$$\Delta\phi = (\Delta\phi)_0 - V. \tag{29.19}$$

Associated with this change in potential drop across the depletion layer, there is a
change in the size of the layer. The lengths $d_n$ and $d_p$ giving the extent of the layer on
the n- and p-sides of the junction are determined by Eqs. (29.15) and (29.16), which
use only the value of the total potential drop across the layer, and the assumption
that the carrier densities are greatly reduced throughout almost all of the layer. We
shall find below that this assumption remains valid when $V \ne 0$, and therefore $d_n$
and $d_p$ continue to be given by Eq. (29.17) provided that we take the value of $\Delta\phi$ to be
$(\Delta\phi)_0 - V$. Since $d_n$ and $d_p$ vary as $(\Delta\phi)^{1/2}$ according to Eq. (29.17), we conclude that
when $V \ne 0$,

$$d_{n,p}(V) = d_{n,p}(0)\left[1 - \frac{V}{(\Delta\phi)_0}\right]^{1/2}. \tag{29.20}$$

This behavior of $\phi$ and the extent of the depletion layer are illustrated in Figure 29.4.
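A brief numerical sketch may make the scaling in (29.20) concrete. The helper below returns the factor by which the layer widths change under bias; the function name and the decision to reject $V \ge (\Delta\phi)_0$, where the square root is no longer real, are assumptions of this example rather than statements from the text.

```python
def width_scale(v, dphi0):
    """Return d(V)/d(0) = [1 - V/(delta phi)_0]^(1/2), as in Eq. (29.20).

    v and dphi0 must be in the same units (e.g. volts).
    """
    if v >= dphi0:
        # Assumption of this sketch: the depletion picture is only used
        # for biases below the equilibrium potential drop.
        raise ValueError("Eq. (29.20) requires V < (delta phi)_0")
    return (1.0 - v / dphi0) ** 0.5

for v in (-3.0, -1.0, 0.0, 0.5):
    print(f"V = {v:+.1f}: d(V)/d(0) = {width_scale(v, 1.0):.3f}")
```

Forward bias ($V > 0$) gives a factor below one and reverse bias ($V < 0$) a factor above one, which is the narrowing and widening shown in Figure 29.4.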

To  deduce  the  dependence  on  V  of  the  current  that  flows  when  a  p-n  junction  is 
“biased”  by  the  application  of  an  external  voltage,  we  must  consider  separately  the 
currents  of  electrons  and  holes.  Throughout  the  discussion  that  follows  we  shall  use 
the symbol $J$ for number current densities and $j$ for electrical current densities, so that

$$j_e = -eJ_e, \qquad j_h = eJ_h. \tag{29.21}$$

When $V = 0$, both $J_e$ and $J_h$ vanish. This does not, of course, mean that no individual
carriers flow across the junction, but only that as many electrons (or holes) flow in
one direction as in the other. When $V \ne 0$, this balance is disrupted. Consider, for
example, the current of holes across the depletion layer. This has two components:

1. A hole current flows from the n- to the p-side of the junction, known as the hole
generation  current.  As  the  name  indicates,  this  current  arises  from  holes  that  are 
generated  just  on  the  n-side  of  the  depletion  layer  by  the  thermal  excitation  of 
electrons  out  of  valence  band  levels.  Although  the  density  of  such  holes  on  the 
n-side  (“minority  carriers”)  is  minute  compared  with  the  density  of  electrons 
(“majority  carriers”),  they  play  an  important  role  in  carrying  current  across  the 
junction.  This  is  because  any  such  hole  that  wanders  into  the  depletion  layer  is 
immediately  swept  over  to  the  p-side  of  the  junction  by  the  strong  electric  field 
Figure 29.4

The charge density $\rho$ and potential $\phi$ in the depletion
layer (a) for the unbiased junction, (b) for the junction
with $V > 0$ (forward bias), and (c) for the junction
with $V < 0$ (reverse bias). The positions $x = d_n$ and
$x = -d_p$ that mark the boundaries of the depletion
layer when $V = 0$ are given by the dashed lines. The
depletion layer and change in $\phi$ are reduced by a
forward bias and increased by a reverse bias.


that  prevails  within  the  layer.  The  resulting  generation  current  is  insensitive  to 
the  size  of  the  potential  drop  across  the  depletion  layer,  since  any  hole,  having 
entered  the  layer  from  the  n-side,  will  be  swept  through  to  the  p-side.9 

2.  A  hole  current  flows  from  the  p-  to  the  n-side  of  the  junction,  known  as  the  hole 
recombination  current.10  The  electric  field  in  the  depletion  layer  acts  to  oppose 
such  a  current,  and  only  holes  that  arrive  at  the  edge  of  the  depletion  layer  with 
a  thermal  energy  sufficient  to  surmount  the  potential  barrier  will  contribute  to 


9 The density of holes giving rise to the hole generation current will also be insensitive to the size of
$V$, provided that $eV$ is small compared with $E_g$, for this density is entirely determined by the law of mass
action and the density of electrons. The latter density differs only slightly from the value $N_d$ outside of the
depletion layer when $eV$ is small compared with $E_g$, as will emerge from the more detailed analysis below.

10  So  named  because  of  the  fate  suffered  by  such  holes  upon  arriving  on  the  n-side  of  the  junction, 
where  one  of  the  abundant  electrons  will  eventually  drop  into  the  empty  level  that  constitutes  the  hole. 
the recombination current. The number of such holes is proportional to $e^{-e\Delta\phi/k_BT}$,
and therefore11

$$J_h^{\text{rec}} \propto e^{-e[(\Delta\phi)_0 - V]/k_BT}. \tag{29.22}$$

In  contrast  to  the  generation  current,  the  recombination  current  is  highly  sensitive 
to  the  applied  voltage  V.  We  can  compare  their  magnitudes  by  noting  that  when 
V  =  0  there  can  be  no  net  hole  current  across  the  junction: 

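$$J_h^{\text{rec}}\big|_{V=0} = J_h^{\text{gen}}. \tag{29.23}$$

Taken together with Eq. (29.22), this requires that

$$J_h^{\text{rec}} = J_h^{\text{gen}}\, e^{eV/k_BT}. \tag{29.24}$$

The total current of holes flowing from the p- to the n-side of the junction is given by
the recombination current minus the generation current:

$$J_h = J_h^{\text{rec}} - J_h^{\text{gen}} = J_h^{\text{gen}}\left(e^{eV/k_BT} - 1\right). \tag{29.25}$$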

The  same  analysis  applies  to  the  components  of  the  electron  current,  except  that 

the  generation  and  recombination  currents  of  electrons  flow  oppositely  to  the  corre¬ 
sponding  currents  of  holes.  Since,  however,  the  electrons  are  oppositely  charged,  the 
electrical  generation  and  recombination  currents  of  electrons  are  parallel  to  the 
electrical  generation  and  recombination  currents  of  holes.  The  total  electrical  current 
density  is  thus: 


$$j = e\left(J_h^{\text{gen}} + J_e^{\text{gen}}\right)\left(e^{eV/k_BT} - 1\right). \tag{29.26}$$


This  has  the  highly  asymmetric  form  characteristic  of  rectifiers,  as  shown  in  Figure 
29.5. 
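The asymmetry is easy to see numerically. The following sketch evaluates the bias-dependent factor $e^{eV/k_BT} - 1$ of Eq. (29.26), i.e. the current in units of the saturation current; the function name, the temperature of 300 K, and the sample voltages are assumptions chosen for illustration.

```python
import math

K_B_EV = 8.617e-5  # Boltzmann constant in eV per kelvin

def current_ratio(v_volts, temperature=300.0):
    """Return j / [e (J_h^gen + J_e^gen)] = exp(eV / k_B T) - 1, Eq. (29.26)."""
    # With V in volts, eV in electron volts is numerically equal to V.
    return math.expm1(v_volts / (K_B_EV * temperature))

for v in (-0.2, -0.1, 0.0, 0.1, 0.2):
    print(f"V = {v:+.1f} V: j / j_sat = {current_ratio(v):.4g}")
```

Under reverse bias the ratio approaches $-1$ and stays there, while under forward bias it grows exponentially.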


Figure 29.5

Current vs. applied voltage $V$ for a p-n junction. The relation is valid
for $eV$ small compared with the energy gap, $E_g$. The saturation
current $(eJ_h^{\text{gen}} + eJ_e^{\text{gen}})$ varies with temperature as $e^{-E_g/k_BT}$,
as established below.

[Plot: $j(V) = e(J_h^{\text{gen}} + J_e^{\text{gen}})(e^{eV/k_BT} - 1)$ against $V$, with the reverse-bias and forward-bias branches labeled and the reverse-bias asymptote at $-e(J_h^{\text{gen}} + J_e^{\text{gen}})$.]

11 In assuming that (29.22) gives the dominant dependence of the hole recombination current on $V$,
we are assuming that the density of holes just on the p-side of the depletion layer differs only slightly from
$N_a$. We shall find that this is also the case provided that $eV$ is small compared with the energy gap $E_g$.
GENERAL PHYSICAL ASPECTS OF THE NONEQUILIBRIUM CASE

The foregoing discussion provides no estimate of the size of the prefactor $e(J_h^{\text{gen}} + J_e^{\text{gen}})$
appearing in (29.26). In addition, in the nonequilibrium ($V \ne 0$) case the local carrier
densities will not in general be determined by the local potential $\phi$ through the simple
equilibrium Maxwellian relations (29.3). In the nonequilibrium case it requires further
analysis to construct a picture of the carrier densities in the neighborhood of the
transition region that is comparable in detail to the picture we gave for the equilibrium
case.

In this more detailed approach it is not especially helpful to resolve the electron
and hole currents across the junction into generation and recombination currents.
Instead, at each point $x$ (both inside and outside the depletion layer) we shall write
equations relating the total electron and hole currents, $J_e(x)$ and $J_h(x)$, the electron
and hole densities, $n_c(x)$ and $p_v(x)$, and the potential $\phi(x)$ (or, equivalently, the electric
field, $E(x) = -d\phi(x)/dx$). We shall find five such equations, which will enable us, in
principle, to find these five quantities. This method is a direct generalization of the
approach we followed in our analysis of the equilibrium ($V = 0$) case. In equilibrium
the electron and hole currents vanish, there are only three unknowns, and the three
equations we used were Poisson's equation, and the two equations (29.3) that relate
$n_c(x)$ and $p_v(x)$ to $\phi(x)$ in thermal equilibrium. Thus the nonequilibrium problem can
be viewed as that of finding the appropriate equations to replace the equilibrium
relation (29.3), when $V \ne 0$ and currents flow.

We  first  observe  that  in  the  presence  of  both  an  electric  field  and  a  carrier  density 
gradient,  the  carrier  current  density  can  be  written  as  the  sum  of  a  term  proportional 
to  the  field  (the  drift  current )  and  a  term  proportional  to  the  density  gradient  (the 
diffusion  current ): 


$$\begin{aligned} J_e &= -\mu_n n_c E - D_n \frac{dn_c}{dx}, \\ J_h &= \mu_p p_v E - D_p \frac{dp_v}{dx}. \end{aligned} \tag{29.27}$$

The positive12 proportionality constants $\mu_n$ and $\mu_p$ appearing in Eq. (29.27) are
known as the electron and hole mobilities. We have introduced the mobilities, rather
than writing the drift current in terms of conductivities, to make explicit the manner
in which the drift current depends on the carrier densities. If only electrons at uniform
density are present, then $\sigma E = j = -eJ_e = e\mu_n nE$. Using the Drude form $\sigma = ne^2\tau/m$
for the conductivity (Eq. (1.6)) we find that

$$\mu_n = \frac{e\tau_n^{\text{coll}}}{m_n}, \tag{29.28}$$


12 The signs in Eq. (29.27) have been chosen to make the mobilities positive; the hole drift current is
along  the  field,  and  the  electron  drift  current  is  opposite  to  the  field. 
and, similarly,

$$\mu_p = \frac{e\tau_p^{\text{coll}}}{m_p}, \tag{29.29}$$

where $m_n$ and $m_p$ are the appropriate effective masses, and $\tau_n^{\text{coll}}$ and $\tau_p^{\text{coll}}$ are the carrier
collision times.13

The positive14 proportionality constants $D_n$ and $D_p$ appearing in Eq. (29.27) are
known as the electron and hole diffusion constants. They are related to the mobilities
by the Einstein relations:15

$$\mu_n = \frac{eD_n}{k_BT}, \qquad \mu_p = \frac{eD_p}{k_BT}. \tag{29.30}$$

The  Einstein  relations  follow  directly  from  the  fact  that  the  electron  and  hole  currents 
must  vanish  in  thermal  equilibrium:  Only  if  the  mobilities  and  diffusion  constants 
are  related  by  (29.30)  will  the  currents  given  by  (29.27)  be  zero  when  the  carrier 
densities  have  the  equilibrium  form  (29.3)16  (as  is  easily  verified  by  direct  substitution 
of  (29.3)  into  (29.27)). 
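The direct substitution can also be checked numerically. The sketch below evaluates the electron current of (29.27) for a density of the equilibrium form $n_c(x) \propto e^{e\phi(x)/k_BT}$ and confirms that it vanishes when the mobility and diffusion constant satisfy $\mu_n = eD_n/k_BT$. The tanh-shaped potential, the parameter values, and all function names are hypothetical choices made only for this illustration.

```python
import math

def electron_current(x, mu, d, kt_over_e, phi, dphi_dx, n0=1.0):
    """Electron number current J_e of Eq. (29.27).

    The density is taken to have the equilibrium form
    n_c(x) = n0 * exp(phi(x) / (k_B T / e)), with phi in volts.
    """
    n = n0 * math.exp(phi(x) / kt_over_e)
    dn_dx = n * dphi_dx(x) / kt_over_e   # derivative of the exponential
    e_field = -dphi_dx(x)                # E = -d(phi)/dx
    return -mu * n * e_field - d * dn_dx

# Hypothetical smooth potential step rising from the p-side to the n-side.
phi = lambda x: 0.3 * math.tanh(x)
dphi_dx = lambda x: 0.3 / math.cosh(x) ** 2

kt_over_e = 0.025           # k_B T / e in volts, roughly room temperature
d_n = 2.0                   # diffusion constant, arbitrary units
mu_n = d_n / kt_over_e      # Einstein relation: mu = e D / (k_B T)

print("J_e with Einstein relation:   ", electron_current(0.4, mu_n, d_n, kt_over_e, phi, dphi_dx))
print("J_e with mobility doubled:    ", electron_current(0.4, 2 * mu_n, d_n, kt_over_e, phi, dphi_dx))
```

Drift and diffusion cancel point by point only for the Einstein value of the mobility; any other ratio leaves a nonzero current in equilibrium.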

The  relation  (29.27)  giving  the  currents  in  terms  of  the  density  gradients  and  field, 
together  with  the  forms  (29.28)-(29.30)  for  the  mobilities  and  diffusion  constants, 
can  also  be  derived  directly  from  the  kind  of  simple  kinetic  argument  used  in  Chapter  1 
(see  Problem  2). 

Note  that  in  thermal  equilibrium,  Eq.  (29.27)  and  the  conditions  Je  =  Jh  =  0 
contain  all  information  necessary  to  determine  the  carrier  densities,  for  when  the 
currents  vanish  we  may  integrate  Eq.  (29.27)  to  rederive  (with  the  aid  of  the  Einstein 
relations (29.30)) the thermal equilibrium densities (29.3). When $V \ne 0$ and currents
flow, we require a further equation, which can be viewed as the generalization to
the nonequilibrium case of the equilibrium conditions of vanishing currents. If the
numbers of carriers were conserved, the required generalization would simply be
the equations of continuity,

$$\frac{\partial n_c}{\partial t} = -\frac{\partial J_e}{\partial x}, \qquad \frac{\partial p_v}{\partial t} = -\frac{\partial J_h}{\partial x}, \tag{29.31}$$


which  express  the  fact  that  the  change  in  the  number  of  carriers  in  a  region  is  entirely 
determined  by  the  rate  at  which  carriers  flow  into  and  out  of  the  region.  However, 
carrier  numbers  are  not  conserved.  A  conduction  band  electron  and  a  valence  band 


13  In  semiconductors  there  is  another  lifetime  of  fundamental  importance  (see  below),  the  recom¬ 
bination  time.  The  superscript  “coll”  has  been  affixed  to  the  collision  mean  free  times  to  distinguish  them 
from  the  recombination  times. 

14  They  are  positive  because  the  diffusion  current  flows  from  high-  to  low-density  regions.  In  zero 
field,  Eq.  (29.27)  is  sometimes  known  as  Fick’s  law. 

15  The  Einstein  relations  are  very  general,  arising  in  any  treatment  of  charged  particles  that  obey 
Maxwell-Boltzmann  statistics,  such  as  the  ions  in  an  electrolytic  solution. 

16  The  generalization  of  (29.30)  to  the  degenerate  case  is  described  in  Problem  3. 
hole can be generated by the thermal excitation of an electron out of a valence band
level. Furthermore a conduction band electron and a valence band hole can recombine
(i.e., the electron can drop into the empty level that is the hole), resulting in the
disappearance of one carrier of each type. Terms must be added to the continuity
equations describing these other ways in which the number of carriers in a region
can change:

$$\begin{aligned} \frac{\partial n_c}{\partial t} &= \left(\frac{dn_c}{dt}\right)_{g\text{-}r} - \frac{\partial J_e}{\partial x}, \\ \frac{\partial p_v}{\partial t} &= \left(\frac{dp_v}{dt}\right)_{g\text{-}r} - \frac{\partial J_h}{\partial x}. \end{aligned} \tag{29.32}$$

To determine the forms of $(dn_c/dt)_{g\text{-}r}$ and $(dp_v/dt)_{g\text{-}r}$ we note that generation and
recombination act to restore thermal equilibrium when the carrier densities deviate
from their equilibrium values. In regions where $n_c$ and $p_v$ exceed their equilibrium
values, recombination occurs faster than generation, leading to a decrease in the
carrier densities, while in regions where they fall short of their equilibrium values,
generation occurs faster than recombination, leading to an increase in the carrier
densities. In the simplest models these processes are described by electron and hole
lifetimes,17 $\tau_n$ and $\tau_p$. The rate at which each carrier density changes due to recombination
and generation is set proportional to its deviation from the form determined
by the other carrier density and the law of mass action (28.24):

$$\left(\frac{dn_c}{dt}\right)_{g\text{-}r} = -\frac{n_c - n_c^0}{\tau_n}, \qquad \left(\frac{dp_v}{dt}\right)_{g\text{-}r} = -\frac{p_v - p_v^0}{\tau_p}, \tag{29.33}$$

where $n_c^0 = n_i^2/p_v$, and $p_v^0 = n_i^2/n_c$.

To  interpret  these  equations,  note  that  the  first,  for  example,  expresses  the  change 
in  electron  carrier  density  due  to  generation  and  recombination  in  an  infinitesimal 
time  dt  as 


$$n_c(t + dt) = \left(1 - \frac{dt}{\tau_n}\right)n_c(t) + \left(\frac{dt}{\tau_n}\right)n_c^0. \tag{29.34}$$

The first term on the right of Eq. (29.34) expresses the destruction, through recombination,
of a fraction $dt/\tau_n$ of the electron carriers; i.e., $\tau_n$ is the average electronic
lifetime before recombination occurs. The second term on the right expresses the
creation through thermal generation of $n_c^0/\tau_n$ electron carriers per unit volume, per
unit time. Note that, as required, Eqs. (29.33) give carrier densities that decrease
when they exceed their equilibrium values, increase when they are less than their
equilibrium values, and do not change when they are equal to their equilibrium
values.
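The three behaviors just listed can be seen by iterating the update rule (29.34) directly. In the sketch below the equilibrium density $n_c^0$ is held fixed during the relaxation, which is a simplifying assumption of this example (in the text $n_c^0 = n_i^2/p_v$ depends on the hole density); the function name and the numerical values are likewise illustrative.

```python
def relax(n_start, n_eq, tau, dt, steps):
    """Iterate Eq. (29.34): n(t + dt) = (1 - dt/tau) n(t) + (dt/tau) n_eq.

    n_eq plays the role of n_c^0 and is held constant here (an assumption
    of this sketch); tau is the electron lifetime tau_n.
    """
    n = n_start
    for _ in range(steps):
        n = (1.0 - dt / tau) * n + (dt / tau) * n_eq
    return n

# Densities in units of the equilibrium value, time in units of tau.
for n_start in (2.0, 1.0, 0.5):
    n_end = relax(n_start, n_eq=1.0, tau=1.0, dt=0.01, steps=300)
    print(f"start {n_start:.1f} -> after 3 lifetimes {n_end:.4f}")
```

An excess density decays toward equilibrium, a deficit fills in, and a density that starts at the equilibrium value stays there.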

The lifetimes $\tau_n$ and $\tau_p$ are generally much longer than overall electron or hole
collision times, $\tau_n^{\text{coll}}$ and $\tau_p^{\text{coll}}$, for the recombination (or generation) of an electron
and a hole is an interband transition (electron goes from valence to conduction
band (generation) or conduction band to valence band (recombination)). Ordinary
collisions, which conserve the number of carriers, are intraband transitions. Reflecting
this, typical lifetimes range between $10^{-3}$ and $10^{-8}$ second, while the collision times
are similar to those found in metals, i.e., $10^{-12}$ or $10^{-13}$ second.

17 Also known as "recombination times." Conservation of total electric charge requires that the
recombination rates be proportional to the densities of the other carrier type: $(1/\tau_n)/(1/\tau_p) = p_v/n_c$.

In  the  presence  of  a  static  external  potential  the  p-n  junction,  though  not  in  thermal 
equilibrium,  is  in  a  steady  state;  i.e.,  the  carrier  densities  will  be  constant  in  time: 
$dn_c/dt = dp_v/dt = 0$. Using this fact and the forms (29.33) for the rates at which
recombination and generation change the carrier densities, we find that the continuity
equation (29.32) requires

$$\frac{dJ_e}{dx} + \frac{n_c - n_c^0}{\tau_n} = 0, \qquad \frac{dJ_h}{dx} + \frac{p_v - p_v^0}{\tau_p} = 0. \tag{29.35}$$

These are the equations that replace the equilibrium conditions $J_e = J_h = 0$, when
$V \ne 0$.

One very important application of Eqs. (29.35) and (29.27) is in regions where
the electric field $E$ is negligibly small and the majority carrier density is constant.
In that case the minority carrier drift current can be ignored compared with the
minority carrier diffusion current, and Eqs. (29.27) and (29.35) reduce to a single
equation for the minority carrier density with a constant recombination time:

$$D_n \frac{d^2 n_c}{dx^2} = \frac{n_c - n_c^0}{\tau_n}, \qquad D_p \frac{d^2 p_v}{dx^2} = \frac{p_v - p_v^0}{\tau_p}, \qquad (E \approx 0). \tag{29.36}$$

The solutions to these equations vary exponentially in $x/L$, where the lengths

$$L_n = (D_n\tau_n)^{1/2}, \qquad L_p = (D_p\tau_p)^{1/2} \tag{29.37}$$


are known as the electron and hole diffusion lengths. Suppose, for example, (to take
a case that will be of some importance below) that we are in the region of uniform
potential on the n-side of the depletion layer, so that the equilibrium density $p_v^0$ has
the constant value $p_v(\infty) = n_i^2/N_d$. If the density of holes is constrained to have the
value $p_v(x_0) \ne p_v(\infty)$ at a point $x_0$, then the solution to Eq. (29.36) for $x \ge x_0$ is

$$p_v(x) = p_v(\infty) + [p_v(x_0) - p_v(\infty)]\,e^{-(x - x_0)/L_p}. \tag{29.38}$$


Thus  the  diffusion  length  is  a  measure  of  the  distance  it  takes  for  the  density  to  relax 
back  to  its  equilibrium  value. 
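As a check on this solution, the sketch below evaluates the profile (29.38) and verifies by a finite-difference second derivative that it satisfies the hole equation in (29.36), written as $L_p^2\, d^2p_v/dx^2 = p_v - p_v(\infty)$. The function name, the step size, and the sample numbers are assumptions made for illustration.

```python
import math

def hole_density(x, x0, p_x0, p_inf, l_p):
    """Minority hole density of Eq. (29.38), valid for x >= x0."""
    return p_inf + (p_x0 - p_inf) * math.exp(-(x - x0) / l_p)

# Hypothetical values: density in units of p_v(infinity), length in units
# where the diffusion length L_p equals 2.
x0, p_x0, p_inf, l_p = 0.0, 5.0, 1.0, 2.0

# Finite-difference check of L_p^2 p'' = p - p_inf at one interior point.
x, h = 1.0, 1e-3
second = (hole_density(x + h, x0, p_x0, p_inf, l_p)
          - 2.0 * hole_density(x, x0, p_x0, p_inf, l_p)
          + hole_density(x - h, x0, p_x0, p_inf, l_p)) / h ** 2
print("L_p^2 p''  =", l_p ** 2 * second)
print("p - p_inf  =", hole_density(x, x0, p_x0, p_inf, l_p) - p_inf)
```

One diffusion length beyond $x_0$ the excess density has dropped by a factor of $e$, which is the sense in which $L_p$ sets the relaxation distance.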

One would expect that the distance $L$ over which a deviation from equilibrium
density can be maintained would be roughly the distance a carrier can travel before
undergoing recombination. This is not immediately obvious from the forms (29.37)
for the diffusion lengths $L_n$ and $L_p$, but it is revealed when one rewrites (29.37) using
(a) the Einstein relations (29.30) between the diffusion constant and the mobility,
(b) the Drude form (29.28), or (29.29), for the mobility, (c) the relation $\tfrac{1}{2}mv_{\text{th}}^2 = \tfrac{3}{2}k_BT$
between the mean square carrier velocity and the temperature under nondegenerate
conditions, and (d) the definition $\ell = v_{\text{th}}\tau^{\text{coll}}$ of the carrier mean free path between
collisions. Making these substitutions one finds:

$$L_n = \left(\frac{\tau_n}{3\tau_n^{\text{coll}}}\right)^{1/2}\ell_n, \qquad L_p = \left(\frac{\tau_p}{3\tau_p^{\text{coll}}}\right)^{1/2}\ell_p. \tag{29.39}$$

Assuming that the direction of a carrier is random after each collision, a series of
$N$ collisions can be viewed as a random walk of step length $\ell$. It is easily shown18
that in such a walk the total displacement is $N^{1/2}\ell$. Since the number of collisions a
carrier  can  undergo  in  a  recombination  time  is  the  ratio  of  the  recombination  time 
to  the  collision  time,  Eq.  (29.39)  does  indeed  show  that  the  diffusion  length  measures 
the  distance  a  carrier  can  go  before  undergoing  recombination. 

Using the typical values given above for the collision time and the (very
much longer) recombination time, we find that (29.39) gives a diffusion length that
can be between $10^2$ and $10^5$ mean free paths.
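The arithmetic behind this estimate is short enough to spell out. The sketch below assumes that the substitutions (a) through (d) listed earlier reduce (29.39) to $L/\ell = (\tau/3\tau^{\text{coll}})^{1/2}$, which follows from $D = \mu k_BT/e = \tau^{\text{coll}}k_BT/m = v_{\text{th}}^2\tau^{\text{coll}}/3$; the function name and the pairing of the extreme lifetimes with the extreme collision times are choices made for this example.

```python
import math

def diffusion_length_in_mfp(tau_rec, tau_coll):
    """Return L / l = (tau_rec / (3 tau_coll))^(1/2), following Eq. (29.39).

    tau_rec: recombination lifetime; tau_coll: collision time (same units).
    """
    return math.sqrt(tau_rec / (3.0 * tau_coll))

# Extremes of the typical ranges quoted in the text (seconds).
shortest = diffusion_length_in_mfp(tau_rec=1e-8, tau_coll=1e-12)
longest = diffusion_length_in_mfp(tau_rec=1e-3, tau_coll=1e-13)
print(f"L / l ranges from about {shortest:.3g} to about {longest:.3g}")
```

With the factor of 3 retained, the two extremes come out somewhat below $10^2$ and $10^5$ respectively, so the quoted range should be read as an order-of-magnitude statement.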

We can estimate the size of the generation currents that appear in the I-V relation
(29.26), in terms of the diffusion lengths and carrier lifetimes. We first note that, by
definition of the lifetime, holes are created by thermal generation at a rate $p_v^0/\tau_p$ per
unit volume. Such a hole stands an appreciable chance of entering the depletion layer
(and then being swiftly swept across to the p-side) before undergoing recombination,
provided that it is created within a diffusion length $L_p$ of the boundary of the depletion
layer. Therefore the flow of thermally generated holes per unit area into the depletion
layer per second will be of order $L_p p_v^0/\tau_p$. Since $p_v^0 = n_i^2/N_d$, we have

$$J_h^{\text{gen}} \sim \left(\frac{n_i^2}{N_d}\right)\frac{L_p}{\tau_p}, \tag{29.40}$$

and, similarly,

$$J_e^{\text{gen}} \sim \left(\frac{n_i^2}{N_a}\right)\frac{L_n}{\tau_n}. \tag{29.41}$$

The sum of the currents appearing in (29.40) and (29.41) is known as the saturation
current, since it is the maximum current that can flow through the junction
when $V$ is negative ("reverse bias"). Because the temperature dependence of $n_i^2$ is
dominated by the factor $e^{-E_g/k_BT}$ (Eq. (28.19)), the saturation current is strongly
temperature-dependent.
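To give a feeling for how strong this dependence is, the sketch below compares the factor $e^{-E_g/k_BT}$ at two temperatures. It deliberately keeps only the exponential and ignores the weaker temperature dependence of the prefactors, the lifetimes, and the diffusion lengths; the function name, the gap of 1.1 eV, and the temperatures are illustrative assumptions.

```python
import math

K_B_EV = 8.617e-5  # Boltzmann constant in eV per kelvin

def saturation_ratio(e_gap_ev, t1, t2):
    """Return exp(-E_g / k_B T2) / exp(-E_g / k_B T1).

    Only the dominant exponential factor is kept (an assumption of this
    sketch); all slower temperature dependences are ignored.
    """
    return math.exp(-(e_gap_ev / K_B_EV) * (1.0 / t2 - 1.0 / t1))

ratio = saturation_ratio(1.1, 300.0, 310.0)
print(f"Exponential factor grows by about {ratio:.2f}x from 300 K to 310 K")
```

For a gap of order 1 eV a rise of only ten kelvin near room temperature changes the exponential factor several-fold.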

A  MORE  DETAILED  THEORY  OF  THE  NONEQUILIBRIUM  p-n 
JUNCTION 

Using the concepts of drift and diffusion currents we can give a more detailed
description of the behavior of the p-n junction when $V \ne 0$. The equilibrium p-n


18  See  for  example,  F.  Reif,  Fundamentals  of  Statistical  and  Thermal  Physics,  McGraw-Hill,  New 
York,  1965,  p.  16. 
junction has two characteristic regions: the depletion layer, in which the electric
field,  space  charge,  and  carrier  density  gradients  are  large,  and  the  homogeneous 
regions  outside  of  the  depletion  layer,  in  which  they  are  quite  small.  In  the  non¬ 
equilibrium  case  the  position  beyond  which  the  electric  field  and  space  charge  are 
small  differs  from  the  position  beyond  which  the  carrier  density  gradients  are  small. 
Thus when $V \ne 0$, the p-n junction is characterized not by two, but by three different
regions  (described  compactly  in  Table  29.1): 

1.  The  Depletion  Layer  As  in  the  equilibrium  case,  this  is  a  region  in  which  the 
electric field, space charge, and carrier density gradients are all large. When $V \ne 0$,
according  to  Eq.  (29.20)  the  depletion  layer  is  narrower  than  or  wider  than 
is  the  case  for  V  =  0  depending  on  whether  V  is  positive  (forward  bias)  or 
negative  (reverse  bias). 

2.  The  Diffusion  Regions  These  are  regions  (extending  a  distance  of  the  order  of 
a diffusion length out from the boundaries of the depletion layer) in which the
electric  field  and  space  charge  are  small,  but  the  carrier  density  gradients  remain 
appreciable  (though  not  as  large  as  in  the  depletion  layer). 

3.  The  Homogeneous  Regions  Beyond  the  diffusion  regions  the  electric  field,  space 
charge,  and  carrier  density  gradients  are  all  very  small,  as  in  the  equilibrium 
homogeneous  regions. 

The diffusion region (2) is not present in the equilibrium case. It arises when $V \ne 0$
for  the  following  reason: 

In equilibrium ($V = 0$) the change in carrier densities across the depletion layer
is just enough to join the homogeneous equilibrium values on the high-density sides
($n_c(\infty) = N_d$, $p_v(-\infty) = N_a$), to the homogeneous equilibrium values on the
low-density sides19 ($n_c(-\infty) = n_i^2/N_a$, $p_v(+\infty) = n_i^2/N_d$). When $V \ne 0$, however, we
have  noted  that  the  extent  of  the  depletion  layer  and  the  size  of  the  potential  drop 
across  the  layer  differ  from  their  equilibrium  values.  Consequently  (as  we  shall  see 
explicitly  below)  the  change  in  carrier  densities  across  the  layer  can  no  longer  fit 
the  difference  in  the  homogeneous  equilibrium  values  appropriate  to  the  two  sides, 
and  a  further  region  must  appear  in  which  the  carrier  densities  relax  from  their  values 
at  the  boundaries  of  the  depletion  layer  to  the  values  appropriate  to  the  more  remote 
homogeneous  region  (Figure  29.6). 

Table  29.1  summarizes  these  properties,  and  also  indicates  the  characteristic 
behavior  of  the  electron  and  hole  drift  and  diffusion  currents  in  each  of  the  three 
regions,  when  a  current  j  flows  in  the  junction20: 

1.  In  the  depletion  layer  there  are  both  drift  and  diffusion  currents.  In  the  equilibrium 
case  they  are  equal  and  opposite  for  each  carrier  type,  yielding  no  net  currents  of 


19  In  discussing  the  equilibrium  case  above,  we  approximated  the  minority  carrier  densities  (i.e.,  the 
homogeneous  equilibrium  values  on  the  low-density  side)  by  zero  (cf.  note  7).  This  was  appropriate  because 
we were describing only the space-charge density, to which the minority carriers make a minute contribution
compared with the majority carriers. The contribution of the minority carriers to the current, however,
is  not  negligible,  and  it  is  necessary  for  us  to  use  the  values  quoted  here  (determined  by  the  values 
of  the  majority  carrier  densities  and  the  law  of  mass  action). 

20  In  the  steady  state  the  total  electric  current  is  uniform  along  the  junction:  j  cannot  depend  on  x. 
Figure 29.6

The density of holes (heavy curve) along a p-n junction with V > 0 (forward bias).
The vertical solid lines give the boundaries of the depletion layer and diffusion regions.
Note the break in vertical scale. For comparison, the density of holes when V = 0
(unbiased junction) is shown as a light curve, together with the boundaries of the
depletion layer in the unbiased case (dashed vertical lines). The density of electrons
behaves in a similar manner. When V is negative (reverse bias), the density of holes
falls below its asymptotic value in the diffusion region. Note that although the excess
density in the biased case over the unbiased case has the same size in both diffusion
regions, on the p-side it represents a minute percentage change in the carrier density,
while on the n-side it is a very large percentage change.

electrons  or  holes.  In  the  nonequilibrium  case  the  net  current  flowing  across  the 
depletion  layer  results  from  a  slight  imbalance  between  the  drift  and  diffusion 
currents  of  each  carrier  type ;  i.e.,  the  drift  and  diffusion  currents  are  separately 
quite  large  compared  with  the  total  current.  Once  we  have  constructed  a  complete 
picture  of  the  currents  flowing  in  the  junction  it  is  easy  to  verify  this  explicitly 
(Problem  4).  It  is  a  consequence  of  the  very  large  electric  field  and  density  gradients 
in the depletion layer (which more than compensate for the very low carrier
densities). 

2.  In  the  diffusion  regions,  the  carrier  densities  are  nearer  to  the  values  they  have 
in  the  homogeneous  regions.  The  majority  carrier  density  has  become  so  large 
that  its  drift  current  is  appreciable,  even  though  the  field  is  now  very  small; 
the  minority  carrier  drift  current  is  quite  negligible  in  comparison.  Because  the 
carrier  densities  continue  to  vary  in  the  diffusion  regions,  both  diffusion  currents 
(proportional  not  to  the  density,  but  its  gradient)  are  appreciable.  Typically, 
all  currents  in  the  diffusion  region  except  the  negligible  minority  carrier  drift 
current  are  of  order  j. 

3.  In  the  homogeneous  regions  the  diffusion  currents  are  negligible,  and  the  entire 
current  is  carried  by  the  majority  carrier  drift  current. 



Given  this  picture  of  the  individual  carrier  drift  and  diffusion  currents  we  can 
readily  calculate  the  total  current  j  flowing  in  the  junction  for  a  given  value  of  V.  To 
simplify  the  analysis  we  make  one  further  assumption:21  We  assume  that  the  passage 
of  carriers  across  the  depletion  layer  is  so  swift  that  negligible  generation  and 
recombination occur within the layer. If this is so, then the total currents of electrons
and holes, $J_e$ and $J_h$, will be constant across the depletion layer in the steady state.
Consequently in the expression $j = -eJ_e + eJ_h$ for the total current, we may
separately evaluate $J_e$ and $J_h$ at whatever points along the depletion layer it is most
convenient to do so. The most convenient point for the electron current is at the
boundary between the depletion layer and the diffusion region on the p-side, and the
most convenient point for the hole current is at the other boundary.22 We therefore write

$$ j = -eJ_e(-d_p) + eJ_h(d_n). \tag{29.42} $$

This  representation  is  useful  because  at  the  boundaries  between  the  depletion 
layer  and  the  diffusion  regions  the  minority  carrier  currents  are  purely  diffusive 
(see  Table  29.1).  Consequently,  if  we  could  calculate  the  position  dependence  of  the 
minority  carrier  densities  within  the  diffusion  regions,  we  could  immediately  compute 
their  currents,  using  Eq.  (29.27)  (with  E  =  0): 

$$ J_e(-d_p) = -D_n \left.\frac{dn_c}{dx}\right|_{x=-d_p}, \qquad J_h(d_n) = -D_p \left.\frac{dp_v}{dx}\right|_{x=d_n}. \tag{29.43} $$


Because, however, the minority carrier drift currents are negligible in the diffusion
regions, the minority carrier densities satisfy the diffusion equation (29.36). If we
let $p_v(d_n)$ be the density of holes at the boundary of the depletion layer on the n-side,
and if we note that far from that boundary on the n-side $p_v$ approaches the value
$p_v(\infty) = n_i^2/N_d$, then the solution (29.38) to the diffusion equation (29.36) is

$$ p_v(x) = \frac{n_i^2}{N_d} + \left[p_v(d_n) - \frac{n_i^2}{N_d}\right] e^{-(x-d_n)/L_p}, \qquad x \ge d_n. \tag{29.44} $$


Similarly, the electron density within the diffusion region on the p-side is given by

$$ n_c(x) = \frac{n_i^2}{N_a} + \left[n_c(-d_p) - \frac{n_i^2}{N_a}\right] e^{(x+d_p)/L_n}, \qquad x \le -d_p. \tag{29.45} $$
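The exponential relaxation described by (29.44) and (29.45) is easy to tabulate numerically. The following is a minimal sketch and not part of the original text: the function name and argument names are illustrative assumptions, and any consistent set of units may be used.

```python
import math

def minority_density(distance, edge_density, bulk_density, diffusion_length):
    """Minority carrier density a given distance (>= 0) into a diffusion region.

    `distance` is measured outward from the depletion-layer boundary, so it
    stands for (x - d_n) on the n-side or -(x + d_p) on the p-side.
    `edge_density` is the density at the boundary, and `bulk_density` is the
    homogeneous equilibrium value (n_i^2/N_d or n_i^2/N_a).
    """
    if distance < 0:
        raise ValueError("distance is measured outward from the depletion layer")
    excess = edge_density - bulk_density
    return bulk_density + excess * math.exp(-distance / diffusion_length)
```

With forward bias the edge density exceeds the bulk value and the excess decays over a diffusion length; with reverse bias the edge density lies below the bulk value and the deficit heals in the same way.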


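Substituting these densities into (29.43) we find that the minority carrier currents
at the boundaries of the depletion layer are

$$ J_e(-d_p) = -\frac{D_n}{L_n}\left[n_c(-d_p) - \frac{n_i^2}{N_a}\right], \qquad J_h(d_n) = \frac{D_p}{L_p}\left[p_v(d_n) - \frac{n_i^2}{N_d}\right], \tag{29.46} $$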
so that the total current, (29.42), is

$$ j = \frac{eD_n}{L_n}\left[n_c(-d_p) - \frac{n_i^2}{N_a}\right] + \frac{eD_p}{L_p}\left[p_v(d_n) - \frac{n_i^2}{N_d}\right]. \tag{29.47} $$


It  only  remains  for  us  to  find  the  amounts  by  which  the  minority  carrier  densities 
differ  from  their  homogeneous  equilibrium  values  at  the  boundaries  of  the  depletion 
layer.  In  equilibrium  we  found  the  variation  in  carrier  densities  across  the  depletion 
layer  by  using  the  equilibrium  expression  (29.3)  for  the  variation  of  the  carrier  densities 
in  a  potential  <p(x).  We  have  noted  above  that  this  expression  follows  from  the  fact 
that  in  equilibrium  the  drift  currents  are  equal  and  opposite  to  the  diffusion  currents. 
In  the  general  nonequilibrium  case  (e.g.,  in  the  diffusion  region)  the  drift  and  diffusion 
currents  are  not  in  balance,  and  Eq.  (29.3)  does  not  hold.  However,  in  the  depletion 
layer  there  is  a  near  balance  between  drift  and  diffusion  currents,23  and  therefore 
to  a  reasonable  approximation  the  carrier  densities  do  obey  Eq.  (29.3),  changing  by 
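a factor $e^{-e\Delta\phi/k_BT}$ as the depletion layer is crossed:

$$ n_c(-d_p) = n_c(d_n)e^{-e\Delta\phi/k_BT} = [n_c(d_n)e^{-e(\Delta\phi)_0/k_BT}]e^{eV/k_BT}, $$
$$ p_v(d_n) = p_v(-d_p)e^{-e\Delta\phi/k_BT} = [p_v(-d_p)e^{-e(\Delta\phi)_0/k_BT}]e^{eV/k_BT}. \tag{29.48} $$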

When $eV \ll E_g$, then V will be small compared with $(\Delta\phi)_0$, and the carrier densities
on the minority side [$n_c(-d_p)$ and $p_v(d_n)$] will continue to be very small compared
with their values on the majority side [$n_c(d_n)$ and $p_v(-d_p)$], just as they are when
V = 0. Consequently, the conditions that the space charge vanish at the boundaries
of the depletion layer,

$$ n_c(d_n) - p_v(d_n) = N_d, \qquad p_v(-d_p) - n_c(-d_p) = N_a, \tag{29.49} $$

give values for the majority carrier densities $n_c(d_n)$ and $p_v(-d_p)$ that differ from their
equilibrium values $N_d$ and $N_a$ only by factors that are very close to unity. Thus, to
an excellent approximation, when $eV \ll E_g$,

$$ n_c(-d_p) = [N_d e^{-e(\Delta\phi)_0/k_BT}]e^{eV/k_BT}, \qquad p_v(d_n) = [N_a e^{-e(\Delta\phi)_0/k_BT}]e^{eV/k_BT}, \tag{29.50} $$

or, equivalently,24

$$ n_c(-d_p) = \frac{n_i^2}{N_a}e^{eV/k_BT}, \qquad p_v(d_n) = \frac{n_i^2}{N_d}e^{eV/k_BT}. \tag{29.51} $$

Substituting these results into the expression (29.47) for the total current, we find
that

$$ j = e n_i^2\left(\frac{D_n}{L_n N_a} + \frac{D_p}{L_p N_d}\right)(e^{eV/k_BT} - 1). \tag{29.52} $$

23  This  is  verified  in  Problem  4. 

24 This follows from the form (29.6) of $(\Delta\phi)_0$ and the form (28.19) of $n_i$. It also follows directly from
(29.50) by requiring that it yield the correct equilibrium values $n_c(-d_p) = n_i^2/N_a$ and $p_v(d_n) = n_i^2/N_d$,
when V = 0.
This has the form (29.26) with the generation currents given explicitly by

$$ J_e^{\text{gen}} = \frac{n_i^2}{N_a}\frac{D_n}{L_n}, \qquad J_h^{\text{gen}} = \frac{n_i^2}{N_d}\frac{D_p}{L_p}. \tag{29.53} $$


If  we  eliminate  the  diffusion  constants  appearing  in  (29.53)  with  the  aid  of  (29.37), 
the  expressions  for  the  generation  currents  agree  with  the  rough  estimates  (29.40) 
and  (29.41). 
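The rectifying character of (29.52) is easiest to appreciate numerically. The following is a minimal sketch, not taken from the text: the function names are illustrative assumptions, and the caller is assumed to supply all quantities in one consistent system of units.

```python
import math

def saturation_current(e, n_i, N_a, N_d, D_n, D_p, L_n, L_p):
    """Prefactor of Eq. (29.52): e n_i^2 (D_n/(L_n N_a) + D_p/(L_p N_d))."""
    return e * n_i**2 * (D_n / (L_n * N_a) + D_p / (L_p * N_d))

def junction_current(V, j_sat, thermal_voltage):
    """Current density j(V) = j_sat (exp(eV/k_B T) - 1).

    V and thermal_voltage (= k_B T / e) must be in the same units,
    for example volts. expm1 keeps the result accurate near V = 0.
    """
    return j_sat * math.expm1(V / thermal_voltage)
```

At room temperature $k_BT/e$ is about 0.026 volt, so a reverse bias of a few tenths of a volt already gives a current indistinguishable from $-j_{sat}$, while a forward bias of the same size increases the current by many orders of magnitude.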


PROBLEMS 


1.  The  Depletion  Layer  in  Thermal  Equilibrium 

(a) Show that if the exact (nondegenerate) form (29.3) is retained for the carrier densities,
then Poisson's equation (which we approximated by Eq. (29.13) in the text) becomes the following
differential equation for the variable $\psi = (e\phi + \mu - \mu_i)/k_BT$:

$$ \frac{d^2\psi}{dx^2} = K^2\left(\sinh\psi - \frac{\Delta N(x)}{2n_i}\right), \tag{29.54} $$

where $K^2 = 8\pi n_i e^2/k_BT\epsilon$, $\Delta N(x)$ is the doping profile, $\Delta N(x) = N_d(x) - N_a(x)$, and $n_i$ and $\mu_i$
are the carrier density and chemical potential for an impurity-free sample at the same temperature.

(b) The text discussed the case of a p-n junction made of highly extrinsic material, with
$N_d, N_a \gg n_i$. In the opposite case of a lightly doped nearly intrinsic semiconductor, with

$$ n_i \gg N_d, N_a, \tag{29.55} $$

we can find the electrostatic potential to high precision for an arbitrary doping profile, as follows:

(i) Assume that $\psi \ll 1$, so that $\sinh\psi \approx \psi$. Show that the solution to (29.54) is then given
by:

$$ \psi(x) = \frac{K}{2}\int_{-\infty}^{\infty} dx'\, e^{-K|x-x'|}\,\frac{\Delta N(x')}{2n_i}. \tag{29.56} $$


(ii) Show that this solution and (29.55) imply that $\psi$ is indeed much less than 1, justifying
the initial ansatz.

(iii) Show that if $\Delta N$ varies along more than one dimension, then in the nearly intrinsic
case:

$$ \phi(\mathbf{r}) = e\int d\mathbf{r}'\, \Delta N(\mathbf{r}')\,\frac{e^{-K|\mathbf{r}-\mathbf{r}'|}}{\epsilon|\mathbf{r}-\mathbf{r}'|}. \tag{29.57} $$

(iv) The above result is identical in form to the screened Thomas-Fermi potential produced
by impurities in a metal [Eq. (17.54)]. Show that the Thomas-Fermi wave vector [Eq. (17.50)]
for a free electron gas has precisely the form of K, except that $v_F$ must be replaced by the thermal
velocity appropriate to Boltzmann statistics, and the carrier density must be taken as $2n_i$. (Why
this last factor of 2?) The quantity $K^{-1}$ is the screening length of Debye-Hückel theory.
(c) Some insight into the general solution to (29.54) may be gained by the simple expedient
of changing the names of the variables:

$$ \psi \to u, \qquad x \to t, \qquad K^2 \to \frac{1}{m}. \tag{29.58} $$

The equation then describes the displacement u of a particle of mass m moving under the influence
of a force that depends upon position (u) and time (t). In the case of an abrupt junction this force
is time-independent before and after t = 0. Sketch the “potential energy” before and after t = 0,
and deduce from your sketch a qualitative argument that the solution to (29.54) that becomes
asymptotically constant as $x \to \pm\infty$ can vary appreciably only in the neighborhood of x = 0.

(d) Show that conservation of “energy” before and after t = 0 in the mechanical model for
the abrupt junction described above permits one to show that the exact potential at x = 0 is
that given by the approximate solution (29.14) plus a correction $\Delta\phi$, given by:

$$ \Delta\phi = -\frac{k_BT}{e}\left(\frac{\sqrt{N_d^2 + 4n_i^2} - \sqrt{N_a^2 + 4n_i^2}}{N_d + N_a}\right). \tag{29.59} $$

Comment on how important a correction to $\phi$ this is, and how reliable the carrier densities
(29.12) given by the approximate solution (29.14) are likely to be in the depletion region.

(e)  As  in  (d),  find  and  discuss  the  approximate  and  exact  electric  fields  at  x  =  0. 


2.  Derivation  of  the  Einstein  Relations  from  Kinetic  Theory 

Show  that  the  phenomenological  equations  (29.27)  relating  the  carrier  currents  to  the  electric 
field  and  carrier  density  gradients,  follow  from  elementary  kinetic  arguments  such  as  were  used 
in  Chapter  1,  with  mobilities  of  the  form  (29.28)  and  (29.29),  and  diffusion  constants  of  the  form 

$$ D = \tfrac{1}{3}\langle v^2\rangle\tau^{\text{coll}}. \tag{29.60} $$

Show that the Einstein relations (29.30) are satisfied provided the mean square thermal velocity
$\langle v^2\rangle$ is given by Maxwell-Boltzmann statistics.

3.  Einstein  Relations  in  the  Degenerate  Case 

When dealing with degenerate inhomogeneous semiconductors one must generalize the equilibrium
carrier densities (29.3) to

$$ n_c(x) = n_c^0(\mu + e\phi(x)), \qquad p_v(x) = p_v^0(\mu + e\phi(x)), \tag{29.61} $$

where $n_c^0(\mu)$ and $p_v^0(\mu)$ are the carrier densities of the homogeneous semiconductor as a function
of chemical potential.25

(a) Show that the expression (29.9) for $\Delta\phi$ and the interpretation that precedes it continue
to follow directly from (29.61).

(b) Show by a slight generalization of the argument on page 602 that

$$ \mu_n = eD_n\frac{1}{n}\frac{\partial n}{\partial\mu}, \qquad \mu_p = -eD_p\frac{1}{p}\frac{\partial p}{\partial\mu}. \tag{29.62} $$


25 Note that the functional forms of $n_c^0(\mu)$ and $p_v^0(\mu)$ do not depend on the doping (though of course
the value of $\mu$ does).
(c) In an inhomogeneous semiconductor, not in equilibrium, with carrier densities $n_c(x)$
and $p_v(x)$, one sometimes defines electron and hole quasichemical potentials26 $\mu_e(x)$ and $\mu_h(x)$ by
requiring the carrier densities to have the equilibrium form (29.61):

$$ n_c(x) = n_c^0(\mu_e(x) + e\phi(x)), \qquad p_v(x) = p_v^0(\mu_h(x) + e\phi(x)). \tag{29.63} $$

Show that, as a consequence of the Einstein relations (29.62), the total drift plus diffusion currents
are just

$$ J_e = -\mu_n n_c\frac{d}{dx}\frac{1}{e}\mu_e(x), \qquad J_h = \mu_p p_v\frac{d}{dx}\frac{1}{e}\mu_h(x). \tag{29.64} $$

Note that these have the form of pure drift currents in an electrostatic potential $\phi = (-1/e)\mu$.

4.  Drift  and  Diffusion  Currents  in  the  Depletion  Layer 

Noting that the electric field in the depletion layer is of order $\Delta\phi/d$, $d = d_n + d_p$, and that the
carrier densities there exceed their minority values substantially (except at the edges of the layer),
show  that  the  assumption  that  the  drift  (and  hence  diffusion)  currents  in  the  depletion  layer  greatly 
exceed  the  total  current  is  very  well  satisfied. 

5.  Fields  in  the  Diffusion  Region 

Verify the assumption that the potential $\phi$ undergoes negligible variation in the diffusion region,
by estimating its change across the diffusion region as follows:

(a) Find the electron drift current at $d_n$ by noting that the total electron current is continuous
across the depletion layer and calculating explicitly the electron diffusion current at $d_n$.

(b) Noting that the electron density is very close to $N_d$ at $d_n$, find an expression for the electric
field at $d_n$ necessary to produce the drift current calculated in (a).

(c) Assuming that the field found in (b) sets the scale for the electric field in the diffusion
region, show that the change in $\phi$ across the diffusion region is of order $(k_BT/e)(n_i/N_d)^2$.

(d)  Why  is  this  indeed  negligible? 

6.  Saturation  Current 

Estimate  the  size  of  the  saturation  electric  current  in  a  p-n  junction  at  room  temperature,  if  the 
band gap is 0.5 eV, the donor (or acceptor) concentrations $10^{18}$/cm$^3$, the recombination times
$10^{-5}$ second, and the diffusion lengths $10^{-4}$ cm.


26 Since we are not in equilibrium, $\mu_e$ need not equal $\mu_h$.
By a crystalline defect one generally means any region where the microscopic arrangement
of ions differs drastically from that of a perfect crystal. Defects are called surface,
line,  or  point  defects,  according  to  whether  the  imperfect  region  is  bounded  on  the 
atomic  scale  in  one,  two,  or  three  dimensions. 

Like  human  defects,  those  of  crystals  come  in  a  seemingly  endless  variety,  many 
dreary  and  depressing,  and  a  few  fascinating.  In  this  chapter  we  shall  describe  a  few 
of  those  imperfections  whose  presence  has  a  profound  effect  on  at  least  one  major 
physical  property  of  the  solid.  One  could  argue  that  almost  any  defect  meets  this 
test;  for  example,  isotopic  inhomogeneity  can  alter  both  the  phonon  spectrum  and 
the  character  of  the  neutron  scattering.  The  examples  we  will  consider,  however, 
are  somewhat  more  dramatic.1  The  two  most  important  kinds  of  defects  we  shall 
mention  are: 

1.  Vacancies  and  Interstitials  These  are  point  defects,  consisting  of  the  absence 
of  ions  (or  presence  of  extra  ions).  Such  defects  are  entirely  responsible  for  the 
observed  electrical  conductivity  of  ionic  crystals,  and  can  profoundly  alter  their 
optical  properties  (and,  in  particular,  their  color).  Furthermore,  their  presence 
is  a  normal  thermal  equilibrium  phenomenon,  so  they  can  be  an  intrinsic  feature 
of  real  crystals. 

2.  Dislocations  These  are  line  defects  which,  though  probably  absent  from  the 
ideal  crystal  in  thermal  equilibrium,  are  almost  invariably  present  in  any  real 
specimen.  Dislocations  are  essential  in  explaining  the  observed  strength  (or 
rather,  the  lack  of  shear  strength)  of  real  crystals,  and  the  observed  rates  of 
crystal  growth. 


POINT  DEFECTS:  GENERAL  THERMODYNAMIC  FEATURES 

Point  defects  are  present  even  in  the  thermal  equilibrium  crystal,  as  we  can  illustrate 
by  considering  only  the  simplest  kind :  a  vacancy  or  Schottky  defect  in  a  monatomic 
Bravais  lattice.  A  vacancy  occurs  whenever  a  Bravais  lattice  site  that  would  normally 
be  occupied  by  an  ion  in  the  perfect  crystal  has  no  ion  associated  with  it  (Figure  30.1). 
If the number, n, of such vacancies at temperature T is an extensive thermodynamic
variable  (i.e.,  if  it  is  proportional  to  the  total  number  of  ions,  N,  when  N  is  very  large), 


OOOOOOOOOOOOOOOO
OOOOOOOOOOOOOOOO
OOOOOOOOOOOOOOOO
OOOOOO  OOOOOOOOO
OOOOOOOOOOOOOOOO
OOOOOOOOOOOOOOOO
OOOOOOOOOOOOOOOO

Figure 30.1
A portion of a monatomic Bravais lattice containing a vacancy or Schottky defect.


1  Not  only  is  our  choice  of  defects  highly  selective,  but  we  have  also  included  in  the  chapter  some 
phenomena  (polarons  and  excitons)  that  are  not  generally  regarded  as  defects  at  all.  This  is  because  they 
bear  strong  resemblances  to  other  phenomena  that  are  regarded  as  defects,  and  therefore  arise  rather 
naturally  in  such  a  discussion. 
then we can estimate its size by minimizing the appropriate thermodynamic potential.
If  the  crystal  is  at  constant  pressure  P,  this  will  be  the  Gibbs  free  energy, 

G  =  U  -  TS  +  PV. 


To see how G depends on n, it is simplest to think of an N-ion crystal containing
n vacant sites as an (N + n)-ion perfect crystal from which n ions have been removed.
Thus the volume, V(n), will to a first approximation be just $(N + n)v_0$, where $v_0$ is
the volume per ion in the perfect crystal.

For  any  particular  choice  of  the  n  sites  to  be  deprived  of  their  ions,  we  can,  in 
principle,  calculate  F0(n)  =  U  —  TS  for  that  particular  imperfect  crystal.  If  n  is 
very  small2  compared  with  N,  then  we  might  expect  this  to  depend  only  on  the  number 
of  vacancies  but  not  on  their  spatial  arrangement.3  We  must  add  to  the  entropy  S 
for a fixed configuration of vacancies, a further contribution $S^{\text{config}}$ expressing the
disorder arising from the $(N + n)!/N!\,n!$ ways of choosing the n vacant sites out of
the N + n:

$$ S^{\text{config}} = k_B \ln\frac{(N + n)!}{N!\,n!}. \tag{30.1} $$


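Thus the full Gibbs free energy is

$$ G(n) = F_0(n) - TS^{\text{config}} + P(N + n)v_0. \tag{30.2} $$

Using Stirling's formula, valid for large X,

$$ \ln X! \approx X(\ln X - 1), \tag{30.3} $$

we can evaluate

$$ \frac{\partial S^{\text{config}}}{\partial n} = k_B\ln\left(\frac{N + n}{n}\right) \approx k_B\ln\left(\frac{N}{n}\right), \qquad n \ll N, \tag{30.4} $$

and therefore

$$ \frac{\partial G}{\partial n} = \frac{\partial F_0}{\partial n} + Pv_0 - k_BT\ln\left(\frac{N}{n}\right). \tag{30.5} $$

When $n \ll N$, we can write

$$ \frac{\partial F_0}{\partial n} \approx \left.\frac{\partial F_0}{\partial n}\right|_{n=0} = \mathcal{E}, \tag{30.6} $$

where $\mathcal{E}$ is independent of n. Therefore (30.5) tells us that G is minimized by

$$ n = Ne^{-(\mathcal{E} + Pv_0)/k_BT}. \tag{30.7} $$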
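The step from (30.1) to (30.4) can be written out explicitly. The following is a sketch of the intermediate algebra, using only Stirling's formula (30.3); it is added for convenience and is not part of the original derivation.

```latex
\begin{align*}
\frac{S^{\text{config}}}{k_B} &= \ln (N+n)! - \ln N! - \ln n! \\
  &\approx (N+n)\ln(N+n) - N\ln N - n\ln n, \\
\frac{1}{k_B}\frac{\partial S^{\text{config}}}{\partial n}
  &= \ln(N+n) + 1 - \ln n - 1 = \ln\frac{N+n}{n}.
\end{align*}
```

The terms linear in N and n produced by Stirling's formula cancel in the second line. Setting the derivative (30.5) to zero with the constant value (30.6) then gives $k_BT\ln(N/n) = \mathcal{E} + Pv_0$, which is (30.7).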


2 This is not inconsistent with our assertion that n is an extensive variable of order N. Extensivity
requires $\lim_{N\to\infty}(n/N) \neq 0$. However, n being small compared with N requires not that the limit vanish,
but only that it be much less than unity. This is always the case for point defects in crystals. Indeed, if the
number of defects were close to the number of ions, we would have no business speaking of a crystal at all.

3 This will certainly not be true for configurations in which appreciable numbers of vacancies lie
close together, since the presence of one can then affect the energy required to form another. However,
when $n \ll N$, such configurations will be quite rare.
To calculate $\mathcal{E}$ we could (as in Chapter 22) write the total potential energy of a
typical (N + n) ion lattice with n vacancies as $U = U^{\text{eq}} + U^{\text{harm}}$ (see (22.8)). We
could then calculate $F_0$ from the partition function

$$ e^{-\beta F_0} = \sum_E e^{-\beta E} = e^{-\beta U^{\text{eq}}}\sum_{E_{\text{harm}}} e^{-\beta E_{\text{harm}}}, \qquad \beta = \frac{1}{k_BT}, \tag{30.8} $$

where $E_{\text{harm}}$ runs through the eigenvalues of the harmonic part of the Hamiltonian.
Evidently this will yield an $F_0$, which is the equilibrium potential energy of the lattice
with vacancies plus the free energy of the phonons

$$ F_0 = U^{\text{eq}} + F^{\text{ph}}. \tag{30.9} $$

The second term is generally small compared with the first, so that to a first approximation
$\mathcal{E}$ is just

$$ \mathcal{E}_0 = \left.\frac{\partial U^{\text{eq}}}{\partial n}\right|_{n=0}, \tag{30.10} $$

the temperature-independent potential energy required to remove one ion. At normal
(e.g., atmospheric) pressures $Pv_0$ is negligible in comparison, and therefore

$$ n = Ne^{-\beta\mathcal{E}_0}. \tag{30.11} $$

Since $\mathcal{E}_0$ can be expected to be of the order of electron volts,4 n/N will indeed be
small, but not zero.

The phonon correction to (30.11) from the second term in (30.9) generally increases
n somewhat. This is because the introduction of vacancies tends to lower some of
the normal-mode frequencies (and hence the associated phonon energies), thereby
leading to a negative $\partial F^{\text{ph}}/\partial n$. A simple model of this effect is discussed in Problem 1.
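To get a feeling for the sizes implied by (30.11), one can evaluate the Boltzmann factor directly. The following is a minimal sketch, not part of the original text; the function name is an illustrative assumption, and the formation energy used in the usage note is a round number chosen only for illustration.

```python
import math

K_B_EV_PER_K = 8.617333e-5  # Boltzmann constant in eV/K

def vacancy_fraction(formation_energy_ev, temperature_k):
    """Equilibrium vacancy fraction n/N = exp(-E_0 / k_B T), Eq. (30.11).

    Ignores the P*v_0 term and the phonon correction discussed in the text.
    """
    return math.exp(-formation_energy_ev / (K_B_EV_PER_K * temperature_k))
```

For a formation energy of 1 eV the fraction is of order $10^{-5}$ at 1000 K and smaller by many orders of magnitude at room temperature, which illustrates both that n/N is small and how steeply it depends on temperature.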


The  analysis  above  has  assumed  that  only  one  kind  of  point  defect  can  occur: 
a  vacancy  at  a  Bravais  lattice  site.  In  general,  of  course,  there  can  be  more  than  one 
kind  of  vacancy  (in  polyatomic  lattices).  There  is  also  the  possibility  of  extra  ions 
occupying  regions  not  occupied  in  the  perfect  crystal,  a  type  of  point  defect  known 
as an interstitial. Thus we should generalize our analysis to allow for $n_j$ point defects
of the jth type. If all $n_j$ are small compared with N, each type of defect will occur in
numbers given by the obvious generalization of (30.7) (ignoring the small correction
$Pv_0$):

$$ n_j = N_je^{-\beta\mathcal{E}_j}, \qquad \mathcal{E}_j = \frac{\partial F_0}{\partial n_j}, \tag{30.12} $$

where $N_j$ is the number of sites where a defect of type j can occur.

The $\mathcal{E}_j$ are generally quite large compared with $k_BT$ and if, in addition, the smallest
two values of $\mathcal{E}_j$ (say $\mathcal{E}_1$ and $\mathcal{E}_2$) are themselves far apart compared with $k_BT$, then $n_1$
will be very much greater than all the other $n_j$; i.e., the defect with the smallest $\mathcal{E}_j$
will be overwhelmingly the most abundant type.

However, (30.12) is only correct if the numbers of each kind of defect are independent,
since it follows from minimizing the free energy independently with respect


4 We would expect it to be roughly the size of the cohesive energy per particle. See Chapter 20.

to all the $n_j$. Should there be constraints among the $n_j$, we must reexamine the problem.
The  most  important  such  constraint  is  charge  neutrality.  We  cannot  have  a  set  of 
defects  consisting  entirely  of  positive  ion  vacancies  in  an  ionic  crystal,  without  creating 
unbalanced  positive  charge,  with  its  prohibitively  large  Coulomb  energy.  This  excess 
charge  must  be  balanced  either  by  positive  ion  interstitials,  negative  ion  vacancies, 
or  some  combination  of  these.5  Thus  the  free  energy  must  be  minimized  subject  to 
the  constraint : 

$$ 0 = \sum_j q_jn_j, \tag{30.13} $$

where $q_j$ is the charge of the jth defect type ($q_j = +e$ for a negative ion vacancy or
positive ion interstitial, and $q_j = -e$ for a positive ion vacancy or negative ion
interstitial). If we introduce a Lagrange multiplier $\lambda$, then we can take the constraint
into account by minimizing not G, but $G + \lambda\sum_j q_jn_j$. This leads to the replacement
of (30.12) by

$$ n_j = N_je^{-\beta(\mathcal{E}_j + \lambda q_j)}, \tag{30.14} $$

where the unknown $\lambda$ is determined by requiring that (30.14) satisfy the constraint
(30.13).

Usually the lowest $\mathcal{E}_j$ for each charge type is separated in energy by many $k_BT$
from the next lowest.6 Consequently there will be one dominant type of defect of
each charge, whose numbers are given by

$$ n_+ = N_+e^{-\beta(\mathcal{E}_+ + \lambda e)}, \qquad n_- = N_-e^{-\beta(\mathcal{E}_- - \lambda e)}, \qquad \mathcal{E}_\pm = \min_{(q_j = \pm e)}(\mathcal{E}_j). \tag{30.15} $$
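Since the densities of all the other defect types satisfy

$$ n_j \ll n_+, \quad q_j = +e; \qquad n_j \ll n_-, \quad q_j = -e, \tag{30.16} $$

charge neutrality requires to high precision that

$$ n_+ = n_-. \tag{30.17} $$

Since (30.15) also requires

$$ n_+n_- = N_+N_-e^{-\beta(\mathcal{E}_+ + \mathcal{E}_-)}, \tag{30.18} $$

we find that

$$ n_+ = n_- = \sqrt{N_+N_-}\,e^{-\beta(\mathcal{E}_+ + \mathcal{E}_-)/2}. \tag{30.19} $$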

Thus  the  constraint  of  charge  neutrality  reduces  the  concentration  of  the  most 
abundant  defect  type  and  raises  the  concentration  of  the  most  abundant  type  of 
the  opposite  charge.  It  does  so  in  a  manner  that  changes  the  values  they  would  have 
in  the  absence  of  the  constraint  to  the  geometric  mean  of  those  values. 
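The geometric-mean statement can be checked numerically by solving the neutrality condition for the Lagrange multiplier. The following is a minimal sketch, not part of the original text: the function name and the closed form used for the multiplier (obtained by equating the two expressions in (30.15)) are worked out here for illustration.

```python
import math

def neutral_defect_numbers(N_plus, N_minus, eps_plus, eps_minus, kT):
    """Return (n_plus, n_minus, a, b) for the two dominant defect types.

    a and b are the unconstrained numbers N e^{-eps/kT} of (30.12);
    n_plus and n_minus are the constrained numbers of (30.15), with the
    product lambda*e chosen so that n_plus equals n_minus (charge neutrality).
    Energies and kT must share the same units.
    """
    a = N_plus * math.exp(-eps_plus / kT)
    b = N_minus * math.exp(-eps_minus / kT)
    # Equating a*exp(-lam_e/kT) and b*exp(+lam_e/kT) gives exp(2*lam_e/kT) = a/b.
    lam_e = 0.5 * kT * math.log(a / b)
    n_plus = a * math.exp(-lam_e / kT)
    n_minus = b * math.exp(lam_e / kT)
    return n_plus, n_minus, a, b
```

Whichever of a and b is larger is pulled down and the other pushed up, and both constrained numbers land on $\sqrt{ab}$, in agreement with (30.19).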

Even  in  simple  diatomic  ionic  crystals,  there  are  several  ways  in  which  charge 
neutrality  can  thus  be  achieved  (Figure  30.2).  There  can  be  essentially  equal  numbers 


5  We  ignore  the  possibility  of  color  center  formation.  See  below. 

6  When  this  is  not  the  case,  then  the  distinction  developed  below  between  Schottky  and  Frenkel 
defects cannot be made. See Problem 2.
Figure 30.2
(a) A perfect ionic crystal. (b) An ionic crystal with point defects of the Schottky type
(equal numbers of positive and negative ion vacancies). (c) An ionic crystal with defects
of the Frenkel type (equal numbers of positive ion vacancies and interstitials).

of positive and negative ion vacancies, known, in this context, also as Schottky
defects.  On  the  other  hand,  there  can  be  essentially  equal  numbers  of  vacancies  and 
interstitials  of  the  same  ion,  known  as  Frenkel  defects.  Alkali  halides  have  defects 
of  the  Schottky  type;  silver  halides,  of  the  Frenkel  type.  (The  third  possibility,  equal 
numbers  of  positive  and  negative  ion  interstitials,  does  not  seem  to  occur,  interstitials 
being  generally  more  costly  in  energy  than  vacancies  of  the  same  ion.) 


DEFECTS  AND  THERMODYNAMIC  EQUILIBRIUM 

It is most unlikely that line or surface defects can, like point defects, have a nonvanishing
concentration in thermal equilibrium. The energy of formation of one of
these more extended defects will be proportional to the linear dimensions ($N^{1/3}$)
or cross-sectional area ($N^{2/3}$) of the crystal. However, the number of ways of introducing
one (provided that it is not excessively “wiggly” (lines) or “wobbly” (surfaces))
does not appear to be more than logarithmic in N, as for point defects. Thus although
the cost in energy of a single point defect (independent of N) is more than compensated
for by the gain in entropy (of order ln N), this is probably not the case for the
line and surface defects.

Line  and  surface  defects  are,  in  all  likelihood,  metastable  configurations  of  the 
crystal.  However,  thermal  equilibrium  may  well  be  approached  so  slowly  that  for 
practical purposes the defects may be considered to be frozen in. It is also easy to
arrange  nonequilibrium  concentrations  of  point  defects,  which  can  have  considerable 
permanence  (for  example,  by  quickly  cooling  a  crystal  that  had  been  in  equilibrium). 
The  equilibrium  concentration  of  point  defects  can  be  brought  back  to  the  Maxwell- 
Boltzmann  form,  and  the  density  of  line  and  surface  defects  correspondingly  reduced 
toward  zero  by  the  slow  application  and  removal  of  heat.  The  restoration  of 
equilibrium  defect  concentrations  in  this  way  is  known  as  annealing. 


POINT  DEFECTS:  THE  ELECTRICAL  CONDUCTIVITY  OF  IONIC 
CRYSTALS 

Ionic  crystals,  those  electronic  insulators  par  excellence,  have  a  nonvanishing 
electrical  conductivity.  Typical  resistivities  depend  sensitively  on  temperature  and 
purity of the specimen, and can range, in alkali halide crystals, from $10^2$ to $10^8$ ohm-cm
(which  should  be  compared  with  typical  metallic  resistivities,  which  are  of  order 
microhm  centimeters).  Conduction  cannot  be  due  to  the  thermal  excitation  of 
electrons from the valence to the conduction bands, as in semiconductors (Eq. (28.20)),
for the band gap is so large that few if any of the $10^{23}$ electrons can be so aroused.
There  is  direct  evidence  that  the  charge  is  carried  not  by  electrons,  but  by  the  ions 
themselves:  After  the  passage  of  a  current,  atoms  corresponding  to  the  appropriate 
ions  are  found  deposited  at  the  electrodes  in  numbers  proportional  to  the  total 
charge  carried  by  the  current. 

The  ability  of  the  ions  so  to  conduct  is  enormously  enhanced  by  the  presence  of 
vacancies.  It  requires  far  less  work  to  move  a  vacancy  through  a  crystal  than  to  force 
an  ion  through  the  dense  ionic  array  of  a  perfect  crystal  (Figure  30.3). 

There  is  abundant  evidence  that  ionic  conduction  depends  on  the  movement  of 
vacancies.  The  conductivity  is  observed  to  increase  with  temperature  exponentially 
in  1/T,  reflecting  the  temperature  dependence  of  the  thermal  equilibrium  vacancy 
concentration  (30.14).7  Furthermore,  at  low  temperatures,  the  conductivity  of  a 
monovalent  ionic  crystal  doped  with  divalent  impurities  (e.g.,  Ca  in  NaCl)  is  found 
to  be  proportional  to  the  concentration  of  divalent  impurities,  in  spite  of  the  fact 
that  the  material  deposited  at  the  cathode  continues  to  be  the  monovalent  atom. 
As  shown  in  Figure  30.4,  the  important  function  of  the  impurity  is  to  force,  via  charge 
neutrality, the creation of a Na+ vacancy for each Ca++ ion incorporated
substitutionally into the lattice at an Na+ site.8 Thus the more Ca introduced, the greater
the  number  of  Na+  vacancies,  and  the  greater  the  conduction.9 
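
The dependence of the Na+ vacancy concentration on doping can be sketched numerically. The sketch assumes a mass-action relation between positive and negative ion vacancy fractions, $n_+ n_- = n_i^2$ with $n_i = e^{-E_{pair}/2k_BT}$, together with the charge-neutrality condition $n_+ = n_- + n_{Ca}$; the relation, the formation energy `pair_energy` (2 eV), and the function names are assumptions made for illustration, not values taken from the text.

```python
import math

K_B = 8.617e-5  # Boltzmann constant in eV/K


def intrinsic_fraction(temperature, pair_energy=2.0):
    """Vacancy fraction of the pure crystal (assumed Schottky-pair form)."""
    return math.exp(-pair_energy / (2.0 * K_B * temperature))


def na_vacancy_fraction(temperature, ca_fraction, pair_energy=2.0):
    """Fraction of Na+ sites left vacant in Ca-doped NaCl (sketch).

    Solves n_plus * (n_plus - ca_fraction) = n_i**2, i.e. the assumed
    mass-action law combined with charge neutrality.
    """
    n_i = intrinsic_fraction(temperature, pair_energy)
    half = ca_fraction / 2.0
    return half + math.sqrt(half * half + n_i * n_i)


# Low temperature: vacancy fraction tracks the Ca concentration.
print(na_vacancy_fraction(300.0, 1e-6), na_vacancy_fraction(300.0, 2e-6))
# High temperature: thermally generated vacancies dominate.
print(na_vacancy_fraction(1000.0, 1e-6), intrinsic_fraction(1000.0))
```

In this simple model the low-temperature vacancy fraction is essentially equal to the Ca fraction, consistent with a conductivity proportional to the impurity concentration, while at high temperature the intrinsic, exponentially activated concentration takes over.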


7 This by itself is not completely convincing, since the ionic diffusion constant, which depends on the
probability of the ion having enough thermal energy to cross a potential barrier, will also vary exponentially
in 1/T. Consequently, so will the mobility and conductivity. (See Chapter 29 for definitions of diffusion
constant and mobility.)

8  Direct  evidence  for  this  is  that  the  density  of  the  doped  crystal  is  less  than  that  of  the  pure  crystal, 
even  though  a  calcium  atom  has  a  greater  mass  than  one  of  sodium. 

9  This  phenomenon  is  quite  analogous  to  the  impurity  doping  of  semiconductors.  See  Chapter  28, 
and  Problem  3. 
Figure 30.3

(a)  It  is  very  difficult  to  pass  an  extra  positively 
charged  ion  through  a  perfect  crystal.  However,  by 
successive  motion  of  positively  charged  ions  into 
neighboring  vacancies,  (b)-(d),  a  unit  of  positive 
charge  can  be  moved  with  relative  ease  across  the 
whole  crystal. 
COLOR CENTERS

We  have  indicated  that  charge  neutrality  requires  vacancies  of  one  constituent  of 
a  diatomic  ionic  crystal  to  be  balanced,  either  by  an  equal  number  of  interstitials 
of  the  same  constituent  (Frenkel)  or  by  an  equal  number  of  vacancies  of  the  other 
constituent  (Schottky).  It  is  also,  however,  possible  to  balance  the  missing  charge 
of  a  negative  ion  vacancy  with  an  electron  localized  in  the  vicinity  of  the  point  defect 
whose  charge  it  is  replacing. 

Such  an  electron  can  be  regarded  as  bound  to  an  effectively  positively  charged 
center,  and  will,  in  general,  have  a  spectrum  of  energy  levels.10  Excitations  between 


10  See  Problem  5. 
Figure 30.4

The introduction of n Ca++ ions into NaCl results in n Na+ ions
being replaced by Ca++ and the creation of an additional n Na+
vacancies to preserve charge neutrality.

these  levels  produce  a  series  of  optical  absorption  lines  quite  analogous  to  those  of 
single isolated atoms. These excitation energies occur in the optically forbidden band
between $\hbar\omega_T$ and $\hbar\omega_L$ for the perfect crystal (see Chapter 27), and therefore stand
out  as  quite  striking  peaks  in  the  optical  absorption  spectrum  (Figure  30.5).  These 
and  other  such  defect-electron  structures  are  known  as  color  centers,  since  their 
presence  imparts  a  strong  color  to  the  otherwise  transparent  perfect  crystal. 

Color  centers  have  been  extensively  studied  in  the  alkali  halides,  which  can  be 
colored by exposure to X- or γ-radiation (with the ensuing production of defects
by the very high-energy photons) or, more instructively, by heating an alkali halide
crystal in a vapor of the alkali metal. In this last case excess alkali atoms (whose
numbers can range from one in $10^7$ to as high as one in $10^3$) are incorporated into
the  crystal,  as  subsequent  chemical  analysis  demonstrates.  However,  the  mass  density 
of  the  colored  crystal  diminishes  in  proportion  to  the  concentration  of  excess  alkali 


Figure  30.5 

The absorption spectrum of KCl, showing peaks associated with the various
combinations of F-centers; e.g., the F-center itself, the M-center, and the
R-center. (R. H. Silsbee, Phys. Rev. A180, 138 (1965).)
Figure 30.6

Heating a perfect alkali halide crystal in the vapor of the alkali metal can
produce a crystal with an excess of alkali ions. There is a corresponding
concentration of negative ion vacancies whose sites are now occupied by
the (highly localized) excess electrons.

atoms,  demonstrating  that  the  atoms  are  not  absorbed  interstitially.  Instead,  the  alkali 
metal  atoms  are  ionized  and  assume  positions  in  the  sites  of  a  perfect  positively 
charged  sublattice,  and  the  excess  electrons  are  bound  to  an  equal  number  of  negative 
ion  vacancies  (Figure  30.6). 

Striking  evidence  for  the  validity  of  this  picture  is  provided  by  the  fact  that  the 
absorption  spectrum  so  produced  is  not  substantially  altered  if,  for  example,  we  heat 
potassium  chloride  in  a  vapor  of  sodium,  rather  than  potassium  metal.  This  confirms 
the  fact  that  the  primary  role  of  the  vapor  of  metal  atoms  is  to  introduce  negative 
ion  vacancies  and  to  supply  the  neutralizing  electron,  whose  energy  levels  produce 
the  absorption  spectrum. 

An  electron  bound  to  a  negative  ion  vacancy  (known  as  an  F-center11)  is  capable 
of  reproducing  many  of  the  qualitative  features  of  ordinary  atomic  spectra,  with  the 
added  complication  that  it  moves  in  a  field  of  cubic,  rather  than  spherical,  symmetry. 
This  permits  one  to  exercise  one’s  knowledge  of  group  theory  (e.g.,  how  angular 
momentum  multiplets  are  split  by  cubic  fields).  Indeed,  by  stressing  the  crystal,  one 
can  reduce  the  cubic  symmetry,  producing  diagnostic  perturbations,  which  are 
helpful  in  disentangling  a  wealth  of  additional  structure  in  the  absorption  spectrum. 
The extra structure is present because the simple F-center is not the only way the
electrons and vacancies can conspire to color the crystal.12 Two other possibilities
are: (a) The M-center (Figure 30.7a), in which two neighboring negative ion vacancies
in a (100) plane bind two electrons. (b) The R-center (Figure 30.7b), in which three
neighboring negative ion vacancies in a (111) plane bind three electrons.
Figure 30.7

(a) The M-center, in which two neighboring negative ion vacancies in a (100)
plane bind two electrons. (b) The R-center, in which three negative ion
vacancies in a (111) plane bind three electrons.


11  An  abbreviation  for  the  German  Farbzentrum. 

12  It  is,  however,  the  most  abundant  of  the  centers 
It took considerable ingenuity to demonstrate that these various categories of
defects  were  indeed  responsible  for  the  observed  spectra.  Identification  is  made 
possible  by  noting  that  each  has  a  characteristic  response  to  the  effects  of  stress  or 
electric  fields  on  its  level  structure. 

The  resonances  in  the  optical  absorption  produced  by  color  centers  are  not  nearly 
so  sharp  as  those  produced  by  the  excitation  of  isolated  atoms.  This  is  because  the 
linewidth  is  inversely  proportional  to  the  excited-state  lifetime.  Isolated  atoms 
can  only  decay  radiatively,  a  relatively  slow  process,  but  the  “atom”  represented 
by  an  F-center  is  strongly  coupled  to  the  rest  of  the  solid,  and  can  lose  its  energy  by 
the  emission  of  phonons. 

One  might  think  that  by  heating  an  alkali  halide  crystal  in  halogen  gas,  one  could 
also  introduce  alkali  metal  vacancies  to  which  holes  could  be  bound,  but  these 
antimorphs  to  the  F-center  and  its  cousins  have  not  been  observed.  Holes  can  be 
bound  to  point  imperfections,  but  the  imperfections  have  not  been  observed  to  be 
positive ion vacancies. Indeed, the most studied hole center, the $V_K$-center, is based
on no vacancies at all, but only on the possibility of a hole binding two neighboring
negative ions (e.g., chlorine) into something which has a spectrum rather like $\mathrm{Cl}_2^-$

Figure  30.8 

Color centers involving the binding of holes do not involve positive ion vacancies.
The $V_K$-center is based on the possibility of a hole binding two neighboring negative
ions.
(Figure 30.8). A similar “hole center,” the H-center, apparently results from an
interstitial chlorine ion being so bound to a (symmetrically situated) lattice ion by
a hole (Figure 30.9); i.e., the singly ionized chlorine molecule is forced to occupy
a single negative ion site. The spectra of the $V_K$- and H-centers are similar enough to
have greatly impeded efforts to make definitive classifications.


Figure  30.9 

The H-center, which is thought to result from the binding (by a hole) of an
interstitial chloride ion to a symmetrically situated lattice ion. The result is a singly
ionized chlorine molecule that is forced to occupy a single negative ion site.


Having  started  upon  the  diagnosis  and  construction  of  color  centers,  one  can 
continue  at  some  length.  For  example,  one  can  look  for,  or  make,  a  simple  F-center 
in  which  one  of  the  six  nearest  neighboring  positive  ions  has  been  replaced  by  an 
Figure 30.10

The $F_A$-center, in which one of the six nearest-neighbor positive ions
surrounding a negative ion vacancy is replaced by an impurity ion, thereby
lowering the symmetry of the levels of the bound electron. This type of
conjunction of impurity and vacancy is often energetically favorable.

impurity (Figure 30.10). One then has an $F_A$-center, whose reduced symmetry
delights spectroscopists.

Finally, pursuing the search for opposites, we might inquire whether the antimorph
of the $V_K$-center has been observed: a localized electron, serving to bind two
neighboring positively charged ions together. Because (for example) $\mathrm{Cl}_2$ molecules exist
(covalently bonded) and $\mathrm{Na}_2$ molecules in general do not, the answer is no. Indeed,
the asymmetry between electron and hole centers is precisely due to the crucial
difference in the sodium (s-level) and chlorine (p-level) valence electrons, which form
covalent bonds only in the latter case. However, something rather less localized
than the $V_K$ antimorph does exist, and is known as the polaron.

POLARONS 

When  an  electron  is  introduced  into  the  conduction  band  of  a  perfect  ionic  crystal, 
it may be energetically favorable for it to move in a spatially localized level,
accompanied by a local deformation in the previously perfect ionic arrangement (i.e., a
polarization  of  the  lattice)  that  serves  to  screen  its  field  and  reduce  its  electrostatic 
energy.  Such  an  entity  (electron  plus  induced  lattice  polarization)  happens  to  be  far 
more  mobile  than  the  defects  we  have  described  up  to  now,  and  is  generally  not  viewed 
as  a  defect  at  all,  but  rather  as  an  important  complication  in  the  theory  of  electron 
mobility  in  ionic  or  partially  ionic  crystals.  Theories  of  polarons  are  quite  intricate, 
since  one  is  required  to  consider  the  dynamics  of  an  electron  that  is  strongly  coupled 
to  the  ionic  degrees  of  freedom.13 

EXCITONS 

The  most  obvious  point  defects  consist  of  missing  ions  (vacancies),  excess  ions 
(interstitials),  or  the  wrong  kind  of  ions  (substitutional  impurities).  A  more  subtle 
possibility is the case of an ion in a perfect crystal that differs from its colleagues only
by being in an excited electronic state. Such a “defect” is called a Frenkel exciton.


13 Two general references on polarons are C. G. Kuper and G. D. Whitfield, eds., Polarons and
Excitons, Plenum Press, New York, 1963, and the article by J. Appel, in Solid State Physics, vol. 21, Academic
Press, New York, 1968, page 193. (The reader should be warned that our introduction of the polaron as a
mobile antimorph of the $V_K$-center reflects only our own desperate efforts at literary continuity. It is not
the orthodox point of view.)




Since any ion is capable of being so excited, and since the coupling between the ions’
outer electronic shells is strong, the excitation energy can actually be transferred
from ion to ion. Thus the Frenkel exciton can move through the crystal without the
ions themselves having to change places, as a result of which it is (like the polaron)
far more mobile than vacancies, interstitials, or substitutional impurities. Indeed,
for most purposes it is better not to think of the exciton as being localized at all.
It is more accurate to describe the electronic structure of a crystal containing an
exciton as a quantum-mechanical superposition of states, in which it is equally
probable that the excitation is associated with any ion in the crystal. This latter
view bears the same relation to specific excited ions as the Bloch tight-binding levels
(Chapter 10) bear to the individual atomic levels in the theory of band structures.

Thus the exciton is probably better regarded as one of the more complex
manifestations of electronic band structure than as a crystal defect. Indeed, once one
recognizes  that  the  proper  description  of  an  exciton  is  really  a  problem  in  electronic 
band  structure,  one  can  adopt  a  very  different  view  of  the  same  phenomenon. 

Suppose we have calculated the electronic ground state of an insulator in the
independent electron approximation. The lowest excited state of the insulator will
evidently be given by removing one electron from the highest level in the highest
occupied band (the valence band) and placing it into the lowest-lying level of the
lowest unoccupied band (conduction band). Such a rearrangement of the
distribution of electrons does not alter the self-consistent periodic potential in which
they move (Eqs. (17.7) or (17.15)). This is because the Bloch electrons are not localized
(since $|\psi_{n\mathbf{k}}(\mathbf{r})|^2$ is periodic), and therefore the change in local charge density produced
by changing the level of a single electron will be of order 1/N (since only an Nth
of the electron’s charge will be in any given cell), i.e., negligibly small. Thus the
electronic energy levels do not have to be recomputed for the excited configuration,
and the first excited state will lie an energy $\mathcal{E}_c - \mathcal{E}_v$ above the energy of the ground
state, where $\mathcal{E}_c$ is the conduction band minimum and $\mathcal{E}_v$ the valence band maximum.

However, there is another way to make an excited state. Suppose we form a
one-electron level by superposing enough levels near the conduction band minimum to
form a well-localized wave packet. Because we need levels in the neighborhood of
the minimum to produce the wave packet, the energy $\bar{\mathcal{E}}_c$ of the wave packet will be
somewhat greater than $\mathcal{E}_c$. Suppose, in addition, that the valence band level we
depopulate is also a wave packet, formed of levels in the neighborhood of the valence
band maximum (so that its energy $\bar{\mathcal{E}}_v$ is somewhat less than $\mathcal{E}_v$) and chosen so that the
center of the wave packet is spatially very near the center of the conduction band
wave packet. If we ignored electron-electron interactions, the energy required to move
an electron from valence to conduction band wave packets would be $\bar{\mathcal{E}}_c - \bar{\mathcal{E}}_v > \mathcal{E}_c - \mathcal{E}_v$,
but because the levels are localized, there will, in addition, be a non-negligible amount
of negative Coulomb energy due to the electrostatic attraction of the (localized)
conduction band electron and (localized) valence band hole.

This additional negative electrostatic energy can reduce the total excitation energy
to an amount that is less than $\mathcal{E}_c - \mathcal{E}_v$, so the more complicated type of excited
state, in which the conduction band electron is spatially correlated with the valence


14  We  use  the  nomenclature  introduced  on  page  562. 
band hole it left behind, is the true lowest excited state of the crystal. Evidence for
this  is  (a)  the  onset  of  optical  absorption  at  energies  below  the  interband  continuum 
threshold  (Figure  30.11)  and  (b)  the  following  elementary  theoretical  argument, 
indicating  that  one  always  does  better  by  exploiting  the  electron-hole  attraction: 

Let  us  consider  the  case  in  which  the  localized  electron  and  hole  levels  extend 
over  many  lattice  constants.  We  may  then  make  the  same  type  of  semiclassical 
argument  that  we  used  to  deduce  the  form  of  the  impurity  levels  in  semiconductors 
(Chapter 28). We regard the electron and hole as particles of masses $m_c$ and $m_v$ (the
conduction and valence band effective masses (see (28.3)), which we take, for simplicity,
to be isotropic). They interact through an attractive Coulomb interaction screened
by the dielectric constant $\epsilon$ of the crystal. Evidently this is just the hydrogen atom
problem, with the hydrogen atom reduced mass $\mu$ ($1/\mu = 1/M_{proton} + 1/m_{electron} \approx 1/m$)
replaced by the reduced effective mass $m^*$ ($1/m^* = 1/m_c + 1/m_v$), and the
square of the electronic charge, $e^2$, replaced by $e^2/\epsilon$. Thus there will be bound states,
the lowest of which extends over a Bohr radius given by:


$$a_{ex} = \frac{\hbar^2}{m^*(e^2/\epsilon)} = \epsilon\,\frac{m}{m^*}\,a_0. \tag{30.20}$$


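The energy of the bound state will be lower than the energy ($\mathcal{E}_c - \mathcal{E}_v$) of the
non-interacting electron and hole by

$$E_{ex} = \frac{(e^2/\epsilon)}{2a_{ex}} = \frac{m^*}{m}\,\frac{1}{\epsilon^2}\,\frac{e^2}{2a_0} = \frac{m^*}{m}\,\frac{1}{\epsilon^2}\,(13.6)\ \text{eV}. \tag{30.21}$$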


The validity of this model requires that $a_{ex}$ be large on the scale of the lattice
(i.e., $a_{ex} \gg a_0$), but since insulators with small energy gaps tend to have small effective
masses and large dielectric constants, that is not difficult to achieve, particularly
in semiconductors. Such hydrogenic spectra have in fact been observed in the optical
absorption that occurs below the interband threshold.
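
As a quick numerical illustration of (30.20) and (30.21), the following sketch evaluates the exciton radius and binding energy for given effective masses and dielectric constant. The function name and the sample values ($m_c = 0.1m$, $m_v = 0.5m$, $\epsilon = 10$) are illustrative assumptions, not data for any particular material.

```python
RYDBERG_EV = 13.6          # e^2 / (2 a_0) in eV
BOHR_RADIUS_ANGSTROM = 0.529  # a_0 in angstroms


def mott_wannier(m_c, m_v, epsilon):
    """Hydrogenic exciton estimate from (30.20) and (30.21).

    m_c, m_v: conduction and valence band effective masses, in units of
    the free electron mass m. epsilon: static dielectric constant.
    Returns (m_star / m, a_ex in angstroms, E_ex in eV).
    """
    m_star = 1.0 / (1.0 / m_c + 1.0 / m_v)            # reduced effective mass
    a_ex = epsilon / m_star * BOHR_RADIUS_ANGSTROM    # (30.20)
    e_ex = m_star / epsilon ** 2 * RYDBERG_EV         # (30.21)
    return m_star, a_ex, e_ex


m_star, a_ex, e_ex = mott_wannier(m_c=0.1, m_v=0.5, epsilon=10.0)
print(f"m*/m = {m_star:.4f}, a_ex = {a_ex:.1f} A, E_ex = {e_ex * 1000:.1f} meV")
```

For these assumed values the radius comes out at tens of angstroms and the binding energy at about ten meV, so the condition $a_{ex} \gg a_0$ is comfortably met.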

The  exciton  described  by  this  model  is  known  as  the  Mott-Wannier  exciton. 
Evidently  as  the  atomic  levels  out  of  which  the  band  levels  are  formed  become  more 
tightly bound, $\epsilon$ will decrease, $m^*$ will increase, $a_{ex}$ will decrease, the exciton will
become  more  localized,  and  the  Mott-Wannier  picture  will  eventually  break  down. 
The  Mott-Wannier  exciton  and  the  Frenkel  exciton  are  opposite  extremes  of  the 
same  phenomenon.  In  the  Frenkel  case,  based  as  it  is  on  a  single  excited  ionic  level, 
the  electron  and  hole  are  sharply  localized  on  the  atomic  scale.  The  exciton  spectra 
of  the  solid  rare  gases  fall  in  this  class.15 


LINE  DEFECTS:  DISLOCATIONS 

One  of  the  most  spectacular  failures  of  the  model  of  a  solid  as  a  perfect  crystal  is  its 
inability  to  account  for  the  order  of  magnitude  of  the  force  necessary  to  deform  a 


15 For a general reference on excitons, see R. S. Knox, Excitons, Academic Press, New York, 1963.
Figure 30.11

(a) The band structure of KI as inferred by J. C. Phillips (Phys. Rev. 136, A1705 (1964)) from its
optical absorption spectrum. (b) The exciton spectrum associated with the various valence and
conduction band maxima and minima. (After J. E. Eby, K. J. Teegarden, and D. B. Dutton,
Phys. Rev. 116, 1099 (1959), as summarized by J. C. Phillips, “Fundamental Optical Spectra of
Solids,” in Solid State Physics, vol. 18, Academic Press, New York, 1966.)
crystal plastically (i.e., permanently and irreversibly). Assuming the solid is a perfect
crystal,  this  force  is  easily  estimated: 

Suppose,  as  in  Figure  30.12,  that  we  resolve  the  crystal  into  a  family  of  parallel 
lattice  planes,  separated  by  a  distance  d,  and  consider  a  shear  deformation  of  the 
crystal  in  which  each  plane  is  displaced  parallel  to  itself  in  a  specified  direction  n 
by  an  amount  x,  with  respect  to  the  plane  immediately  below  it.  Let  the  extra  energy 
per unit volume associated with the shear be u(x). For small x, we expect u to be
quadratic  in  x  (x  =  0  corresponds  to  equilibrium)  and  given  by  the  elasticity  theory 
described  in  Chapter  22.  For  example,  if  the  crystal  is  cubic,  the  planes  are  (100) 
planes,  and  the  direction  is  [010],  then  (Problem  4) 


More  generally  we  will  have  a  relation  of  the  form: 

$$u = \frac{1}{2}\left(\frac{x}{d}\right)^2 G, \tag{30.23}$$

where G is the size of a typical elastic constant, and therefore (Table 22.2) of order
$10^{11}$ to $10^{12}$ dynes/cm$^2$.

The  form  (30.23)  will  certainly  fail  when  x  is  too  large.  To  take  an  extreme  case, 
if x is as large as the shortest Bravais lattice vector a parallel to n, then the displaced


Figure 30.12

An undeformed crystal undergoes progressively increasing shear strain (shear
strain = x/d). (a) Perfect crystal. (b) Deformed crystal. In (c) the crystal is
deformed so far that the new interior configuration is indistinguishable from the
undeformed crystal.
Figure 30.13

(a) The behavior of the additional energy per unit volume, $u(x)$, due to a shear strain x. Note
that $u(x + a) = u(x)$. (b) A plot of the force per unit area per plane necessary to maintain the
strain x. In this simple model the order of magnitude of the maximum or critical stress $\sigma_c$ can be
estimated by taking the value of $\sigma$ at $x = a/4$, or alternatively by extrapolating the linear region
of $\sigma(x)$ to this value of x.

configuration (ignoring small surface effects) will be indistinguishable from the
undeformed crystal, and u(a) will be 0. Indeed, as a function of x, u will be periodic
with period a: $u(x + a) = u(x)$, reducing to the form (30.23) only when $x \ll a$
(Figure 30.13a). As a result, starting from the perfect crystal the force $\sigma(x)$ per unit
area of plane (per plane) necessary to maintain the displacement x, known as the
shear stress, will not increase indefinitely with x. We estimate its maximum size as
follows:

If the crystal is made up of N planes of area A, then the volume is $V = ANd$,
and the shear stress is given by

$$\sigma = \frac{1}{NA}\,\frac{d}{dx}(Vu) = d\left(\frac{du}{dx}\right). \tag{30.24}$$

This will be maximum at some displacement $x_0$ between 0 and $a/2$ (Figure 30.13b).
If we roughly estimate the value at maximum by extrapolating the linear region of
$\sigma(x)$ (valid for small x) out to $x = a/4$, then we find that the critical shear stress is
of order:

$$\sigma_c \approx \left.\frac{1}{2}\,G\,\frac{d}{dx}\,\frac{x^2}{d}\right|_{x=a/4} = \frac{1}{4}\,\frac{a}{d}\,G \approx 10^{11}\ \text{dynes/cm}^2. \tag{30.25}$$

If a shear greater than $\sigma_c$ is applied, there is nothing to prevent one plane from
sliding over another; i.e., the crystal undergoes slip. It is evident from Figure 30.13b
that (30.25) gives only a rough estimate of the critical shear stress. However, the
observed critical shear stress in apparently well-prepared “single crystals” can be
less than the estimate (30.25) by as much as a factor of $10^4$! An error of this magnitude
suggests that the description of slip, on which the estimate (30.25) is based, is simply
incorrect. 
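
The perfect-crystal estimate (30.25) can be checked numerically. The sketch below evaluates the extrapolated linear stress at $x = a/4$ and, for comparison, the maximum stress of a sinusoidal model $u(x) = (Ga^2/4\pi^2 d^2)\,[1 - \cos(2\pi x/a)]$. The sinusoidal form is an assumption introduced here only as the simplest periodic function that reduces to (30.23) for small x; the values of G, a, and d are likewise illustrative.

```python
import math


def linear_stress(x, G, d):
    """Small-x shear stress from (30.23) and (30.24): sigma = G * x / d."""
    return G * x / d


def critical_stress_estimate(G, a, d):
    """Estimate (30.25): the linear region extrapolated to x = a/4."""
    return linear_stress(a / 4.0, G, d)


def sinusoidal_stress(x, G, a, d):
    """Stress d * du/dx for the ASSUMED sinusoidal u(x) (not from the text)."""
    return G * a / (2.0 * math.pi * d) * math.sin(2.0 * math.pi * x / a)


G = 1.0e12   # typical elastic constant, dynes/cm^2 (illustrative)
a = d = 1.0  # only the ratio a/d enters; taken equal here

sigma_c = critical_stress_estimate(G, a, d)
sigma_max = sinusoidal_stress(a / 4.0, G, a, d)  # maximum of the sine
print(f"extrapolated estimate: {sigma_c:.2e} dynes/cm^2")
print(f"sinusoidal maximum:    {sigma_max:.2e} dynes/cm^2")
print(f"estimate / 1e4:        {sigma_c / 1.0e4:.2e} dynes/cm^2")
```

Both versions give a critical stress of order $10^{11}$ dynes/cm$^2$, so the discrepancy of up to $10^4$ with observation cannot be attributed to the crudeness of the extrapolation.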

The  actual  process  by  which  slip  occurs  in  most  cases  is  far  more  subtle.  A  crucial 
role  is  played  by  a  special  kind  of  linear  defect  known  as  a  dislocation.  The  two 
simplest  kinds,  screw  dislocations  and  edge  dislocations,  are  illustrated  in  Figure  30.14, 
and  described  in  more  detail  below.  Dislocation  densities  in  actual  crystals  depend 
on the preparation of the specimen,16 but can range from $10^2$ to $10^{12}$/cm$^2$. Along
a  linear  dislocation  the  crystal  is  in  so  high  a  state  of  local  distortion  that  the  additional 
distortion  required  to  move  the  dislocation  sideways  by  one  lattice  constant  requires 
relatively  little  additional  applied  stress.  Furthermore,  the  net  effect  of  moving  a 
dislocation  through  many  lattice  constants  is  a  displacement  by  a  lattice  constant17 
of  the  two  halves  of  the  crystal  separated  by  the  plane  of  motion.18 

One  can  imagine  constructing  an  edge  dislocation  (Figure  30.14a)  by  removing 
from  the  crystal  a  half  plane  of  atoms  terminating  in  the  dislocation  line,  and  then 
carefully  joining  back  together  the  two  planes  on  either  side  of  the  missing  plane 


Figure  30.14 

(a)  Slip  in  a  crystal  via  the  motion  of  an  edge  dislocation,  (b)  Slip  in  a  crystal  via  the  motion  of 
a  screw  dislocation. 


16 As mentioned above, linear defects are not a thermodynamic equilibrium phenomenon. There is
therefore no intrinsic value to the dislocation density (which can be considerably reduced by annealing).

17  There  is  another  kind  of  slip,  mediated  by  dislocations,  in  which  the  slipped  portion  of  the  crystal 
bears a more complex relation to the unslipped part. See the description of “twinning,” below.

18  The  analogy  is  often  made  to  the  passage  of  a  linear  ripple  across  a  carpet.  The  effect  is  a  slight 
displacement of the carpet, made far more easily than by sliding the entire undeformed carpet the same
distance. 
in a way that restores the basic order of the perfect crystal everywhere except in the
vicinity  of  the  dislocation  line.19 

Similarly,  a  screw  dislocation  (Figure  30.14b)  can  be  “constructed”  by  imagining 
a  plane  terminating  at  the  dislocation  line,  above  which  the  crystal  has  been  displaced 
by  a  lattice  vector  parallel  to  the  line,  and  then  rejoined  to  the  part  of  the  crystal 
below  in  a  way  that  preserves  the  basic  crystalline  order  everywhere  except  near  the 
line  itself. 

More  generally,  dislocations  need  not  be  rectilinear.  One  can  describe  a  general 
Figure  30.15 

Ambiguities in the “constructive” definition of a dislocation. One plane of a crystal is
shown, perpendicular to a single edge dislocation. (The point where the dislocation
intersects the plane is most easily perceived by viewing the figure at a low angle
along either of the families of parallel lines, which are labeled A through K and
1 through 11.) One can describe the dislocation as being produced by the insertion
of the extra plane of atoms that intersects the upper half of the figure in the
line 6, or, equally well, by the insertion of the extra plane of atoms that intersects the
upper half of the figure in the line F. Alternatively, the dislocation can be viewed as
produced by the removal of a plane from the lower half of the figure, and that plane
can either have been the one that was between 5 and 7 or between E and G. The figure
is based on the “bubble raft” photographs of Bragg and Nye. (Proc. Roy. Soc. A190,
474 (1947).)


19  Only  the  dislocation  line  itself  has  an  absolute  significance,  however.  Given  an  edge  dislocation, 
there  are  any  number  of  places  from  which  the  “removed  plane”  might  have  been  taken.  Indeed,  one  may 
also think of the dislocation as having been constructed by the insertion of an extra plane (Figure 30.15).
The  same  is  true  of  screw  dislocations. 


dislocation as any linear region in the crystal (either a closed curve or a curve that
terminates  on  the  surface)  with  the  following  properties: 

1.  Away  from  the  region  the  crystal  is  locally  only  negligibly  different  from  the 
perfect  crystal. 

2.  In  the  neighborhood  of  the  region  the  atomic  positions  are  substantially  different 
from  the  original  crystalline  sites. 

3.  There  exists  a  nonvanishing  Burgers  vector. 

The  Burgers  vector  is  defined  as  follows:  Consider  a  closed  curve  in  the  perfect 
crystal  passing  through  a  succession  of  lattice  sites,  which  therefore  can  be  traversed 
by  a  series  of  displacements  by  Bravais  lattice  vectors  (Figure  30.16,  lower  curve). 
Now  traverse  the  same  sequence  of  Bravais  lattice  displacements  in  the  putatively 
dislocated  crystal  (upper  curve  in  Figure  30.16).  The  test  path  should  be  far  enough 
away  from  the  dislocation  that  the  configuration  of  the  crystal  in  its  neighborhood 
hardly  differs  from  the  undistorted  crystal,  giving  an  unambiguous  meaning  to  the 
phrase “same sequence of Bravais lattice displacements.” If the series of displacements
now  fails  to  bring  one  back  to  one’s  starting  point,  then  the  curve  has  surrounded 
a  dislocation.  The  Bravais  lattice  vector  b  by  which  the  endpoint  fails  to  coincide 
with  the  starting  point  is  called  the  Burgers  vector  of  the  dislocation.20 
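
The closure test can be illustrated with a continuum analogue. The sketch below assumes the standard elastic displacement field of a straight screw dislocation along the z-axis, u_z = bθ/2π (an assumption not stated in the text above), and sums the displacement increments along a closed polygonal path. The function name `burgers_closure` and the two test loops are hypothetical. A loop that surrounds the line accumulates a closure failure equal to b; a loop that does not surround it closes.

```python
import math

def burgers_closure(path, b=1.0, core=(0.0, 0.0)):
    """Sum the increments of u_z = b*theta/(2*pi) along a closed polygonal path.

    path is a list of (x, y) vertices; the last vertex is joined back to the first.
    core is the point where the (assumed) screw dislocation pierces the plane.
    """
    total = 0.0
    n = len(path)
    for k in range(n):
        x0, y0 = path[k]
        x1, y1 = path[(k + 1) % n]
        theta0 = math.atan2(y0 - core[1], x0 - core[0])
        theta1 = math.atan2(y1 - core[1], x1 - core[0])
        # Wrap the angular step into [-pi, pi) so each edge is taken the short way round.
        step = (theta1 - theta0 + math.pi) % (2.0 * math.pi) - math.pi
        total += b * step / (2.0 * math.pi)
    return total

# Counterclockwise square that surrounds the dislocation line at the origin.
square_around = [(-3, -3), (3, -3), (3, 3), (-3, 3)]
# Square of the same sense that lies entirely to one side of the line.
square_away = [(5, 5), (8, 5), (8, 8), (5, 8)]

print(burgers_closure(square_around))  # closure failure: one Burgers vector
print(burgers_closure(square_away))    # path closes
```

Deforming `square_around` without crossing the core leaves the result unchanged, which is the path independence asserted for the Burgers vector.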

Figure  30.16 

Two  paths  in  a  lattice  plane. 
The lower path is in a dislocation-free region. If one
starts  at  A  and  moves  five 
steps  down,  six  to  the  right, 
five  up,  and  six  to  the  left, 
one  returns  to  A.  The  upper 
path  surrounds  a  dislocation. 
(The  dislocation  line  is  per¬ 
pendicular  to  the  lattice 
plane.)  If  one  starts  at  B  and 
moves  through  the  same  se¬ 
quence  of  steps  (five  down, 
six  to  the  right,  five  up,  and 
six  to  the  left)  one  does  not 
return  to  the  starting  point  B, 
but  to  C.  The  vector  from  B 
to  C  is  the  Burgers  vector  b. 
(The  dislocation  that  the 
second  path  surrounds  is 
seen  most  readily  by  viewing 
the  page  at  a  very  low  angle.) 


20 If b = 0, the linear defect is not a dislocation (unless it happens that the path has surrounded two
dislocations with Burgers vectors of equal magnitudes and opposite directions). A linear array of vacancies,
for example, satisfies criteria 1 and 2, but is not a dislocation. (If an ion in the interior of the lower curve
in Figure 30.16 is withdrawn, the path still closes.)

Some thought should convince one that the Burgers vector of a given dislocation
does  not  depend  on  the  path  around  the  dislocation  chosen  for  the  test.  The  Burgers 
vector  is  perpendicular  to  an  edge  dislocation,  and  parallel  to  a  screw  dislocation. 
Dislocations  more  complex  than  edge  or  screw  dislocations  can  still  be  described  by 
a  single  path-independent  Burgers  vector,  though  the  relation  between  the  direction 
of  the  Burgers  vector  and  the  geometry  of  the  dislocated  region  will  not  be  as  simple 
as  it  is  for  edge  and  screw  dislocations.21 


CRYSTALLINE  STRENGTH 

The  weakness  of  good  crystals  was  a  mystery  for  many  years,  in  part,  no  doubt, 
because  the  observed  data  easily  led  one  to  the  wrong  conclusion.  Relatively  poorly 
prepared  crystals  were  found  to  have  yield  strengths  close  to  the  high  value  we  first 
estimated  for  the  perfect  crystal.  However,  as  the  crystals  were  improved  (for  example, 
by  annealing)  the  yield  strengths  were  found  to  drop  drastically,  falling  by  several 
orders  of  magnitude  in  very  well  prepared  crystals.  It  was  natural  to  assume  that  the 
yield  strength  was  approaching  that  of  a  perfect  crystal  as  specimens  were  improved, 
but,  in  fact,  quite  the  opposite  was  happening. 

Three  people  independently  came  up  with  the  explanation  in  1934,22  inventing23 
the  dislocation  to  account  for  the  data.  They  suggested  that  almost  all  real  crystals 
contain  dislocations,  and  that  plastic  slip  occurs  through  their  motion  as  described 
above.  There  are  then  two  ways  of  making  a  strong  crystal.  One  is  to  make  an 
essentially  perfect  crystal,  free  of  all  dislocations.  This  is  extremely  difficult  to 
achieve.24  Another  way  is  to  arrange  to  impede  the  flow  of  dislocations,  for  although 
dislocations  move  with  relative  ease  in  a  perfect  crystal,  if  they  encounter  interstitials, 
impurities,  or  even  other  dislocations  crossing  their  paths,  the  work  required  to  move 
them  can  increase  considerably. 

Thus  the  poorly  prepared  crystal  is  hard  because  it  is  infested  with  dislocations 
and  defects,  and  these  interfere  so  seriously  with  each  other’s  motion  that  slip  can 
occur  only  by  the  more  drastic  means  described  earlier.  However,  as  the  crystal  is 
purified  and  improved,  dislocations  largely  move  out  of  the  crystal,  vacancies  and 
interstitials  are  reduced  to  their  (low)  thermal  equilibrium  concentrations,  and  the 
unimpeded  motion  of  those  dislocations  that  remain  makes  it  possible  for  the  crystal 
to  deform  with  ease.  At  this  point  the  crystal  is  very  soft.  If  one  could  continue  the 
process  of  refinement  to  the  point  where  all  dislocations  were  removed,  the  crystal 


21  If  one  imagines  making  a  closed  dislocation  with  scalpel  and  glue,  cutting  a  surface  bounding  a 
circuit  in  the  crystal,  displacing  the  surfaces  on  either  side  of  the  cut,  and  then  regluing  them  together  after 
removing or adding any atoms that are now required to restore perfect order, then the Burgers vector is the
amount  by  which  the  surfaces  have  been  displaced.  The  topological  definition  (which  is  equivalent)  is 
perhaps  more  intuitive  since  it  does  not  require  contemplation  of  these  abstruse  operations. 

22 G. I. Taylor, E. Orowan, and G. Polyani. (G. I. Taylor, Proc. Roy. Soc. A145, 362 (1934);
E. Orowan, Z. Phys. 89, 614 (1934); G. Polyani, Z. Phys. 98, 660 (1934).) Dislocations were introduced
into the continuum theory of elasticity, some 30 years earlier, by V. Volterra.

23  Dislocations  were  not  directly  observed  for  almost  another  10  years. 

24 See, however, the description of “whiskers” below.

would again become hard. In certain cases this has actually been observed, as we shall
see  below. 


WORK  HARDENING 

It  is  a  familiar  fact  that  a  bar  of  soft  metal,  after  repeated  bendings  back  and  forth, 
eventually  refuses  to  be  bent,  and  breaks.  This  is  an  example  of  work  hardening. 
With  every  bending,  more  and  more  dislocations  flow  into  the  metal,  until  there  are 
so  many  that  they  impede  each  other’s  flow.  Then  the  crystal  is  incapable  of  further 
plastic  deformation,  and  breaks  under  subsequent  stress. 


DISLOCATIONS  AND  CRYSTAL  GROWTH 

The  problem  of  plastic  (i.e.,  irreversible)  flow  was  unraveled  by  attributing  it  to  the 
motion  of  dislocations.  An  equally  perplexing  problem  was  that  of  crystal  growth, 
which  was  solved  by  invoking  the  existence  of  screw  dislocations.  Suppose  we  grow 
a  large  crystal  by  exposing  a  small  piece  of  crystal  to  a  vapor  of  the  same  atoms. 
Atoms  from  the  vapor  will  condense  into  lattice  positions  more  readily  if  neighbors 
surrounding  the  site  are  already  in  place.  Thus  an  atom  is  relatively  weakly  attracted 
to  a  perfect  crystal  plane,  more  strongly  attracted  to  a  step  between  two  planes, 
and  most  strongly  attracted  to  a  corner  (Figure  30.17).  If  one  assumes  that  grown 
crystals  are  perfect,  and  that  growth  takes  place  plane  by  plane,  then  whenever  a 
new plane is required, atoms must condense onto the plane below as in Figure 30.17a.
Because of the relatively weak binding in this case, such processes (known as
“nucleating the next layer”) occur far too slowly to account for the observed rate of
crystal growth. If, however, the crystal contains a screw dislocation, it is never
necessary to nucleate a new plane, for the local planar structure can wind endlessly
about the screw dislocation like a spiral ramp (as in Figure 30.17d).

Figure  30.17 

Atoms  are  relatively  weakly  at¬ 
tracted  to  perfect  crystal  planes 
(a),  are  more  strongly  attracted 
to  a  step  between  two  planes  (b), 
and  are  most  strongly  attracted 
to  a  corner  (c).  If  the  crystal 
contains  a  screw  dislocation,  as 
in  (d),  then  by  adding  atoms  as 
shown,  the  local  planar  structure 
can  spiral  endlessly  around  the 
dislocation.  Crystals  can  grow 
much  more  rapidly  in  this  way, 
since  the  nucleation  of  new 
planes  by  the  process  shown  in 
(a)  is  never  required. 




WHISKERS 

The  type  of  crystal  growth  described  above  can  lead  to  a  very  long,  thin,  whisker¬ 
shaped  crystal  winding  about  and  extending  a  single  screw  dislocation.  Such  whiskers 
may  contain  only  a  single  dislocation  (the  nucleating  screw  dislocation  itself)  and 
are  observed  to  have  yield  strengths  comparable  to  those  predicted  by  the  perfect 
crystal  model. 


OBSERVATION  OF  DISLOCATIONS  AND  OTHER  DEFECTS 

One  of  the  earliest  confirmations  that  dislocations  (and  other  varieties  of  defects) 
can  indeed  exist  in  naturally  formed  crystals  was  provided  by  the  observations  of 
Bragg  and  Nye25  on  rafts  of  identical  bubbles  floating  on  the  surface  of  soap  solutions. 
The bubbles are held together by surface tension, and a two-dimensional array of
them approximates very well a section of crystal. Point defects, dislocations, and grain
boundaries  were  all  found  to  occur  in  the  bubble  arrays. 

Direct  observations  in  solids  have  since  been  made  by  the  technique  of  transmission 
electron  microscopy.  Chemical  etching  also  reveals  the  intersection  of  dislocations 
with  solid  surfaces.  At  such  points  the  solid  is  in  a  state  of  considerable  strain,  and 
the  nearby  atoms  may  be  preferentially  dislodged  by  chemical  action. 


SURFACE  IMPERFECTIONS:  STACKING  FAULTS 

There  is  a  more  complex  kind  of  slip,  mediated  by  dislocations,  in  which  the  applied 
stress  causes  the  coherent  formation  of  dislocations  in  successive  crystal  planes. 
As  each  dislocation  moves  through  the  crystal  it  leaves  in  its  wake  a  lattice  plane 
displaced  by  a  non-Bravais  lattice  vector,  and  the  result  of  the  passage  of  the  family 
of  dislocations  is  a  region  in  which  the  crystalline  ordering  is  the  mirror  image  (in 
the  plane  of  slip)  of  the  original  crystal.  Such  processes  are  known  as  “twinning,” 
and  the  inverted  region  is  known  as  a  “deformation  twin.” 

For example, in a perfect face-centered cubic crystal, successive (111) planes are arranged
in the pattern

$$\ldots ABCABCABCABC \ldots \tag{30.26}$$

as in Figure 4.21. After slip giving rise to a deformation twin, the pattern will be

$$\ldots ABCABCABCAB\underset{\updownarrow}{C}BACBACBACBA \ldots \tag{30.27}$$

where  the  double  arrow  indicates  the  boundary  of  the  slipped  region. 

Misplaced  planes  of  atoms  such  as  these  are  known  as  “stacking  faults.”  Another 
example  is  the  arrangement 

. . .  ABCABCABABCABCABC . . . ,  (30.28) 


25 W. L. Bragg and J. F. Nye, Proc. Roy. Soc. A190, 474 (1947).

in which a given plane (indicated by the double arrow) is out of step, falling into the
hexagonal close-packed sequence rather than that appropriate to face-centered cubic,
after which the regular (unmirrored) fcc arrangement is resumed.
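
The stacking patterns above are easy to generate and inspect programmatically. The following sketch is illustrative only: the helper names are hypothetical, and the labels "c" and "h" (a plane whose two neighbors differ, as in fcc, versus a plane whose two neighbors coincide, as in hcp) are a labeling convention assumed here rather than taken from the text. Mirroring the fcc sequence about its last plane reproduces the twinned pattern (30.27).

```python
def twin(seq):
    """Mirror a stacking sequence about its last plane (the twin boundary)."""
    return seq + seq[-2::-1]

def is_close_packed(seq):
    """In a close-packed stacking no two adjacent planes carry the same letter."""
    return all(a != b for a, b in zip(seq, seq[1:]))

def local_environment(seq):
    """Label each interior plane: 'c' if its neighbors differ (fcc-like),
    'h' if its neighbors are identical (hcp-like)."""
    return "".join(
        "h" if seq[i - 1] == seq[i + 1] else "c" for i in range(1, len(seq) - 1)
    )

fcc = "ABC" * 4                      # pattern (30.26)
twinned = twin(fcc)                  # pattern (30.27)
faulted = "ABCABCABABCABCABC"        # pattern (30.28)

print(twinned)
print(local_environment(fcc))        # all 'c'
print(local_environment(twinned))    # a single 'h' at the mirror plane
print(local_environment(faulted))    # a short run of 'h' at the fault
```

In this labeling the perfect crystal contains no hcp-like planes, the deformation twin contains exactly one (the mirror plane), and the fault of (30.28) contains a short hcp-like run before the fcc order resumes.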


LOW-ANGLE  GRAIN  BOUNDARY 

A grain boundary is formed by the junction of two single crystals of different orientation
along a common planar surface. When the difference in orientations is small,
the  boundary  is  referred  to  as  a  low-angle  grain  boundary.  An  example  known  as  a 
tilt  boundary  is  shown  in  Figure  30.18.  It  is  formed  from  a  linear  sequence  of  edge 
dislocations.  There  is  also  a  twist  boundary,  which  is  formed  from  a  sequence  of 
screw  dislocations.  In  general,  low-angle  grain  boundaries  are  composed  of  a  mixture 
of  both  types. 




Figure  30.18 

A  low-angle  tilt  boundary  (a  type  of  low  angle  grain 
boundary)  can  be  regarded  as  formed  from  a  sequence 
of  edge  dislocations.  If  the  section  B  of  the  crystal  is 
twisted  relative  to  A  about  the  axis  shown  by  a  small 
amount,  we  may  generate  (in  addition)  a  twist  component 
in  the  boundary.  A  twist  boundary,  if  of  small  angle, 
may  be  viewed  as  composed  of  a  sequence  of  screw 
dislocations. 


Unless very carefully prepared, most real crystals consist of many slightly misaligned
grains, separated by low-angle grain boundaries. The misalignment is little
enough  for  X-ray  diffraction  to  reveal  sharp  Bragg  peaks,  but  the  existence  of  the 
grains  has  an  important  effect  on  the  intensity  of  the  peaks. 


PROBLEMS 


1.  Phonon  Correction  to  the  Density  of  Vacancies 

A  more  accurate  treatment  of  the  equilibrium  number  of  vacancies  in  a  monatomic  Bravais 
lattice  would  multiply  (30.11)  by  a  phonon  correction  (see  (30.9)): 

$$n = N e^{-\beta\varepsilon_0}\, e^{-\beta(\partial F^{\mathrm{ph}}/\partial n)}. \tag{30.29}$$


Make a simple Einstein theory of the normal modes of the crystal with vacancies, i.e., treat each
ion as an independent oscillator, but let the oscillator frequency be either $\bar{\omega}_E$ or $\omega_E$, according
to whether the ion has or does not have one of its (z) nearest neighboring sites vacant. Show
that in this model (30.29) becomes:

$$n = N e^{-\beta\varepsilon_0} \left[\frac{1 - e^{-\beta\hbar\omega_E}}{1 - e^{-\beta\hbar\bar{\omega}_E}}\right]^{3z}. \tag{30.30}$$

Since $\bar{\omega}_E < \omega_E$ (why?) the phonon correction favors vacancy formation. Discuss its form when
$T \gg \Theta_E$ and when $T \ll \Theta_E$.


2.  Mixed  Schottky  and  Frenkel  Defects 

Consider a diatomic ionic crystal in which the energies of formation for positive and negative
ion vacancies and interstitials are given by $\varepsilon_+^v$, $\varepsilon_-^v$, $\varepsilon_+^i$, $\varepsilon_-^i$. If negative ion interstitials are
prohibited (i.e., if $\varepsilon_-^i$ is much larger than the others on the scale of $k_BT$), then positive ion vacancies
will be the only possible negatively charged defects. Their charge can be balanced either by
negative ion vacancies (Schottky) or positive ion interstitials (Frenkel) depending on whether
$\varepsilon_+^i - \varepsilon_-^v \gg k_BT$ or $\varepsilon_-^v - \varepsilon_+^i \gg k_BT$, respectively. In the Schottky case Eq. (30.19) gives

$$(n_+^v)_s = (n_-^v)_s = \left[N_+^v N_-^v e^{-\beta(\varepsilon_+^v + \varepsilon_-^v)}\right]^{1/2}, \tag{30.31}$$

and in the Frenkel case,

$$(n_+^v)_f = (n_+^i)_f = \left[N_+^v N_+^i e^{-\beta(\varepsilon_+^v + \varepsilon_+^i)}\right]^{1/2}. \tag{30.32}$$

Show that if neither case applies (i.e., if $\varepsilon_+^i - \varepsilon_-^v = O(k_BT)$), then the concentrations of the three
defect types will be given by

$$n_+^v = \left[(n_+^v)_s^2 + (n_+^v)_f^2\right]^{1/2}, \qquad n_-^v = \frac{(n_-^v)_s^2}{n_+^v}, \qquad n_+^i = \frac{(n_+^i)_f^2}{n_+^v}. \tag{30.33}$$


Verify  that  these  reduce  back  to  (30.31)  and  (30.32)  in  the  appropriate  limits. 

3.  Point  Defects  in  Calcium-Doped  Sodium  Chloride 

Consider a crystal of Ca-doped NaCl, with $n_{\mathrm{Ca}}$ calcium atoms per cubic centimeter. Noting that
pure NaCl has defects of the Schottky type with concentrations

$$n_+^v = n_-^v = n_i = (N_+ N_-)^{1/2} e^{-\beta(\varepsilon_+ + \varepsilon_-)/2}, \tag{30.34}$$

show that the defect densities in the doped crystal are given by

$$n_\pm^v = \frac{1}{2}\left[\sqrt{4n_i^2 + n_{\mathrm{Ca}}^2} \pm n_{\mathrm{Ca}}\right]. \tag{30.35}$$


(Note  the  similarity  to  the  theory  of  doped  semiconductors;  see  Eq.  (28.38).) 
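
A quick numerical check of this kind of result can be sketched as follows. It assumes two relations that the problem leaves to the reader: a law of mass action, n_+ n_- = n_i², and charge neutrality with each Ca²⁺ ion compensated by one extra positive-ion vacancy, n_+ = n_- + n_Ca. The function name and the sample densities are hypothetical.

```python
import math

def doped_vacancy_densities(n_i, n_ca):
    """Vacancy densities (positive-ion, negative-ion) satisfying
    n_plus * n_minus = n_i**2 and n_plus - n_minus = n_ca (both assumed)."""
    root = math.sqrt(4.0 * n_i**2 + n_ca**2)
    n_plus = 0.5 * (root + n_ca)
    # Algebraically equal to 0.5 * (root - n_ca), but free of cancellation
    # when the doping is much larger than the intrinsic density.
    n_minus = 2.0 * n_i**2 / (root + n_ca)
    return n_plus, n_minus

n_i = 1.0e12  # hypothetical intrinsic Schottky density, cm^-3
for n_ca in (0.0, 1.0e12, 1.0e15):
    n_plus, n_minus = doped_vacancy_densities(n_i, n_ca)
    print(f"n_Ca = {n_ca:.1e}: n_+ = {n_plus:.3e}, n_- = {n_minus:.3e}")
```

As in a doped semiconductor, heavy doping pins the majority defect density at n_Ca and suppresses the minority density to n_i²/n_Ca.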

4.  Shear  Stress  of  a  Perfect  Crystal 

Show  from  (22.82)  that  (30.22)  is  valid  for  a  cubic  crystal. 

5.  Simple  Model  of  an  F-Center 

Figure  30.19b  shows  the  positions  of  the  maxima  of  F-center  bands  (illustrated  for  the  chlorides 
in Figure 30.19a) as a function of lattice constant a. Take as a model of the F-center an electron

Figure 30.19

(a) Absorption bands of F-centers in some alkali chlorides. (b) The dependence on lattice
constant of the maximum in the F-center absorption band. (From Schulman and Compton,
Color Centers in Solids, Pergamon, New York, 1962.) [Axis labels: (a) wavelength (mμ);
(b) wavelength (μ).]


trapped by a vacancy potential of the form $V(r) = 0$, $r < d$; $V(r) = \infty$, $r > d$, where d is proportional
to the lattice constant a. Show that the spectrum scales as $1/d^2$, so that if the peaks
are associated with the same types of excitation,

$$\lambda_{\max} \propto a^2, \tag{30.36}$$

where $\lambda_{\max}$ is the wavelength corresponding to the observed maximum in the F-band absorption.
(Equation (30.36) is known as the Mollwo relation.)
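
The scaling can be made concrete with a small sketch. It assumes, beyond what the problem states, that the absorption corresponds to the lowest s-to-p transition of an electron in an infinite spherical well, whose levels are ħ²x²/2md² with x a zero of a spherical Bessel function (π for the lowest s state, about 4.4934 for the lowest p state). The constants are rounded CGS values and the well radius is hypothetical.

```python
import math

HBAR = 1.054572e-27   # erg s
M_E = 9.109384e-28    # electron mass, g
C = 2.997925e10       # speed of light, cm/s
X_1S = math.pi        # first zero of j_0
X_1P = 4.493409       # first zero of j_1 (root of tan x = x)

def transition_wavelength(d_cm):
    """Wavelength (cm) of the assumed 1s -> 1p transition in a well of radius d_cm."""
    delta_e = HBAR**2 * (X_1P**2 - X_1S**2) / (2.0 * M_E * d_cm**2)
    return 2.0 * math.pi * HBAR * C / delta_e

d = 3.0e-8  # hypothetical well radius, cm
print(transition_wavelength(d))
print(transition_wavelength(2.0 * d) / transition_wavelength(d))  # ratio of 4
```

Because every level carries the same factor 1/d², the ratio is independent of which pair of levels is chosen; only the prefactor depends on the assumed transition.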

6.  Burgers  Vector 

What  is  the  smallest  Burgers  vector  parallel  to  a  [111]  direction  that  a  dislocation  may  have  in 
an fcc crystal?

7.  Elastic  Energy  of  a  Screw  Dislocation 

Figure 30.20

A screw dislocation and its Burgers vector b.

Consider a region of crystal of radius r about a screw dislocation with Burgers vector b (Figure
30.20). Provided r is sufficiently large, the shear strain is $b/2\pi r$. (What happens close to the
dislocation?) Assuming that stress and strain are related by Eq. (30.23), show that the total elastic
energy per unit length of the screw dislocation is

$$G\,\frac{b^2}{4\pi}\,\ln\frac{R}{r_0}, \tag{30.37}$$


where  R  and  r0  are  upper  and  lower  limits  on  r.  What  physical  considerations  determine  reasonable 
values  for  these  quantities? 
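
A numerical version of this energy integral is sketched below. It assumes a linear stress-strain law with shear modulus G, so that the energy density is ½G(strain)², together with the strain b/2πr quoted in the problem. The function names and the sample values (CGS units) are hypothetical. The energy is summed over thin cylindrical shells on a logarithmic radial grid and compared with the logarithmic closed form.

```python
import math

def screw_energy_closed_form(G, b, R, r0):
    """(G b^2 / 4 pi) ln(R / r0): energy per unit length under the stated assumptions."""
    return G * b**2 / (4.0 * math.pi) * math.log(R / r0)

def screw_energy_numeric(G, b, R, r0, shells=2000):
    """Sum (1/2) G strain^2 over cylindrical shells between r0 and R."""
    h = math.log(R / r0) / shells
    total = 0.0
    for k in range(shells):
        r_lo = r0 * math.exp(k * h)
        r_hi = r0 * math.exp((k + 1) * h)
        r_mid = math.sqrt(r_lo * r_hi)
        strain = b / (2.0 * math.pi * r_mid)
        shell_area = math.pi * (r_hi**2 - r_lo**2)
        total += 0.5 * G * strain**2 * shell_area
    return total

G, b = 3.0e11, 3.0e-8    # hypothetical shear modulus (dyn/cm^2) and Burgers vector (cm)
R, r0 = 1.0e-4, 1.0e-8   # hypothetical outer and inner cutoffs (cm)
print(screw_energy_numeric(G, b, R, r0))
print(screw_energy_closed_form(G, b, R, r0))
```

The weak (logarithmic) dependence on both cutoffs is visible if R or r0 is changed by a factor of ten: the energy shifts by the same fixed amount each time.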
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In  the  preceding  chapters  we  have  considered  the  effect  of  a  magnetic  field  only  on 
metals,  and  only  insofar  as  the  motion  of  conduction  electrons  in  the  field  revealed 
the  metal’s  Fermi  surface.  In  the  next  three  chapters  we  turn  our  attention  to  some 
of  the  more  intrinsically  magnetic  properties  of  solids:  the  magnetic  moments  they 
exhibit  in  the  presence  (and  sometimes  even  in  the  absence)  of  applied  magnetic 
fields. 

In  this  chapter  we  shall  first  review  the  theory  of  atomic  magnetism.  We  shall 
then  consider  those  magnetic  properties  of  insulating  solids  that  can  be  understood 
in  terms  of  the  properties  of  their  individual  atoms  or  ions  with,  if  necessary,  suitable 
modifications  to  take  into  account  effects  of  the  crystalline  environment.  We  shall 
also  consider  those  magnetic  properties  of  metals  that  can  be  at  least  qualitatively 
understood  in  the  independent  electron  approximation. 

In  none  of  the  applications  of  this  chapter  shall  we  discuss  at  any  length  electron- 
electron  interactions.  This  is  because  in  the  case  of  insulators  we  shall  base  our 
analysis  on  results  of  atomic  physics  (whose  derivation  depends  critically,  of  course, 
on  such  interactions),  and  because  in  the  case  of  metals  the  phenomena  we  shall 
describe  here  are  at  least  roughly  accounted  for  in  an  independent  electron  model. 
In  Chapter  32  we  shall  turn  to  an  examination  of  the  physics  underlying  those 
electron-electron  interactions  that  can  profoundly  affect  the  characteristically  mag¬ 
netic  properties  of  metals  and  insulators.  In  Chapter  33  we  shall  describe  the  further 
magnetic  phenomena  (such  as  ferromagnetism  and  antiferromagnetism)  that  can 
result  from  these  interactions. 


MAGNETIZATION  DENSITY  AND  SUSCEPTIBILITY 


At T = 0 the magnetization density M(H) of a quantum-mechanical system of
volume V in a uniform magnetic field1 H is defined to be2

$$M(H) = -\frac{1}{V}\frac{\partial E_0(H)}{\partial H}, \tag{31.1}$$


where  E0(H)  is  the  ground-state  energy  in  the  presence  of  the  field  H.  If  the  system 
is  in  thermal  equilibrium  at  temperature  T,  then  one  defines  the  magnetization 
density  as  the  thermal  equilibrium  average  of  the  magnetization  density  of  each 
excited  state  of  energy  En(H): 

$$M(H, T) = \frac{\sum_n M_n(H)\, e^{-E_n/k_BT}}{\sum_n e^{-E_n/k_BT}}, \tag{31.2}$$


1  We  shall  take  H  to  be  the  field  that  acts  on  the  individual  microscopic  magnetic  moments 
within the solid. As in the case of a dielectric solid (cf. Chapter 27), this need not be the same as the
applied  field.  However,  for  the  paramagnetic  and  diamagnetic  substances  to  be  discussed  in  this  chapter 
the  local  field  corrections  are  very  small,  and  will  be  ignored. 

2 For simplicity we assume that M is parallel to H. More generally one should write a vector equation:
$M_\mu = -(1/V)\,\partial E_0/\partial H_\mu$, and the susceptibility (defined below) will be a tensor. In Problem 1 it is shown
that this definition is equivalent to the more familiar one of Ampere encountered in the conventional
formulations of classical macroscopic electrodynamics.

where


$$M_n(H) = -\frac{1}{V}\frac{\partial E_n(H)}{\partial H}. \tag{31.3}$$

This can also be written in the thermodynamic form

$$M = -\frac{1}{V}\frac{\partial F}{\partial H}, \tag{31.4}$$

where F, the magnetic Helmholtz free energy, is defined by the fundamental statistical
mechanical rule:

$$e^{-F/k_BT} = \sum_n e^{-E_n(H)/k_BT}. \tag{31.5}$$

The susceptibility is defined as3

$$\chi = \frac{\partial M}{\partial H} = -\frac{1}{V}\frac{\partial^2 F}{\partial H^2}. \tag{31.6}$$

The magnetization can be measured by recording the force exerted on a specimen
by an inhomogeneous field that varies slowly over the sample, for the change in free
energy on moving the specimen4 from x to x + dx will be:5

$$dF = F(H(x + dx)) - F(H(x)) = \frac{\partial F}{\partial H}\frac{\partial H}{\partial x}\,dx = -VM\frac{\partial H}{\partial x}\,dx, \tag{31.7}$$

and therefore the force per unit volume f exerted on the specimen by the field is

$$f = -\frac{1}{V}\frac{dF}{dx} = M\frac{\partial H}{\partial x}. \tag{31.8}$$

CALCULATION  OF  ATOMIC  SUSCEPTIBILITIES: 

GENERAL  FORMULATION 

In  the  presence  of  a  uniform  magnetic  field  the  Hamiltonian  of  an  ion  (or  atom)  is 
modified  in  the  following  major  ways:6 


3 Often, as we shall see, M is very accurately linear in H for attainable field strengths, in which case
the definition reduces to $\chi = M/H$. Note also that $\chi$ is dimensionless (in CGS units) since $H^2$ has the
dimensions of energy per unit volume.

4 Equal to the mechanical work done on the specimen if the temperature is held constant.

5 We take the field to be along the z-direction and move the sample in the x-direction.

6  Note  the  following  ways  in  which,  generally,  one  does  not  bother  to  modify  it.  One  almost  always 
neglects  the  effects  of  the  magnetic  field  on  the  translational  motion  of  the  ion ;  i.e.,  one  does  not  make 
the  replacement  (31.9)  for  the  momentum  operators  describing  the  ionic  nuclei.  Furthermore,  unless  one 
is  explicitly  interested  in  nuclear  spin  effects  (as  in  the  case  of  magnetic  resonance  experiments)  one  ignores 
the  analogue  of  (31.12)  for  the  nuclear  spins.  In  both  cases  these  simplifications  are  justified  by  the  very 
much  greater  nuclear  mass,  which  makes  the  nuclear  contribution  to  the  magnetic  moment  of  the  solid, 
some $10^6$ to $10^8$ times smaller than the electronic contribution. Finally, the replacement (31.9) in the
electron momentum operators that appear in the terms describing spin-orbit coupling leads to very
small corrections compared with the direct coupling of the electron spin to the magnetic field, and is also
generally ignored.

1. In the total electronic kinetic energy, $T_0 = \sum_i p_i^2/2m$, the momentum of each
electron (of charge $-e$) is replaced by7

$$\mathbf{p}_i \to \mathbf{p}_i + \frac{e}{c}\,\mathbf{A}(\mathbf{r}_i), \tag{31.9}$$

where  A  is  the  vector  potential.  In  this  chapter  we  shall  take  A  in  a  uniform  field 
H  to  have  the  form: 

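$$\mathbf{A} = -\tfrac{1}{2}\,\mathbf{r} \times \mathbf{H}, \tag{31.10}$$

so that the conditions

$$\mathbf{H} = \nabla \times \mathbf{A} \quad \text{and} \quad \nabla \cdot \mathbf{A} = 0 \tag{31.11}$$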

are  both  satisfied. 

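2. The interaction energy of the field with each electron spin $\mathbf{s}_i = \tfrac{1}{2}\boldsymbol{\sigma}_i$ must be
added to the Hamiltonian:8

$$\Delta\mathcal{H} = g_0 \mu_B H S_z, \qquad \Big(S_z = \sum_i s_z^i\Big). \tag{31.12}$$

Here $\mu_B$, the Bohr magneton, is given by

$$\mu_B = \frac{e\hbar}{2mc} = 0.927 \times 10^{-20}\ \text{erg/G} = 0.579 \times 10^{-8}\ \text{eV/G}, \tag{31.13}$$

and $g_0$, the electronic g-factor, is given by

$$g_0 = 2\left[1 + \frac{\alpha}{2\pi} + O(\alpha^2) + \cdots\right] = 2.0023, \qquad \alpha = \frac{e^2}{\hbar c} \approx \frac{1}{137}, \tag{31.14}$$

which, to the accuracy of most measurements of interest in solids, can be taken
to be precisely 2.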
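
The numerical values quoted for the Bohr magneton and the g-factor can be reproduced from rounded Gaussian (CGS) constants. The sketch below is only an arithmetic check; the constant names are hypothetical, and g_0 is evaluated to first order in α, dropping the O(α²) terms.

```python
import math

E_ESU = 4.80320e-10       # electron charge, esu
HBAR = 1.054572e-27       # erg s
M_E = 9.109384e-28        # electron mass, g
C = 2.997925e10           # speed of light, cm/s
ERG_PER_EV = 1.602177e-12

mu_b_erg = E_ESU * HBAR / (2.0 * M_E * C)      # Bohr magneton, erg/G
mu_b_ev = mu_b_erg / ERG_PER_EV                # Bohr magneton, eV/G
alpha = E_ESU**2 / (HBAR * C)                  # fine structure constant
g0_first_order = 2.0 * (1.0 + alpha / (2.0 * math.pi))

print(f"mu_B = {mu_b_erg:.4e} erg/G = {mu_b_ev:.4e} eV/G")
print(f"1/alpha = {1.0 / alpha:.2f}, g_0 (first order in alpha) = {g0_first_order:.5f}")
```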

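As a result of (31.9) the total electronic kinetic energy operator must be replaced by

$$T = \frac{1}{2m}\sum_i \left[\mathbf{p}_i + \frac{e}{c}\,\mathbf{A}(\mathbf{r}_i)\right]^2 = \frac{1}{2m}\sum_i \left(\mathbf{p}_i - \frac{e}{2c}\,\mathbf{r}_i \times \mathbf{H}\right)^2, \tag{31.15}$$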


7 In a purely classical theory (taking the electron’s spin to be a quantum phenomenon) this would
be the only effect of the field. It can then easily be demonstrated from classical statistical mechanics that
the thermal equilibrium magnetization must always vanish (Bohr-van Leeuwen theorem), for the sum
defining the free energy becomes an integral over a 6N-dimensional N-electron phase space:

$$e^{-\beta F} = \int \prod_{i=1}^{N} d\mathbf{p}_i\, d\mathbf{r}_i\, \exp[-\beta H(\mathbf{r}_1, \ldots, \mathbf{r}_N; \mathbf{p}_1, \ldots, \mathbf{p}_N)].$$

Since the magnetic field enters only in the form $\mathbf{p}_i + e\mathbf{A}(\mathbf{r}_i)/c$ it can be eliminated entirely by a simple shift
of origin in the momentum integrations (the limits of which run from $-\infty$ to $\infty$ and are therefore unaffected
by the shift). But if F does not depend on H, then the magnetization, being proportional to $\partial F/\partial H$, must
vanish. Thus a quantum theory is required from the start to explain any magnetic phenomena.

8 In magnetic problems we use script $\mathcal{H}$ for the Hamiltonian to avoid confusion with the magnetic
field strength H. We also use dimensionless spins (with integral or half-integral values) so that the angular
momentum is $\hbar$ times the spin.

which can be expanded to give

$$T = T_0 + \mu_B \mathbf{L} \cdot \mathbf{H} + \frac{e^2}{8mc^2} H^2 \sum_i (x_i^2 + y_i^2), \tag{31.16}$$

where  L  is  the  total  electronic  orbital  angular  momentum:9 


$$\hbar\mathbf{L} = \sum_i \mathbf{r}_i \times \mathbf{p}_i. \tag{31.17}$$


The  spin  term  (31.12)  combines  with  (31.16)  to  give  the  following  field-dependent 
terms  in  the  Hamiltonian: 

$$\Delta\mathcal{H} = \mu_B(\mathbf{L} + g_0\mathbf{S}) \cdot \mathbf{H} + \frac{e^2}{8mc^2} H^2 \sum_i (x_i^2 + y_i^2). \tag{31.18}$$


We  shall  see  below  that  the  energy  shifts  produced  by  (31.18)  are  generally  quite 
small  on  the  scale  of  atomic  excitation  energies,  even  for  the  highest  presently  attain¬ 
able  laboratory  field  strengths.  Therefore  one  can  compute  the  changes  in  the  energy 
levels  induced  by  the  field  with  ordinary  perturbation  theory.  To  compute  the 
susceptibility,  a  second  derivative  with  respect  to  the  field,  one  must  retain  terms  up 
to  the  second  order  in  H,  and  must  therefore  use  the  famous  result  of  second-order 
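perturbation theory:10

$$E_n \to E_n + \Delta E_n; \qquad \Delta E_n = \langle n|\Delta\mathcal{H}|n\rangle + \sum_{n' \neq n} \frac{|\langle n|\Delta\mathcal{H}|n'\rangle|^2}{E_n - E_{n'}}. \tag{31.19}$$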
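
The second-order formula can be checked against an exactly solvable case. The sketch below uses a hypothetical two-level system with unperturbed energies e0 and e1 and a real symmetric perturbation with matrix elements v00, v01, v11 scaled by a small parameter lam. None of these numbers comes from the text. The exact level is taken from the closed-form eigenvalue of a 2x2 symmetric matrix.

```python
import math

def exact_level(e0, e1, v00, v01, v11, lam):
    """Exact eigenvalue of [[e0 + lam*v00, lam*v01], [lam*v01, e1 + lam*v11]]
    on the branch that reduces to e0 as lam -> 0 (nondegenerate case)."""
    a = e0 + lam * v00
    d = e1 + lam * v11
    mean = 0.5 * (a + d)
    half_gap = math.hypot(0.5 * (a - d), lam * v01)
    return mean - half_gap if e0 < e1 else mean + half_gap

def first_order_level(e0, v00, lam):
    """Unperturbed energy plus the diagonal matrix element."""
    return e0 + lam * v00

def second_order_level(e0, e1, v00, v01, lam):
    """First-order result plus |<0|V|1>|^2 / (E_0 - E_1)."""
    return first_order_level(e0, v00, lam) + (lam * v01) ** 2 / (e0 - e1)

params = dict(e0=0.0, e1=1.0, v00=0.3, v01=0.5, v11=-0.2)
lam = 0.01
exact = exact_level(lam=lam, **params)
print(exact - first_order_level(params["e0"], params["v00"], lam))
print(exact - second_order_level(params["e0"], params["e1"], params["v00"], params["v01"], lam))
```

The residual after the second-order correction is of third order in lam, so halving lam should reduce it by roughly a factor of eight.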


Inserting  (31.18)  into  (31.19)  and  retaining  terms  through  those  quadratic  in  H, 
we  find  that  to  second  order,11 


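$$\Delta E_n = \mu_B \mathbf{H} \cdot \langle n|\mathbf{L} + g_0\mathbf{S}|n\rangle + \sum_{n' \neq n} \frac{|\langle n|\mu_B \mathbf{H} \cdot (\mathbf{L} + g_0\mathbf{S})|n'\rangle|^2}{E_n - E_{n'}} + \frac{e^2}{8mc^2} H^2 \langle n|\sum_i (x_i^2 + y_i^2)|n\rangle. \tag{31.20}$$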


Equation (31.20) is the basis for theories of the magnetic susceptibility of individual
atoms,  ions,  or  molecules.  It  also  underlies  theories  of  the  susceptibilities  of  those 
solids  that  can  be  represented  as  a  collection  of  only  slightly  deformed  individual 
ions,  i.e.,  ionic  and  molecular  solids.  In  such  cases  the  susceptibility  is  computed 
ion  by  ion. 


9 We measure L in the same dimensionless units as the spin, so that each component of L has integral
eigenvalues, and the orbital angular momentum in conventional units is $\hbar\mathbf{L}$. We also use sans serif boldface
type to denote angular momentum operators. Note also that by L we shall mean the vector operator
whose components are $L_x$, $L_y$, and $L_z$. (Similar remarks apply to the spin operator S, and the total angular
momentum operator J.)

10 D. Park, Introduction to the Quantum Theory, McGraw-Hill, New York, 1964, Chapter 8. Note
that if the nth level is degenerate, as is often the case, the states n must be chosen so as to diagonalize $\Delta\mathcal{H}$
in the degenerate subspace. This is not hard to arrange, as we shall see.

11 The quantity $e^2/mc^2$ may be written as $\alpha^2 a_0$.

Before applying (31.20) to particular cases, we first observe that unless the term
that is linear in H vanishes identically (as it sometimes does), it will almost always
be the dominant term even when the field is very strong (~ $10^4$ gauss). If it does
not vanish, $\langle n|(L_z + g_0 S_z)|n\rangle$ will be of order unity so that

$$\mu_B \mathbf{H} \cdot \langle n|(\mathbf{L} + g_0\mathbf{S})|n\rangle = O(\mu_B H) \sim \frac{e\hbar H}{mc} \sim \hbar\omega_c. \tag{31.21}$$


This is of order $10^{-4}$ eV when H is of order $10^4$ gauss (thereby substantiating our
earlier assertion that the energy shifts are small). To estimate the size of the last
term in $\Delta\mathcal{H}$, we note that $\langle n|(x_i^2 + y_i^2)|n\rangle$ will be of the order of the square of a
typical atomic dimension, so that

$$\frac{e^2}{8mc^2} H^2 \langle n|\sum_i (x_i^2 + y_i^2)|n\rangle = O\left[\left(\frac{eH}{mc}\right)^2 m a_0^2\right] \sim \hbar\omega_c \left(\frac{\hbar\omega_c}{e^2/a_0}\right). \tag{31.22}$$

Since $e^2/a_0$ is about 27 eV, this term is smaller than the linear term (31.21) by a
factor of about $10^{-5}$, even at fields as high as $10^4$ gauss. The second term in (31.20)
can also be shown to be smaller than the first by a factor of order $\hbar\omega_c/\Delta$, where
$\Delta = \min |E_n - E_{n'}|$ is a typical atomic excitation energy. In most cases $\Delta$ will be
large enough to make this factor quite small.
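
These order-of-magnitude estimates are simple to reproduce. The sketch below uses the rounded Bohr magneton 0.579 × 10⁻⁸ eV/G and takes e²/a₀ as 27.2 eV; it identifies ħω_c with eħH/mc = 2μ_B H and drops all factors of order unity, as the text does. The function name is hypothetical.

```python
MU_B_EV = 0.579e-8       # Bohr magneton, eV/G (rounded)
E2_OVER_A0_EV = 27.2     # e^2 / a_0 in eV (about 27 eV)

def field_energy_scales(H_gauss):
    """Return (linear term, quadratic term, their ratio) as rough energy scales in eV."""
    hbar_omega_c = 2.0 * MU_B_EV * H_gauss          # e hbar H / m c
    ratio = hbar_omega_c / E2_OVER_A0_EV            # suppression of the H^2 term
    quadratic = hbar_omega_c * ratio
    return hbar_omega_c, quadratic, ratio

linear, quadratic, ratio = field_energy_scales(1.0e4)
print(f"linear ~ {linear:.2e} eV, quadratic ~ {quadratic:.2e} eV, ratio ~ {ratio:.1e}")
```

Because the ratio grows only linearly with H, the quadratic term remains a small correction wherever the linear term is present, at any field reachable in the laboratory.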


SUSCEPTIBILITY  OF  INSULATORS  WITH  ALL  SHELLS  FILLED: 
LARMOR  DIAMAGNETISM 

The  simplest  application  of  these  results  is  to  a  solid  composed  of  ions12  with  all 
electronic  shells  filled.  Such  an  ion  has  zero  spin  and  orbital  angular  momentum  in 
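its ground state13 $|0\rangle$:

$$\mathbf{J}|0\rangle = \mathbf{L}|0\rangle = \mathbf{S}|0\rangle = 0. \tag{31.23}$$

Consequently only the third term in (31.20) contributes to the field-induced shift in
the ground-state energy:14

$$\Delta E_0 = \frac{e^2}{8mc^2} H^2 \langle 0|\sum_i (x_i^2 + y_i^2)|0\rangle = \frac{e^2}{12mc^2} H^2 \langle 0|\sum_i r_i^2|0\rangle. \tag{31.24}$$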


If  (as  is  the  case  at  all  but  very  high  temperatures)  there  is  negligible  probability  of 
the  ion  being  in  any  but  its  ground  state  in  thermal  equilibrium,  then  the  susceptibility 
of  a  solid  composed  of  N  such  ions  is  given  by 


$$\chi = -\frac{N}{V}\frac{\partial^2 \Delta E_0}{\partial H^2} = -\frac{e^2}{6mc^2}\frac{N}{V}\langle 0|\sum_i r_i^2|0\rangle. \tag{31.25}$$


12  As  in  earlier  chapters  we  continue  to  use  the  term  “ion”  to  mean  ion  or  atom,  the  latter  being 
an  ion  of  charge  0. 

13  This  is  because  the  ground  state  of  a  closed-shell  ion  is  spherically  symmetric.  It  is  also  an  especially 
simple  consequence  of  Hund’s  rules  (see  below). 

14 The last form follows from the spherical symmetry of the closed-shell ion:

$$\langle 0|\sum_i x_i^2|0\rangle = \langle 0|\sum_i y_i^2|0\rangle = \langle 0|\sum_i z_i^2|0\rangle = \frac{1}{3}\langle 0|\sum_i r_i^2|0\rangle.$$

This is known as the Larmor diamagnetic susceptibility.15 The term diamagnetism
is  applied  to  cases  of  negative  susceptibility — i.e.,  cases  in  which  the  induced  moment 
is  opposite  to  the  applied  field. 

Equation  (31.25)  should  describe  the  magnetic  response  of  the  solid  noble  gases 
and  of  simple  ionic  crystals  such  as  the  alkali  halides,  since  in  these  solids  the  ions 
are  only  slightly  distorted  by  their  crystalline  environment.  Indeed,  in  the  alkali 
halides  the  susceptibilities  can  be  represented,  to  within  a  few  percent,  as  a  sum  of 
independent susceptibilities for the positive and negative ions. These ionic
susceptibilities also give accurately the contribution of the alkali halides to the susceptibility
of  solutions  in  which  they  are  dissolved. 

Susceptibilities are usually quoted as molar susceptibilities, based on the magnetization
per mole, rather than per cubic centimeter. Thus $\chi^{\text{molar}}$ is given by multiplying
$\chi$ by the volume of a mole, $N_A/[N/V]$, where $N_A$ is Avogadro's number. It is also
conventional to define a mean square ionic radius by

$$\langle r^2\rangle = \frac{1}{Z_i}\sum_i \langle 0|r_i^2|0\rangle, \tag{31.26}$$

where $Z_i$ is the total number of electrons in the ion. Thus the molar susceptibility
is written:

$$\chi^{\text{molar}} = -Z_i N_A \frac{e^2}{6mc^2}\langle r^2\rangle = -Z_i \left(\frac{e^2}{\hbar c}\right)^2 \frac{N_A a_0^3}{6}\langle (r/a_0)^2\rangle. \tag{31.27}$$

Since $a_0 = 0.529$ Å, $e^2/\hbar c = 1/137$, and $N_A = 0.6022 \times 10^{24}$,

$$\chi^{\text{molar}} = -0.79\, Z_i \times 10^{-6} \langle (r/a_0)^2\rangle \ \text{cm}^3/\text{mole}. \tag{31.28}$$

The quantity $\langle (r/a_0)^2\rangle$ is of order unity, as is the number of moles per cubic centimeter
(by which the molar susceptibility must be multiplied to get the dimensionless
susceptibility defined in (31.6)). We conclude that diamagnetic susceptibilities are typically
of order $10^{-5}$; i.e., M is minute compared with H.
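
The numerical prefactor in (31.28) follows directly from the three constants quoted with it. A minimal sketch of that arithmetic is given below; the function names are hypothetical, chosen only for this illustration.

```python
# Evaluate the prefactor (e^2/hbar c)^2 * N_A * a_0^3 / 6 of Eq. (31.27),
# using the constants quoted in the text just before Eq. (31.28).
ALPHA = 1.0 / 137.0      # e^2/(hbar c)
A0_CM = 0.529e-8         # Bohr radius in cm (0.529 angstrom)
N_AVOGADRO = 0.6022e24   # Avogadro's number


def molar_prefactor() -> float:
    """Prefactor of Eq. (31.27) in cm^3/mole, per electron and per unit <(r/a_0)^2>."""
    return ALPHA ** 2 * N_AVOGADRO * A0_CM ** 3 / 6.0


def chi_molar(z_electrons: int, mean_sq_radius_a0: float) -> float:
    """Larmor molar susceptibility, Eq. (31.27), in cm^3/mole."""
    return -z_electrons * molar_prefactor() * mean_sq_radius_a0


print(f"prefactor = {molar_prefactor():.3e} cm^3/mole")
```

The prefactor evaluates to about $0.79 \times 10^{-6}$ cm³/mole, as stated in (31.28).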

Molar  susceptibilities  for  the  noble  gases  and  the  alkali  halide  ions  are  given  in 
Table  31.1. 


Table  31.1 

MOLAR SUSCEPTIBILITIES OF NOBLE GAS ATOMS AND ALKALI HALIDE IONS^a

ELEMENT   SUSCEPTIBILITY     ELEMENT   SUSCEPTIBILITY     ELEMENT   SUSCEPTIBILITY
                             He          -1.9             Li+         -0.7
F-          -9.4             Ne          -7.2             Na+         -6.1
Cl-        -24.2             A          -19.4             K+         -14.6
Br-        -34.5             Kr         -28               Rb+        -22.0
I-         -50.6             Xe         -43               Cs+        -35.1

a In units of $10^{-6}$ cm³/mole. Ions in each row have the same electronic configuration.
Source: R. Kubo and T. Nagamiya, eds., Solid State Physics, McGraw-Hill, New York, 1969, p. 439.
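
As a consistency check on the claim that $\langle (r/a_0)^2\rangle$ is of order unity, (31.28) can be inverted for each entry of Table 31.1. The sketch below does this; the electron counts $Z_i$ are supplied here from the standard atomic numbers and ionic charges and are not listed in the table itself.

```python
# Invert Eq. (31.28): <(r/a_0)^2> = -chi_molar / (0.79e-6 * Z_i).
# Susceptibilities are the Table 31.1 values in units of 1e-6 cm^3/mole.
# Electron counts Z_i are added for this sketch (assumption: closed-shell ions).
TABLE_31_1 = {
    # species: (Z_i, chi_molar in 1e-6 cm^3/mole)
    "He": (2, -1.9), "Li+": (2, -0.7),
    "F-": (10, -9.4), "Ne": (10, -7.2), "Na+": (10, -6.1),
    "Cl-": (18, -24.2), "A": (18, -19.4), "K+": (18, -14.6),
    "Br-": (36, -34.5), "Kr": (36, -28.0), "Rb+": (36, -22.0),
    "I-": (54, -50.6), "Xe": (54, -43.0), "Cs+": (54, -35.1),
}


def mean_square_radius(z_electrons: int, chi_in_1e6_units: float) -> float:
    """Mean square radius per electron in units of a_0^2, from Eq. (31.28)."""
    return -chi_in_1e6_units / (0.79 * z_electrons)


radii = {name: mean_square_radius(z, chi) for name, (z, chi) in TABLE_31_1.items()}
for name, value in radii.items():
    print(f"{name:4s} <(r/a0)^2> = {value:.2f}")
```

Every entry gives a value between roughly 0.4 and 1.7, and within each row the negative ion is the largest and the positive ion the smallest, as expected for isoelectronic species with increasing nuclear charge.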


15 It is also frequently referred to as the Langevin susceptibility.

When a solid contains some ions with partially filled electronic shells, its magnetic
behavior  is  very  different.  Before  we  can  apply  the  general  result  (31.20)  to  this  case, 
we  must  review  the  basic  facts  about  the  low-lying  states  of  such  ions. 


GROUND  STATE  OF  IONS  WITH  A  PARTIALLY  FILLED  SHELL: 
HUND’S  RULES 

Suppose we have a free16 atom or ion in which all electronic shells are filled or
empty except for one, whose one-electron levels are characterized by orbital angular
momentum $l$. Since for given $l$ there are $2l + 1$ values $l_z$ can have ($l$, $l - 1$,
$l - 2$, ..., $-l$) and two possible spin directions for each $l_z$, such a shell will contain
$2(2l + 1)$ one-electron levels. Let $n$ be the number of electrons in the shell, with
$0 < n < 2(2l + 1)$. If the electrons did not interact with one another, the ionic
ground state would be degenerate, reflecting the large number of ways of putting
n electrons into more than n levels. However, this degeneracy is considerably (though
in  general  not  completely)  lifted  by  electron-electron  Coulomb  interactions  as  well 
as  by  the  electron  spin-orbit  interaction.  Except  for  the  very  heaviest  ions  (where 
spin-orbit  coupling  is  very  strong)  the  lowest-lying  levels  after  the  degeneracy  is 
lifted  can  be  described  by  a  simple  set  of  rules,  justified  both  by  complex  calculations 
and  by  the  analysis  of  atomic  spectra.  Here  we  shall  simply  state  the  rules,  since  we 
are  more  interested  in  their  implications  for  the  magnetic  properties  of  solids  than 
in  their  underlying  justification.17 

1.  Russell-Saunders Coupling  To a good approximation18 the Hamiltonian of the
atom or ion can be taken to commute with the total electronic spin and orbital
angular momenta, S and L, as well as with the total electronic angular momentum
J = L + S. Therefore the states of the ion can be described by quantum numbers
$L$, $L_z$, $S$, $S_z$, $J$, and $J_z$, indicating that they are eigenstates of the operators $\mathbf{L}^2$, $L_z$,
$\mathbf{S}^2$, $S_z$, $\mathbf{J}^2$, and $J_z$ with eigenvalues $L(L + 1)$, $L_z$, $S(S + 1)$, $S_z$, $J(J + 1)$, and $J_z$,
respectively. Since filled shells have zero orbital, spin, and total angular momentum,
these quantum numbers describe the electronic configuration of the partially filled
shell, as well as the ion as a whole.

2.  Hund's First Rule  Out of the many states one can form by placing n electrons
into the $2(2l + 1)$ levels of the partially filled shell, those that lie lowest in energy
have the largest total spin $S$ that is consistent with the exclusion principle. To see
what that value is, one notes that the largest value $S$ can have is equal to the largest
magnitude that $S_z$ can have. If $n \le 2l + 1$, all electrons can have parallel spins
without multiple occupation of any one-electron level in the shell, by assigning them
levels with different values of $l_z$. Hence $S = \frac{1}{2}n$, when $n \le 2l + 1$. When $n = 2l + 1$,

16  We  shall  discuss  how  the  behavior  of  the  free  atom  or  ion  is  modified  by  the  crystalline  environment 
on  pages  656-659. 

17 The rules are discussed in most quantum mechanics texts. See, for example, L. D. Landau and
E. M. Lifshitz, Quantum Mechanics, Addison Wesley, Reading, Mass., 1965.

18 The total angular momentum J is always a good quantum number for an atom or ion, but L and
S are good quantum numbers only to the extent that spin-orbit coupling is unimportant.

$S$ has its maximum value, $l + \frac{1}{2}$. Since electrons after the $(2l + 1)$th are required by
the exclusion principle to have their spins opposite to the spins of the first $2l + 1$, $S$ is
reduced from its maximum value by half a unit for each electron after the $(2l + 1)$th.

3.  Hund's Second Rule  The total orbital angular momentum $L$ of the lowest-lying
states has the largest value that is consistent with Hund's first rule, and with the
exclusion principle. To determine that value, one notes that it is equal to the largest
magnitude that $L_z$ can have. Thus the first electron in the shell will go into a level
with $|l_z|$ equal to its maximum value $l$. The second, according to rule 2, must have the
same spin as the first, and is therefore forbidden by the exclusion principle from
having the same value of $l_z$. The best it can do is to have $|l_z| = l - 1$, leading to a
total $L$ of $l + (l - 1) = 2l - 1$. Continuing in this way, if the shell is less than half
filled, we will have $L = l + (l - 1) + \cdots + [l - (n - 1)]$. When the shell is precisely
half filled, all values of $l_z$ must be assumed, and therefore $L = 0$. The second
half of the shell is filled with electrons with spin opposite to those in the first half,
and therefore the exclusion principle allows us again to go through the same series
of values for $L$ we traversed in filling the first half.

4.  Hund's Third Rule  The first two rules determine the values of $L$ and $S$ assumed
by the states of lowest energy. This still leaves $(2L + 1)(2S + 1)$ possible states.
These can be further classified according to their total angular momentum $J$, which,
according to the basic rules of angular momentum composition, can take on
all integral values between $|L - S|$ and $L + S$. The degeneracy of the set of
$(2L + 1)(2S + 1)$ states is lifted by the spin-orbit coupling, which, within this set of
states, can be represented by a term in the Hamiltonian of the simple form $\lambda(\mathbf{L} \cdot \mathbf{S})$.
Spin-orbit coupling will favor maximum $J$ (parallel orbital and spin angular momenta)
if $\lambda$ is negative, and minimum $J$ (antiparallel orbital and spin angular momenta) if
$\lambda$ is positive. As it turns out, $\lambda$ is positive for shells that are less than half filled and
negative for shells that are more than half filled. As a result, the value $J$ assumes in
the states of lowest energy is:

$$J = |L - S|, \quad n \le (2l + 1),$$
$$J = L + S, \quad n \ge (2l + 1). \tag{31.29}$$

In magnetic problems one usually deals only with the set of $(2L + 1)(2S + 1)$
states determined by Hund's first two rules, all others lying so much higher in energy
as to be of no interest. Furthermore, it is often enough to consider only the $2J + 1$
lowest lying of these specified by the third rule.

The rules are easier to apply than their description might suggest; indeed, in
determining the lowest lying J-multiplet (known as a term) for ions in a solid, one
really encounters only 22 cases of interest: 1 to 9 electrons in a d-shell ($l = 2$) or
1 to 13 electrons in an f-shell ($l = 3$).19 For unfortunate historical reasons the
ground-state multiplet in these cases is not described by the simple triad of numbers

19 Partially filled p-shells contain valence electrons, and invariably broaden into bands in the solid.
Thus the configuration of the electrons they contain in the solid is in no sense a slight distortion of the
configuration in the free atom, and the analysis of this chapter is inapplicable.

SLJ. Instead, the orbital angular momentum $L$ is given by a letter, according to
the hallowed spectroscopic code:

$$\begin{array}{rccccccc} L = & 0 & 1 & 2 & 3 & 4 & 5 & 6 \\ X = & S & P & D & F & G & H & I \end{array} \tag{31.30}$$

The spin is specified by affixing the number $2S + 1$ (known as the multiplicity) to
the letter as a superprefix, and only $J$ is given as the number $J$, affixed as a right
subscript. Thus the lowest lying J-multiplet is described by the symbol: $^{(2S+1)}X_J$.

The  cases  of  major  interest  for  the  study  of  magnetism  in  solids  are  given  in 
Table  31.2. 
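
The three rules and the spectroscopic code (31.30) are mechanical enough to be sketched in a few lines. The following is an illustrative implementation, not a reproduction of any table in the text; the function names are hypothetical, and the term symbol is printed in a flat form such as `4F3/2` for $^4F_{3/2}$.

```python
from fractions import Fraction

SPECTROSCOPIC_LETTERS = "SPDFGHI"  # the code (31.30) for L = 0, 1, ..., 6


def hund_ground_state(l: int, n: int):
    """Return (S, L, J) for n electrons in a shell of orbital angular momentum l."""
    half = 2 * l + 1
    if not 0 <= n <= 2 * half:
        raise ValueError("n must lie between 0 and 2(2l + 1)")
    # First rule: as many parallel spins as the exclusion principle allows.
    n_up = min(n, half)
    n_down = n - n_up
    S = Fraction(n_up - n_down, 2)
    # Second rule: each spin direction fills l_z = l, l-1, ... in turn.
    lz_values = list(range(l, -l - 1, -1))
    L = sum(lz_values[:n_up]) + sum(lz_values[:n_down])
    # Third rule, Eq. (31.29).
    J = abs(L - S) if n <= half else L + S
    return S, L, J


def term_symbol(l: int, n: int) -> str:
    """Flat-text term symbol, multiplicity + letter + J, e.g. '4F3/2'."""
    S, L, J = hund_ground_state(l, n)
    return f"{2 * S + 1}{SPECTROSCOPIC_LETTERS[L]}{J}"


for n in range(1, 10):
    print(f"d^{n}: {term_symbol(2, n)}")
for n in range(1, 14):
    print(f"f^{n}: {term_symbol(3, n)}")
```

The two loops cover the 22 cases of interest mentioned above: 1 to 9 electrons in a d-shell and 1 to 13 electrons in an f-shell.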


Table  31.2 

GROUND STATES OF IONS WITH PARTIALLY FILLED d- OR f-SHELLS,
AS CONSTRUCTED FROM HUND'S RULES

SUSCEPTIBILITY OF INSULATORS CONTAINING IONS WITH A
PARTIALLY  FILLED  SHELL:  PARAMAGNETISM 


There  are  two  cases  to  distinguish: 

1.  If  the  shell  has  J  =  0  (as  is  the  case  for  shells  that  are  one  electron  short  of 
being  half  filled)  then  the  ground  state  is  nondegenerate  (as  in  the  case  of  a  filled  shell) 
and  the  linear  term  in  the  energy  shift  (31.20)  vanishes.20  However  (in  contrast  to 
the  case  of  a  filled  shell),  the  second  term  in  (31.20)  need  not  vanish,  and  the  shift 
in  the  ground-state  energy  due  to  the  field  will  be  given  by 


$$\Delta E_0 = \frac{e^2}{8mc^2} H^2 \langle 0|\sum_i (x_i^2 + y_i^2)|0\rangle - \sum_n \frac{|\langle 0|\mu_B \mathbf{H} \cdot (\mathbf{L} + g_0\mathbf{S})|n\rangle|^2}{E_n - E_0}. \tag{31.31}$$

When the solid contains N/V such ions per unit volume, the susceptibility is

$$\chi = -\frac{N}{V}\frac{\partial^2 E_0}{\partial H^2} = -\frac{N}{V}\left[\frac{e^2}{4mc^2}\langle 0|\sum_i (x_i^2 + y_i^2)|0\rangle - 2\mu_B^2 \sum_n \frac{|\langle 0|(L_z + g_0 S_z)|n\rangle|^2}{E_n - E_0}\right]. \tag{31.32}$$

The first term is just the Larmor diamagnetic susceptibility discussed above. The
second term has a sign opposite to that of the first (since the energies of excited states
necessarily exceed that of the ground state). It therefore favors alignment of the
moment parallel to the field, a behavior known as paramagnetism. This paramagnetic
correction to the Larmor diamagnetic susceptibility is known as Van Vleck
paramagnetism.21 The magnetic behavior of ions with a shell one electron short of being
half  filled  is  determined  by  a  balance  between  Larmor  diamagnetism  and  Van  Vleck 
paramagnetism,  provided  that  only  the  ground  state  is  occupied  with  appreciable 
probability  in  thermal  equilibrium,  so  the  free  energy  is  just  the  ground-state  energy. 
In  many  such  cases,  however,  the  next  lowest  J-multiplet  is  close  enough  to  the  J  =  0 
ground  state  that  its  contribution  to  the  free  energy  (and  hence  to  the  susceptibility) 
is  appreciable,  and  a  more  complicated  formula  than  (31.32)  is  required. 

2.  If  the  shell  does  not  have  J  =  0  (i.e.,  in  all  cases  except  for  closed  shells  and 
shells  one  electron  short  of  being  half  filled),  then  the  first  term  in  the  energy  shift 
(31.20)  will  not  vanish  and,  as  we  indicated  above,  will  almost  always  be  so  much 
larger than the other two that they can safely be ignored. In this case the ground state
is $(2J + 1)$-fold degenerate in zero field, and we are faced with the problem of
evaluating and diagonalizing the $(2J + 1)$-dimensional square matrix22

$$\langle JLSJ_z|(L_z + g_0 S_z)|JLSJ_z'\rangle; \quad J_z, J_z' = -J, \ldots, J. \tag{31.33}$$


20  This  follows  from  the  symmetry  of  states  with  J  =  0,  as  shown  in  Problem  4. 

21  Van  Vleck  paramagnetism  also  arises  in  the  susceptibilities  of  molecules  of  a  more  complex 
structure  than  the  single  ions  we  consider  here. 

22 See the remark in footnote 10.

This task is made simple by a theorem (Wigner-Eckart),23 which states that the matrix
elements of any vector operator in the $(2J + 1)$-dimensional space of eigenstates of
$\mathbf{J}^2$ and $J_z$ with a given value of $J$, are proportional to the matrix elements of $\mathbf{J}$ itself:

$$\langle JLSJ_z|(\mathbf{L} + g_0\mathbf{S})|JLSJ_z'\rangle = g(JLS)\langle JLSJ_z|\mathbf{J}|JLSJ_z'\rangle. \tag{31.34}$$

The significant feature of this result is that the proportionality constant $g(JLS)$ does
not depend on the values of $J_z$ or $J_z'$.

In particular, since the matrix elements of $J_z$ are

$$\langle JLSJ_z|J_z|JLSJ_z'\rangle = J_z \delta_{J_z, J_z'}, \tag{31.35}$$

it follows that

$$\langle JLSJ_z|(L_z + g_0 S_z)|JLSJ_z'\rangle = g(JLS) J_z \delta_{J_z, J_z'}. \tag{31.36}$$

This solves the secular problem: i.e., the matrix is already diagonal in the states of
definite $J_z$, and the $(2J + 1)$-fold degenerate ground state is therefore split into states
with definite values of $J_z$ whose energies are uniformly separated by $g(JLS)\mu_B H$.
The value of $g(JLS)$ (known as the Landé g-factor) is easily computed (Appendix P):

$$g(JLS) = \frac{1}{2}(g_0 + 1) - \frac{1}{2}(g_0 - 1)\frac{L(L + 1) - S(S + 1)}{J(J + 1)}, \tag{31.37}$$

or, taking the electron g-factor $g_0$ to be exactly 2,

$$g(JLS) = \frac{3}{2} + \frac{1}{2}\left[\frac{S(S + 1) - L(L + 1)}{J(J + 1)}\right]. \tag{31.38}$$
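
A short sketch of (31.37) makes its two limiting cases easy to see: a pure-spin multiplet ($L = 0$, $J = S$) has $g = g_0$, and a pure-orbital multiplet ($S = 0$, $J = L$) has $g = 1$. The function name and argument order are choices made for this illustration.

```python
def lande_g(J: float, L: float, S: float, g0: float = 2.0) -> float:
    """Lande g-factor, Eq. (31.37); with g0 = 2 it reduces to Eq. (31.38)."""
    if J == 0:
        raise ValueError("g(JLS) is undefined for J = 0")
    return 0.5 * (g0 + 1.0) - 0.5 * (g0 - 1.0) * (L * (L + 1) - S * (S + 1)) / (J * (J + 1))


print(lande_g(J=2.5, L=0, S=2.5))   # pure spin: g = g0
print(lande_g(J=3.0, L=3, S=0))     # pure orbital: g = 1
print(lande_g(J=2.5, L=3, S=0.5))   # a 2F5/2 multiplet
```

For the $^2F_{5/2}$ example the result is $6/7$, showing that $g$ can fall below 1 when $L$ and $S$ are antiparallel.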

One sometimes encounters the result (31.34), which can be written in the equivalent
form,

$$\langle JLSJ_z|(\mathbf{L} + g_0\mathbf{S})|JLSJ_z'\rangle = \langle JLSJ_z|g(JLS)\mathbf{J}|JLSJ_z'\rangle, \tag{31.39}$$

written without the surrounding state vectors:

$$\mathbf{L} + g_0\mathbf{S} = g(JLS)\mathbf{J}. \tag{31.40}$$

We stress that this relation is valid only within the $(2J + 1)$-dimensional set of states
that make up the degenerate atomic ground state in zero field; i.e., (31.40) is obeyed
only for matrix elements taken between states that are diagonal in $J$, $L$, and $S$. If
the splitting between the zero-field atomic ground-state multiplet and the first excited
multiplet is large compared with $k_B T$ (as is frequently the case), then only the $(2J + 1)$
states in the ground-state multiplet will contribute appreciably to the free energy. In
that case (and only in that case) Eq. (31.40) permits one to interpret the first term in
the energy shift (31.20) as expressing the interaction $(-\boldsymbol{\mu} \cdot \mathbf{H})$ of the field with a
magnetic moment that is proportional to the total angular momentum of the ion,24

$$\boldsymbol{\mu} = -g(JLS)\mu_B \mathbf{J}. \tag{31.41}$$


23  For  a  proof  see,  for  example,  K.  Gottfried,  Quantum  Mechanics,  vol.  1,  W.  A.  Benjamin,  Menlo 
Park,  Calif,  1966,  pp.  302-304. 

24 Within the ground-state multiplet the energy of the ion in a field H is then given by the operator
$-\boldsymbol{\mu} \cdot \mathbf{H}$. This is a very simple example of a "spin Hamiltonian" (see pages 679-681).

Because the zero-field ground state is degenerate, it is never permissible to calculate
the  susceptibility  by  equating  the  free  energy  to  the  ground-state  energy  (as  we  did 
in  the  case  of  the  nondegenerate  shells  with  J  =  0),  for  as  the  field  goes  to  zero,  the 
splitting  of  the  (2J  +  1)  lowest-lying  states  will  be  small  compared  with  kBT.  To 
obtain  the  susceptibility  we  must  therefore  do  an  additional  statistical  mechanical 
calculation. 


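Magnetization of a Set of Identical Ions of Angular
Momentum J: Curie's Law


If only the lowest $2J + 1$ states are thermally excited with appreciable probability,
then the free energy (31.5) is given by:

$$e^{-\beta F} = \sum_{J_z = -J}^{J} e^{-\beta\gamma H J_z}, \quad \gamma = g(JLS)\mu_B, \quad \beta = \frac{1}{k_B T}. \tag{31.42}$$

This geometric series is easily summed to give:

$$e^{-\beta F} = \frac{e^{\beta\gamma H(J + 1/2)} - e^{-\beta\gamma H(J + 1/2)}}{e^{\beta\gamma H/2} - e^{-\beta\gamma H/2}}. \tag{31.43}$$

The expression (31.4) for the magnetization of N such ions in a volume V then gives

$$M = -\frac{N}{V}\frac{\partial F}{\partial H} = \frac{N}{V}\gamma J B_J(\beta\gamma J H), \tag{31.44}$$

where the Brillouin function $B_J(x)$ is defined by

$$B_J(x) = \frac{2J + 1}{2J}\coth\left(\frac{2J + 1}{2J}x\right) - \frac{1}{2J}\coth\left(\frac{1}{2J}x\right). \tag{31.45}$$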


This  is  plotted  in  Figure  31.1  for  several  values  of  J. 
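
The step from the partition sum (31.42) to the Brillouin form (31.44) can be checked numerically. In the sketch below, `mean_minus_jz` evaluates the thermal average $\langle -J_z\rangle$ directly from the Boltzmann weights, which by (31.44) should equal $J B_J(\beta\gamma J H)$. The function names and the dimensionless variable `y` (standing for $\beta\gamma H$) are conventions introduced here for illustration.

```python
import math


def coth(x: float) -> float:
    return math.cosh(x) / math.sinh(x)


def brillouin(J: float, x: float) -> float:
    """Brillouin function B_J(x), Eq. (31.45)."""
    if x == 0.0:
        return 0.0
    a = (2.0 * J + 1.0) / (2.0 * J)
    b = 1.0 / (2.0 * J)
    return a * coth(a * x) - b * coth(b * x)


def mean_minus_jz(J: float, y: float) -> float:
    """Thermal average of -J_z with weights exp(-y * J_z), y = beta * gamma * H."""
    n_states = int(round(2 * J)) + 1
    jz_values = [-J + k for k in range(n_states)]
    weights = [math.exp(-y * jz) for jz in jz_values]
    return -sum(jz * w for jz, w in zip(jz_values, weights)) / sum(weights)


J, y = 1.5, 0.7
print(mean_minus_jz(J, y), J * brillouin(J, J * y))       # direct sum vs. Eq. (31.44)
print(brillouin(J, 1e-3), (J + 1) / (3 * J) * 1e-3)       # small-x slope (J + 1)/(3J)
print(brillouin(J, 50.0))                                 # saturation, B_J -> 1
```

For $J = 1/2$ the function reduces to $\tanh x$, which gives a quick sanity check on the definition.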

Note that as $T \to 0$ for fixed H, $M \to (N/V)\gamma J$; i.e., each ion is completely aligned
by the field, $|J_z|$ having its maximum (or "saturation") value $J$. However, this case
arises only when $k_B T \ll \gamma H$; since $\gamma H/k_B \sim \hbar\omega_c/k_B \sim 1$ K in a field of $10^4$ gauss,
one normally encounters the opposite limit except at the lowest temperatures and
highest fields.

Figure 31.1
Plot of the Brillouin function $B_J(x)$ for various values of the spin J.

When $\gamma H \ll k_B T$ the small-x expansion,

$$\coth x \approx \frac{1}{x} + \frac{1}{3}x + O(x^3), \quad B_J(x) \approx \frac{J + 1}{3J}x + O(x^3), \tag{31.46}$$

gives

$$\chi = \frac{N}{V}\frac{(g\mu_B)^2}{3}\frac{J(J + 1)}{k_B T}, \quad (k_B T \gg g\mu_B H), \tag{31.47}$$

$$\chi^{\text{molar}} = N_A \frac{(g\mu_B)^2}{3}\frac{J(J + 1)}{k_B T}. \tag{31.48}$$

This variation of the susceptibility inversely with the temperature is known as Curie's
law. It characterizes paramagnetic systems with "permanent moments" whose alignment
is favored by the field, and opposed by thermal disorder. Although the condition
$k_B T \gg g\mu_B H$ for the validity of Curie's law is satisfied for an enormous range of
fields and temperatures, it is important to remember that the "law" is subject to
this restriction.25

The paramagnetic susceptibility (31.47) is larger than the temperature-independent
Larmor diamagnetic susceptibility (31.25) by a factor of order 500 at room temperature
(Problem 7) and, therefore, when an ion is present that has a partially filled shell
with nonzero $J$, the contribution of the shell to the total susceptibility of the solid
completely dominates the diamagnetic contribution from the other (filled) shells.
From our estimate that diamagnetic susceptibilities are of order $10^{-5}$ (see page 649)
we conclude that room temperature paramagnetic susceptibilities should be of order
$10^{-2}$ to $10^{-3}$.
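
The factor of order 500 can be made plausible with one line of algebra. Dividing the Curie susceptibility (31.47) by the magnitude of the Larmor susceptibility (31.25), written with the mean square radius of (31.26), and using $\mu_B = e\hbar/2mc$ and $a_0 = \hbar^2/me^2$, gives

$$\frac{\chi_{\text{Curie}}}{|\chi_{\text{Larmor}}|} = \frac{g^2 J(J + 1)}{Z_i \langle (r/a_0)^2\rangle} \cdot \frac{\hbar^2/2ma_0^2}{k_B T},$$

where $\hbar^2/2ma_0^2 = e^2/2a_0$ is about 13.6 eV. The sketch below evaluates this ratio; it assumes the same ion density in both expressions, and the constant values and function name are supplied here for illustration.

```python
# Ratio of Curie (31.47) to Larmor (31.25) susceptibility magnitudes, per ion.
RYDBERG_EV = 13.6          # hbar^2 / (2 m a_0^2) = e^2 / (2 a_0), in eV
K_B_EV_PER_K = 8.617e-5    # Boltzmann constant in eV/K


def curie_to_larmor_ratio(g: float, J: float, z_electrons: int,
                          mean_sq_radius_a0: float, temperature_k: float) -> float:
    """Magnitude of chi_Curie / chi_Larmor for equal ion densities."""
    atomic_factor = g * g * J * (J + 1) / (z_electrons * mean_sq_radius_a0)
    return atomic_factor * RYDBERG_EV / (K_B_EV_PER_K * temperature_k)


# With the atomic factor set to 1 (g = 1, J(J+1) = Z_i <(r/a_0)^2>), only the
# energy ratio (13.6 eV)/(k_B T) remains.
print(curie_to_larmor_ratio(g=1.0, J=1.0, z_electrons=2,
                            mean_sq_radius_a0=1.0, temperature_k=300.0))
```

At 300 K the energy ratio alone is about 530, so the overall factor is of order 500 whenever the atomic factor is of order unity.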


Curie’s  Law  in  Solids 


We now examine the extent to which the above theory of the paramagnetism of free
ions continues to describe the behavior of the ions when they are part of the structure
of a solid.

Insulating crystals containing rare earth ions (which have partially filled electronic
f-shells) are found to obey Curie's law quite well. One frequently writes the law in
the form:

$$\chi = \frac{1}{3}\frac{N}{V}\frac{\mu_B^2 p^2}{k_B T}, \tag{31.49}$$

where $p$, the "effective Bohr magneton number," is given by

$$p = g(JLS)[J(J + 1)]^{1/2}. \tag{31.50}$$

25 On the other hand the law does hold at very high temperatures even when there are appreciable
magnetic interactions among the ions. See Eq. (33.50).

In Table 31.3 the value of $p$ determined by the coefficient of 1/T in the measured
susceptibility, is compared with that given by (31.50) and the Landé g-factor (31.38).

Table 31.3

CALCULATED AND MEASURED EFFECTIVE MAGNETON NUMBERS p FOR
RARE EARTH IONS^a

ELEMENT            BASIC ELECTRON    GROUND-STATE
(triply ionized)   CONFIGURATION     TERM              CALCULATED^b p    MEASURED^c p

La                 $4f^{0}$          $^1S$              0.00             diamagnetic
Ce                 $4f^{1}$          $^2F_{5/2}$        2.54             2.4
Pr                 $4f^{2}$          $^3H_4$            3.58             3.5
Nd                 $4f^{3}$          $^4I_{9/2}$        3.62             3.5
Pm                 $4f^{4}$          $^5I_4$            2.68             -
Sm                 $4f^{5}$          $^6H_{5/2}$        0.84             1.5
Eu                 $4f^{6}$          $^7F_0$            0.00             3.4
Gd                 $4f^{7}$          $^8S_{7/2}$        7.94             8.0
Tb                 $4f^{8}$          $^7F_6$            9.72             9.5
Dy                 $4f^{9}$          $^6H_{15/2}$      10.63            10.6
Ho                 $4f^{10}$         $^5I_8$           10.60            10.4
Er                 $4f^{11}$         $^4I_{15/2}$       9.59             9.5
Tm                 $4f^{12}$         $^3H_6$            7.57             7.3
Yb                 $4f^{13}$         $^2F_{7/2}$        4.54             4.5
Lu                 $4f^{14}$         $^1S$              0.00             diamagnetic

a Note the discrepancy in Sm and Eu having its origin in low-lying J-multiplets assumed absent
in the theory.
b Equation (31.50).
c Equation (31.49).

Source: J. H. Van Vleck, The Theory of Electric and Magnetic Susceptibilities, Oxford, 1952, p. 243;
see also R. Kubo and T. Nagamiya, eds., Solid State Physics, McGraw-Hill, New York, 1969, p. 451.
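
The calculated column of Table 31.3 can be regenerated from (31.38) and (31.50). In the sketch below the quantum numbers $(S, L, J)$ are read off the ground-state terms in the table, and the tabulated $p$ is carried along for comparison; the dictionary layout and function names are choices made for this illustration. The closed-shell ions La and Lu ($J = 0$) are omitted.

```python
import math


def lande_g(J: float, L: float, S: float) -> float:
    """Lande g-factor with g0 = 2, Eq. (31.38)."""
    return 1.5 + 0.5 * (S * (S + 1) - L * (L + 1)) / (J * (J + 1))


def effective_magneton_number(J: float, L: float, S: float) -> float:
    """p = g(JLS) [J(J+1)]^(1/2), Eq. (31.50); taken as zero when J = 0."""
    if J == 0:
        return 0.0
    return lande_g(J, L, S) * math.sqrt(J * (J + 1))


# ion: (S, L, J, calculated p as tabulated in Table 31.3)
RARE_EARTHS = {
    "Ce": (0.5, 3, 2.5, 2.54), "Pr": (1.0, 5, 4.0, 3.58), "Nd": (1.5, 6, 4.5, 3.62),
    "Pm": (2.0, 6, 4.0, 2.68), "Sm": (2.5, 5, 2.5, 0.84), "Eu": (3.0, 3, 0.0, 0.00),
    "Gd": (3.5, 0, 3.5, 7.94), "Tb": (3.0, 3, 6.0, 9.72), "Dy": (2.5, 5, 7.5, 10.63),
    "Ho": (2.0, 6, 8.0, 10.60), "Er": (1.5, 6, 7.5, 9.59), "Tm": (1.0, 5, 6.0, 7.57),
    "Yb": (0.5, 3, 3.5, 4.54),
}

computed = {ion: effective_magneton_number(J, L, S)
            for ion, (S, L, J, _) in RARE_EARTHS.items()}
for ion, (S, L, J, tabulated) in RARE_EARTHS.items():
    print(f"{ion}: p = {computed[ion]:.2f} (table: {tabulated:.2f})")
```

The computed values agree with the tabulated ones to within about 0.02; the small residual differences are in the last quoted digit.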


The agreement is excellent, except for samarium and europium. In the latter case we
have $J = 0$, and our analysis is clearly not applicable. In both cases, however, the
discrepancy has been accounted for by recognizing that the J-multiplet lying just
above the ground state is so close in energy that (a) the energy denominators in the
second term in the energy (31.20) (neglected in the derivation of Curie's law) are small
enough for it to be important, and (b) the probability of some ions being thermally
excited out of the state(s) of lowest $J$ (also neglected in deriving Curie's law) can be
appreciable.

Thus  in  all  cases  the  magnetism  of  rare  earth  ions  in  an  insulating  solid  is  well 
described  by  treating  them  as  isolated  ions.  This  is  not  the  case,  however,  for  transition 
metal  ions  in  an  insulating  solid.  Indeed,  for  transition  metal  ions  from  the  iron  group 
one  finds  that  although  Curie’s  law  is  obeyed,  the  value  of  p  determined  from  it  is 
given  by  (31.50)  only  if  one  assumes  that  although  S  is  still  given  by  Hund’s  rules, 
$L$ is zero and hence $J$ is equal to $S$. (See Table 31.4.) This phenomenon is known as
the quenching of the orbital angular momentum, and is a particular example of a
general phenomenon known as crystal field splitting.

Table 31.4

CALCULATED AND MEASURED EFFECTIVE MAGNETON NUMBERS p FOR
THE IRON (3d) GROUP IONS^a

ELEMENT            BASIC ELECTRON    GROUND-STATE      CALCULATED^b p
(and ionization)   CONFIGURATION     TERM              (J = S)    (J = |L ± S|)    MEASURED^c p

Ti3+               $3d^{1}$          $^2D_{3/2}$       1.73       1.55             -
V4+                $3d^{1}$          $^2D_{3/2}$       1.73       1.55             1.8
V3+                $3d^{2}$          $^3F_2$           2.83       1.63             2.8
V2+                $3d^{3}$          $^4F_{3/2}$       3.87       0.77             3.8
Cr3+               $3d^{3}$          $^4F_{3/2}$       3.87       0.77             3.7
Mn4+               $3d^{3}$          $^4F_{3/2}$       3.87       0.77             4.0
Cr2+               $3d^{4}$          $^5D_0$           4.90       0                4.8
Mn3+               $3d^{4}$          $^5D_0$           4.90       0                5.0
Mn2+               $3d^{5}$          $^6S_{5/2}$       5.92       5.92             5.9
Fe3+               $3d^{5}$          $^6S_{5/2}$       5.92       5.92             5.9
Fe2+               $3d^{6}$          $^5D_4$           4.90       6.70             5.4
Co2+               $3d^{7}$          $^4F_{9/2}$       3.87       6.54             4.8
Ni2+               $3d^{8}$          $^3F_4$           2.83       5.59             3.2
Cu2+               $3d^{9}$          $^2D_{5/2}$       1.73       3.55             1.9

a Because of quenching, much better theoretical values are obtained by taking $J$ equal to $S$, the
total spin, than by taking the value $J = |L \pm S|$ appropriate to the free ion.
b Equation (31.50). In the case $J = S$, one takes $L = 0$.
c Equation (31.49).

Source: J. H. Van Vleck, The Theory of Electric and Magnetic Susceptibilities, Oxford, 1952, p. 285;
R. Kubo and T. Nagamiya, eds., Solid State Physics, McGraw-Hill, New York, 1969, p. 453.
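
With $L = 0$ and $J = S$ the Landé factor (31.38) is 2, so (31.50) reduces to the spin-only value $p = 2[S(S + 1)]^{1/2}$. The sketch below regenerates the $J = S$ column of Table 31.4 and compares both calculated columns with the measured values; the free-ion and measured numbers are copied from the table, and the data layout is a choice made for this illustration. Ti3+ is left out because no measured value is listed.

```python
import math


def spin_only_p(S: float) -> float:
    """Effective magneton number for quenched orbital momentum: g = 2, J = S."""
    return 2.0 * math.sqrt(S * (S + 1))


# ion: (S, free-ion p with J = |L ± S| from Table 31.4, measured p from Table 31.4)
IRON_GROUP = {
    "V4+": (0.5, 1.55, 1.8), "V3+": (1.0, 1.63, 2.8), "V2+": (1.5, 0.77, 3.8),
    "Cr3+": (1.5, 0.77, 3.7), "Mn4+": (1.5, 0.77, 4.0), "Cr2+": (2.0, 0.0, 4.8),
    "Mn3+": (2.0, 0.0, 5.0), "Mn2+": (2.5, 5.92, 5.9), "Fe3+": (2.5, 5.92, 5.9),
    "Fe2+": (2.0, 6.70, 5.4), "Co2+": (1.5, 6.54, 4.8), "Ni2+": (1.0, 5.59, 3.2),
    "Cu2+": (0.5, 3.55, 1.9),
}

for ion, (S, p_free, p_measured) in IRON_GROUP.items():
    p_spin = spin_only_p(S)
    print(f"{ion:5s} spin-only {p_spin:.2f}  free-ion {p_free:.2f}  measured {p_measured:.1f}")
```

For every ion listed, the spin-only value lies at least as close to the measured $p$ as the free-ion value does, which is the content of footnote a of the table.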


Crystal field splitting is unimportant for rare earth ions, because their partially
filled 4f shells lie deep inside the ion (beneath filled 5s and 5p shells). In contrast
to  this,  the  partially  filled  d-shells  of  transition  metal  ions  are  the  outermost  electronic 
shells,  and  are  therefore  far  more  strongly  influenced  by  their  crystalline  environment. 
The  electrons  in  the  partially  filled  d-shells  are  subject  to  nonnegligible  electric  fields 
that  do  not  have  spherical  symmetry,  but  only  the  symmetry  of  the  crystalline  site  at 
which  the  ion  is  located.  As  a  result,  the  basis  for  Hund’s  rules  is  partially  invalidated. 

As  it  turns  out,  the  first  two  of  Hund’s  rules  can  be  retained,  even  in  the  crystalline 
environment.  The  crystal  field  must,  however,  be  introduced  as  a  perturbation  on 
the $(2S + 1)(2L + 1)$-fold set of states determined by the first two rules. This
perturbation acts in addition to the spin-orbit coupling. Therefore Hund's third rule
(which  resulted  from  the  action  of  the  spin-orbit  coupling  alone)  must  be  modified. 

In the case of the transition metal ions from the iron group (partially filled 3d
shells) the crystal field is very much larger than the spin-orbit coupling, so that to
a first approximation a new version of Hund's third rule can be constructed in which
the perturbation of spin-orbit coupling is ignored altogether, in favor of the crystal
field perturbation. This latter perturbation will not split the spin degeneracy, since
it depends only on spatial variables and therefore commutes with S, but it can
completely lift the degeneracy of the orbital L-multiplet, if it is sufficiently asymmetric.
The result will then be a ground-state multiplet in which the mean value of every
component of L vanishes (even though $\mathbf{L}^2$ still has the mean value $L(L + 1)$). One
can interpret this classically as arising from a precession of the orbital angular
momentum in the crystal field, so that although its magnitude is unchanged, all its
components average to zero.26

The  situation  for  the  higher  transition  metal  series  (partially  filled  4d  or  5d  shells) 
is  more  complex,  since  in  the  heavier  elements  the  spin-orbit  coupling  is  stronger. 
The multiplet splitting due to spin-orbit coupling may be comparable to (or greater
than)  the  crystal  field  splitting.  In  general  cases  like  these,  considerations  of  how  the 
crystal  fields  can  rearrange  the  levels  into  structures  different  from  those  implied  by 
Hund’s  third  rule,  are  based  on  fairly  subtle  applications  of  group  theory.  We  shall 
not  explore  them  here,  but  mention  two  important  principles  that  come  into  play: 


1. The less symmetric the crystal field, the lower the degeneracy one expects the
exact ionic ground state to have. There is, however, an important theorem (due
to Kramers) asserting that no matter how unsymmetric the crystal field, an ion
possessing an odd number of electrons must have a ground state that is at least
doubly degenerate, even in the presence of crystal fields and spin-orbit
interactions.

2. One might expect that the crystal field would often have such high symmetry (as
at sites of cubic symmetry) that it would produce less than the maximum lifting
of degeneracy allowed by the theorem of Kramers. However, another theorem,
due to Jahn and Teller, asserts that if a magnetic ion is at a crystal site of such
high symmetry that its ground-state degeneracy is not the Kramers minimum,
then it will be energetically favorable for the crystal to distort (e.g., for the
equilibrium position of the ion to be displaced) in such a way as to lower the symmetry
enough to remove the degeneracy. Whether this lifting of degeneracy is large
enough to be important (i.e., comparable to $k_B T$ or to the splitting in applied
magnetic fields) is not guaranteed by the theorem. If it is not large enough, the
Jahn-Teller effect will not be observable.


THERMAL  PROPERTIES  OF  PARAMAGNETIC  INSULATORS: 
ADIABATIC  DEMAGNETIZATION 

Since  the  Helmholtz  free  energy  is  F  =  U  -  TS,  where  U  is  the  internal  energy,  the 
magnetic entropy $S(H, T)$ is given by

$$S = k_B \beta^2 \frac{\partial F}{\partial \beta} \tag{31.51}$$


* If one adds the spin-orbit coupling to the Hamiltonian, as an additional perturbation on the
crystal field, even the remaining $(2S + 1)$-fold degeneracy of the ground state will be split. However this
additional splitting may well be small compared with both $k_B T$ and the splitting in an applied magnetic
field, in which case it can be ignored. Evidently this is the case for the transition metal ions from the
iron group.
(since $U = (\partial/\partial\beta)(\beta F)$). The expression (31.42) for the free energy of a set of
noninteracting paramagnetic ions reveals that $\beta F$ depends on $\beta$ and $H$ only through
their product; i.e., $F$ has the form

$$F = \frac{1}{\beta}\,\Phi(\beta H). \tag{31.52}$$

Consequently  the  entropy  has  the  form 

$$S = k_B\left[-\Phi(\beta H) + \beta H\,\Phi'(\beta H)\right], \tag{31.53}$$

which depends only on the product $\beta H = H/k_B T$. As a result, by adiabatically (i.e.,
at fixed $S$) lowering the field acting on a spin system (slowly enough so that thermal
equilibrium is always maintained) we will lower the temperature of the spin system
by a proportionate amount, for if $S$ is unchanged then $H/T$ cannot change, and
therefore

$$T_{\text{final}} = T_{\text{initial}}\left(\frac{H_{\text{final}}}{H_{\text{initial}}}\right). \tag{31.54}$$


This  can  be  used  as  a  practical  method  for  achieving  low  temperatures  only  in  a 
temperature  range  where  the  specific  heat  of  the  spin  system  is  the  dominant  con¬ 
tribution  to  the  specific  heat  of  the  entire  solid.  In  practice  this  restricts  one  to  tem¬ 
peratures  far  below  the  Debye  temperature  (see  Problem  10),  and  the  technique  has 
proved  useful  for  cooling  from  a  few  degrees  Kelvin  down  to  a  few  hundredths  (or,  if 
one  is  skillful,  thousandths)  of  a  degree. 
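
The scaling argument above can be checked numerically. The following is a minimal sketch, assuming independent spin-$J$ ions with Zeeman levels $g\mu_B H J_z$ (the model behind (31.42)); the function names, the choice $J = 1/2$, $g = 2$, and the CGS constants are illustrative assumptions, not part of the text. It evaluates the entropy per ion from the partition function and confirms that it is unchanged when $H$ and $T$ are scaled together.

```python
import math

MU_B = 9.274e-21   # Bohr magneton in erg/gauss (CGS); rounded value
K_B = 1.381e-16    # Boltzmann constant in erg/K; rounded value

def entropy_per_ion(H, T, J=0.5, g=2.0):
    """Entropy per ion, in units of k_B, for independent spin-J moments.

    Uses S/k_B = ln Z + <beta E>, with beta*E_m = x*m and x = g*mu_B*H/(k_B*T),
    so the result can depend on H and T only through H/T.
    """
    x = g * MU_B * H / (K_B * T)
    n_levels = int(round(2 * J)) + 1
    ms = [-J + i for i in range(n_levels)]        # J_z = -J, ..., +J
    weights = [math.exp(-x * m) for m in ms]      # Boltzmann factors
    Z = sum(weights)
    mean_beta_E = sum(x * m * w for m, w in zip(ms, weights)) / Z
    return math.log(Z) + mean_beta_E

def final_temperature(T_initial, H_initial, H_final):
    """Ideal adiabatic demagnetization: H/T is conserved at fixed entropy."""
    return T_initial * H_final / H_initial

if __name__ == "__main__":
    s_start = entropy_per_ion(1.0e4, 1.0)           # 10^4 gauss at 1 K
    T_f = final_temperature(1.0, 1.0e4, 1.0e2)      # field lowered to 10^2 gauss
    s_end = entropy_per_ion(1.0e2, T_f)
    print(f"T_final = {T_f} K, S/k_B before = {s_start:.6f}, after = {s_end:.6f}")
```

In zero field the function returns $\ln(2J + 1)$ at every temperature, which is exactly the temperature-independent zero-field entropy that the ideal model predicts and that real materials cannot have.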

The  limit  on  the  temperatures  one  can  reach  by  adiabatic  demagnetization  is  set 
by  the  limits  on  the  validity  of  the  conclusion  that  the  entropy  depends  only  on  H/T. 
If  this  were  rigorously  correct  one  could  cool  all  the  way  to  zero  temperature  by 
completely  removing  the  field.  But  this  assumption  must  fail  at  small  fields,  for 
otherwise  the  zero-field  entropy  would  not  depend  on  temperature.  In  reality  the 
entropy  in  zero  field  must  depend  on  temperature,  so  that  the  entropy  density  can 
drop  to  zero  with  decreasing  temperature,  as  required  by  the  third  law  of  thermo¬ 
dynamics.  The  temperature  dependence  of  the  zero-field  entropy  is  brought  about 
by  the  existence  of  magnetic  interactions  between  the  paramagnetic  ions,  the  increased 
importance  of  crystal  field  splitting  at  low  temperatures,  and  other  such  effects  that 
are  left  out  of  the  analysis  leading  to  (31.53).  When  these  are  taken  into  account, 
the  result  (31.54)  for  the  final  temperature  must  be  replaced  by  the  general  result 
$S(H_{\text{initial}}, T_{\text{initial}}) = S(0, T_{\text{final}})$, and one must have a detailed knowledge of the
temperature dependence of the zero-field entropy, to compute the final temperature (see
Figure  31.2). 

Evidently the most effective materials are those where the inevitable decline with
temperature of the zero-field entropy sets in at the lowest possible temperature.
One therefore uses paramagnetic salts with well-sheltered (to minimize crystal field
splitting), well-separated (to minimize magnetic interactions) magnetic ions.
Countering this, of course, is the lower magnetic specific heat resulting from a lower
density of magnetic ions. The most popular substances used at present are of the type
Ce2Mg3(NO3)12 · (H2O)24.
Figure 31.2

Plots of the entropy of a system of interacting spins for various values of external
magnetic field, $H$. (The dashed line represents the constant $N k_B \ln(2J + 1)$ for
independent spins in zero field.) The cooling cycle is this: Starting at A ($T_i$, $H = 0$),
we proceed isothermally to B, raising the field in the process from zero to $H_i$. The
next step is to remove the field adiabatically (constant $S$), thereby moving to C and
achieving a temperature $T_f$.


SUSCEPTIBILITY  OF  METALS:  PAULI  PARAMAGNETISM 

None  of  the  above  discussions  bear  on  the  problem  of  the  contribution  of  conduction 
electrons  to  the  magnetic  moment  of  a  metal.  The  conduction  electrons  are  not 
spatially  localized  like  electrons  in  partially  filled  ionic  shells,  nor,  because  of  the 
stringent  constraints  of  the  exclusion  principle,  do  they  respond  independently  like 
electrons  localized  on  different  ions. 

However,  within  the  independent  electron  approximation  the  problem  of  con¬ 
duction  electron  magnetism  can  be  solved.  The  solution  is  quite  complicated,  owing 
to  the  intricate  way  in  which  the  electron  orbital  motion  responds  to  the  field. 
If  we  neglect  the  orbital  response  (i.e.,  consider  the  electron  to  have  only  a  spin 
magnetic  moment,  but  no  charge),  then  we  may  proceed  as  follows: 

Each electron will contribute $-\mu_B/V$ (taking $g_0 = 2$) to the magnetization density
if its spin is parallel to the field $H$, and $\mu_B/V$, if antiparallel. Hence if $n_\pm$ is the number
of electrons per unit volume with spin parallel ($+$) or antiparallel ($-$) to $H$, the
magnetization density will be

$$M = -\mu_B (n_+ - n_-). \tag{31.55}$$

If the electrons interact with the field only through their magnetic moments, then
the only effect of the field is to shift the energy of each electronic level by $\pm\mu_B H$,
according to whether the spin is parallel ($+$) or antiparallel ($-$) to $H$. We can express
this simply in terms of the density of levels for a given spin. Let $g_\pm(\mathcal{E})\,d\mathcal{E}$ be the number
of electrons of the specified spin per unit volume in the energy range $\mathcal{E}$ to $\mathcal{E} + d\mathcal{E}$.27
In the absence of the field we would have

$$g_\pm(\mathcal{E}) = \tfrac{1}{2} g(\mathcal{E}) \quad (H = 0), \tag{31.56}$$


27 To avoid confusing the density of levels with the g-factor, we shall always make the energy argument
of the level density explicit. A subscript distinguishes the Bohr magneton $\mu_B$ from the chemical potential $\mu$.




where $g(\mathcal{E})$ is the ordinary density of levels. Since the energy of each electronic level
with spin parallel to the field is shifted up from its zero field value by $\mu_B H$, the number
of levels with energy $\mathcal{E}$ in the presence of $H$ is the same as the number with energy
$\mathcal{E} - \mu_B H$ in the absence of $H$:

$$g_+(\mathcal{E}) = \tfrac{1}{2} g(\mathcal{E} - \mu_B H). \tag{31.57}$$

Similarly,

$$g_-(\mathcal{E}) = \tfrac{1}{2} g(\mathcal{E} + \mu_B H). \tag{31.58}$$

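The number of electrons per unit volume of each spin species is given by

$$n_\pm = \int d\mathcal{E}\, g_\pm(\mathcal{E}) f(\mathcal{E}), \tag{31.59}$$

where $f$ is the Fermi function

$$f(\mathcal{E}) = \frac{1}{e^{\beta(\mathcal{E} - \mu)} + 1}. \tag{31.60}$$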


The chemical potential $\mu$ is determined by noting that the total electronic density is
given by

$$n = n_+ + n_-. \tag{31.61}$$

Eliminating $\mu$ through this relation we can use (31.59) and (31.55) to find the
magnetization density as a function of the electronic density $n$. In the nondegenerate case
($f \ll 1$) this leads back to our earlier theory of paramagnetism, giving precisely
(31.44) with $J = 1/2$. (See Problem 8.)

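However, in metals one is very much in the degenerate case. The important
variation in the density of levels $g(\mathcal{E})$ is on the scale of $\mathcal{E}_F$, and since $\mu_B H$ is only of
order $10^{-4}\mathcal{E}_F$ even at $10^4$ gauss, we can, with negligible error, expand the density
of levels:

$$g_\pm(\mathcal{E}) = \tfrac{1}{2} g(\mathcal{E} \mp \mu_B H) = \tfrac{1}{2} g(\mathcal{E}) \mp \tfrac{1}{2}\mu_B H\, g'(\mathcal{E}). \tag{31.62}$$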

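In conjunction with (31.59) this gives

$$n_\pm = \tfrac{1}{2}\int g(\mathcal{E}) f(\mathcal{E})\, d\mathcal{E} \mp \tfrac{1}{2}\mu_B H \int d\mathcal{E}\, g'(\mathcal{E}) f(\mathcal{E}), \tag{31.63}$$

so that, from (31.61),

$$n = \int g(\mathcal{E}) f(\mathcal{E})\, d\mathcal{E}. \tag{31.64}$$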


This is precisely the formula for the electronic density in the absence of the field, and
thus the chemical potential $\mu$ can be taken to have its zero field value, Eq. (2.77):

$$\mu = \mathcal{E}_F\left[1 + O\!\left(\left(\frac{k_B T}{\mathcal{E}_F}\right)^2\right)\right]. \tag{31.65}$$

In conjunction with Eq. (31.55), Eq. (31.63) gives a magnetization density

$$M = \mu_B^2 H \int g'(\mathcal{E}) f(\mathcal{E})\, d\mathcal{E}, \tag{31.66}$$
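or, integrating by parts,

$$M = \mu_B^2 H \int g(\mathcal{E}) \left(-\frac{\partial f}{\partial \mathcal{E}}\right) d\mathcal{E}. \tag{31.67}$$

At zero temperature, $-\partial f/\partial \mathcal{E} = \delta(\mathcal{E} - \mathcal{E}_F)$, so that

$$M = \mu_B^2 H g(\mathcal{E}_F). \tag{31.68}$$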

Since (see Chapter 2) the $T \neq 0$ corrections to $-\partial f/\partial \mathcal{E}$ are of order $(k_B T/\mathcal{E}_F)^2$, Eq.
(31.68) is also valid at all but the very highest temperatures ($T \approx 10^4$ K).

It follows from (31.68) that the susceptibility is

$$\chi = \mu_B^2 g(\mathcal{E}_F). \tag{31.69}$$
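
The chain from (31.55) and (31.59) to (31.69) can be checked by brute-force integration. The sketch below is illustrative only: it assumes a free-electron density of levels $g(\mathcal{E}) = (3n/2\mathcal{E}_F)(\mathcal{E}/\mathcal{E}_F)^{1/2}$, works in units where $n = \mathcal{E}_F = \mu_B = 1$, holds the chemical potential at $\mathcal{E}_F$, and uses a Zeeman shift $\mu_B H = 10^{-3}\mathcal{E}_F$ (larger than the realistic $10^{-4}\mathcal{E}_F$, purely to keep rounding noise small). The function names are hypothetical.

```python
import math

def fermi(e, mu, kT):
    """Fermi function f(e) = 1 / (exp((e - mu)/kT) + 1)."""
    return 1.0 / (math.exp((e - mu) / kT) + 1.0)

def g_free(e, n=1.0, eF=1.0):
    """Free-electron density of levels (both spins): (3n / 2eF) * sqrt(e / eF)."""
    return 1.5 * n / eF * math.sqrt(e / eF) if e > 0 else 0.0

def magnetization_over_muB(h, mu=1.0, kT=0.01, steps=100_000):
    """Return M / mu_B = -(n_plus - n_minus) for a Zeeman shift h = mu_B * H.

    With u = e -/+ h, Eq. (31.59) becomes n_pm = (1/2) * integral of g(u) f(u +/- h) du,
    evaluated here with the midpoint rule on [0, mu + 40 kT].
    """
    upper = mu + 40.0 * kT
    du = upper / steps
    total = 0.0
    for i in range(steps):
        u = (i + 0.5) * du
        total += 0.5 * g_free(u) * (fermi(u + h, mu, kT) - fermi(u - h, mu, kT))
    return -total * du

if __name__ == "__main__":
    h = 1.0e-3
    chi_numeric = magnetization_over_muB(h) / h   # susceptibility in units of mu_B^2
    chi_pauli = g_free(1.0)                       # Eq. (31.69) in the same units
    print(f"numerical chi = {chi_numeric:.5f}, mu_B^2 g(E_F) = {chi_pauli:.5f}")
```

At $k_B T = 0.01\,\mathcal{E}_F$ the two numbers agree to a few parts in $10^5$, consistent with the statement that the thermal corrections are of order $(k_B T/\mathcal{E}_F)^2$.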


This is known as the Pauli paramagnetic susceptibility. In contrast to the susceptibility
of paramagnetic ions given by Curie's law, the Pauli susceptibility of conduction
electrons is essentially independent of temperature. In the free electron case the density
of levels has the form $g(\mathcal{E}_F) = m k_F/\hbar^2\pi^2$, and the Pauli susceptibility takes on the
simple form

$$\chi_{\text{Pauli}} = \left(\frac{\alpha}{2\pi}\right)^2 (a_0 k_F), \tag{31.70}$$

where $\alpha = e^2/\hbar c = 1/137$. An alternative form is

$$\chi_{\text{Pauli}} = \left(\frac{2.59}{r_s/a_0}\right) \times 10^{-6}. \tag{31.71}$$
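
As a quick numerical check of (31.70) and (31.71), the sketch below evaluates the free electron Pauli susceptibility for the alkali metals. It assumes the free electron relation $k_F a_0 = (9\pi/4)^{1/3}/(r_s/a_0)$ from Chapter 2 and the value $\alpha = 1/137.036$; the $r_s/a_0$ values are those listed in Table 31.5, and the function name is hypothetical.

```python
import math

ALPHA = 1.0 / 137.036   # fine structure constant (the text rounds this to 1/137)

# r_s / a_0 for the alkali metals, as listed in Table 31.5
RS_OVER_A0 = {"Li": 3.25, "Na": 3.93, "K": 4.86, "Rb": 5.20, "Cs": 5.62}

def chi_pauli_free(rs_over_a0):
    """Free electron Pauli susceptibility (dimensionless, CGS), Eq. (31.70)."""
    kF_a0 = (9.0 * math.pi / 4.0) ** (1.0 / 3.0) / rs_over_a0
    return (ALPHA / (2.0 * math.pi)) ** 2 * kF_a0

if __name__ == "__main__":
    for metal, rs in RS_OVER_A0.items():
        print(f"{metal}: 10^6 chi_Pauli = {1e6 * chi_pauli_free(rs):.2f}")
```

Setting $r_s/a_0 = 1$ recovers the coefficient 2.59 of (31.71), and the five printed values reproduce the theoretical column of Table 31.5.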


These expressions reveal that $\chi_{\text{Pauli}}$ has the minute size characteristic of diamagnetic
susceptibilities, in contrast to the strikingly larger paramagnetic susceptibilities of
magnetic ions. This is because the exclusion principle is far more effective than thermal
disorder in suppressing the tendency of the spin magnetic moments to align with the
field. Another way of comparing Pauli paramagnetism with the paramagnetism of
magnetic ions is to note that the Pauli susceptibility can be cast into the Curie's law
form (Eq. (31.47)), but with a fixed temperature of order $T_F$ playing the role of $T$. Thus
the Pauli susceptibility is hundreds of times smaller, even at room temperatures.28

Values  of  the  Pauli  susceptibility,  both  measured  and  theoretical  (from  Eq.  (31.71)) 
are given in Table 31.5 for the alkali metals. The rather significant discrepancy between
the  two  sets  of  figures  is  mainly  a  result  of  the  neglected  electron-electron  interactions 
(see  Problem  12).29 


28  Until  Pauli’s  theory,  the  absence  of  a  strong  Curie’s  law  paramagnetism  in  metals  was  another 
of  the  outstanding  anomalies  in  the  free  electron  theory  of  metals;  as  in  the  case  of  the  specific  heat,  the 
anomaly  was  removed  by  observing  that  electrons  obey  Fermi-Dirac,  rather  than  classical,  statistics. 

29  The  reader  who  recalls  the  large  correction  to  the  electronic  density  of  levels  appearing  in  the 
electronic  specific  heat,  which  arises  from  the  electron-phonon  interaction,  may  be  surprised  to  learn 
that  a  similarly  large  correction  does  not  arise  in  the  Pauli  susceptibility.  There  is  an  important  difference 
between  the  two  cases.  When  the  specific  heat  is  computed,  one  calculates  a  fixed  temperature-independent 
correction  to  the  electronic  density  of  levels,  and  then  inserts  that  fixed  density  of  levels  into  formulas 
(such  as  (2.79))  telling  how  the  energy  changes  as  the  temperature  varies.  When  a  magnetic  field  is 
varied,  however,  the  density  of  levels  itself  changes.  We  have  already  noted,  for  example  (ignoring  phonon 
Table 31.5

COMPARISON OF FREE ELECTRON AND MEASURED PAULI SUSCEPTIBILITIES

METAL    $r_s/a_0$    $10^6\chi_{\text{Pauli}}$ (from Eq. (31.71))    $10^6\chi_{\text{Pauli}}$ (measured)(a)
Li       3.25         0.80                                            2.0
Na       3.93         0.66                                            1.1
K        4.86         0.53                                            0.85
Rb       5.20         0.50                                            0.8
Cs       5.62         0.46                                            0.8

(a) The measured values are taken from the following sources: Li: R. T. Schumacher
and C. P. Slichter, Phys. Rev. 101, 58 (1956); Na: R. T. Schumacher and
W. E. Vehse, J. Phys. Chem. Solids 24, 297 (1965); K: S. Schultz and G. Dunifer,
Phys. Rev. Lett. 18, 283 (1967); Rb, Cs: J. A. Kaeck, Phys. Rev. 175, 897 (1968).


CONDUCTION  ELECTRON  DIAMAGNETISM 

In  the  foregoing  discussion  of  conduction  electron  magnetism  we  considered  only 
the  paramagnetic  effects  arising  from  the  coupling  of  the  intrinsic  spin  of  the  electrons 
with  the  applied  field  H.  There  are  also  diamagnetic  effects  arising  from  the  coupling 
of  the  field  to  the  orbital  motion  of  the  electrons.  We  discussed  this  at  some  length  in 
Chapter 14, where we found that at very low temperatures, high fields, and high
purities ($\omega_c\tau = eH\tau/mc \gg 1$) there was a complicated oscillatory structure to the
dependence of $M$ on $H$. In ordinary specimens the condition of high $\omega_c\tau$ is not met,
and the oscillatory structure is not perceptible. However, the dependence of $M$ on
$H$ does not average out to zero: there is a net nonvanishing magnetization
antiparallel to $H$, known as the Landau diamagnetism, that is due to the orbital electronic
motion induced by the field. For free electrons it can be shown30 that

$$\chi_{\text{Landau}} = -\tfrac{1}{3}\chi_{\text{Pauli}}. \tag{31.72}$$

If  the  electrons  move  in  a  periodic  potential  but  are  otherwise  independent,  the 
analysis  becomes  quite  complicated,  but  again  results  in  a  diamagnetic  susceptibility 
of  the  same  order  of  magnitude  as  the  paramagnetic  susceptibility.  In  practice,  of 


corrections), that the density of levels for each spin population is shifted up or down in energy with changing
field. The phonon correction to this result occurs in a neighborhood (of width $\hbar\omega_D$, which is large compared
with the shift $\hbar\omega_c$ due to the field) of the Fermi level. But the Fermi level does not shift with field, in
contrast to the uncorrected density of levels. Consequently, one cannot simply substitute a
phonon-corrected density of levels into (31.68) as one can in (2.79) because the corrected density of levels varies
with field in an intrinsically different way from the uncorrected one. A careful analysis reveals that because
the phonon correction is tied to the Fermi level, it has very little effect on the magnetization as the field
varies, leading to a correction factor in the susceptibility that is only of order $(m/M)^{1/2}$ (in contrast to
the correction of order unity in the specific heat).

30  See,  for  example,  R.  E.  Peierls,  Quantum  Theory  of  Solids,  Oxford,  1955,  pp.  144-149.  An  analysis 
that  takes  band  structure  into  account  can  be  found  in  P.  K.  Misra  and  L.  M.  Roth,  Phys.  Rev.  177, 
1089  (1969)  and  references  therein. 




course,  it  is  the  total  susceptibility  that  is  revealed  by  a  measurement  of  the  bulk 
moment  induced  by  a  field,  and  this  is  a  combination  of  the  Pauli  paramagnetic 
susceptibility, the Landau diamagnetic susceptibility, and the Larmor diamagnetic
susceptibility  (of  the  closed-shell  ion  cores).  As  a  result,  it  is  not  at  all  a  straightforward 
matter  to  isolate  experimentally  a  particular  term  in  the  susceptibility.  Thus  the  Pauli 
susceptibilities  quoted  in  Table  31.5  were  obtained  by  quite  indirect  methods,  one  of 
which  we  now  describe. 

MEASUREMENT  OF  PAULI  PARAMAGNETISM  BY 
NUCLEAR  MAGNETIC  RESONANCE 

To  distinguish  the  electron  spin  paramagnetic  contribution  to  the  susceptibility  of  a 
metal  from  the  other  sources  of  magnetization,  one  requires  a  probe  that  couples 
much  more  strongly  to  the  spin  magnetic  moments  of  the  conduction  electrons  than 
it  does  to  fields  arising  from  electronic  translational  motion.  The  magnetic  moments 
of  the  ionic  nuclei  provide  such  a  probe. 

A nucleus of angular momentum $\mathbf{I}$ possesses a magnetic moment $\mathbf{m}_N = \gamma_N \mathbf{I}$ (which
is typically smaller in order of magnitude than the electronic magnetic moment by
the electron to nuclear mass ratio). In an applied magnetic field the $(2I + 1)$
degenerate nuclear spin levels are split by an amount $\gamma_N H$. This splitting can be detected
by observing the resonant absorption of energy at the angular frequency $\gamma_N H/\hbar$.31,32

The  field  determining  the  frequency  of  the  nuclear  magnetic  resonance  is,  of  course, 
the  field  acting  directly  on  the  nucleus.  In  nonparamagnetic  substances  the  field  at 
the  nucleus  differs  by  small  diamagnetic  corrections  (known  as  the  chemical  shift) 
from  the  externally  applied  field.  In  metals,  however,  there  is  a  more  important33 
source  of  field  at  the  nucleus: 

The  conduction  electrons  (whose  wave  functions  are  generally  derived,  at  least  in 
part,  from  atomic  s-shells)  have  nonvanishing  wave  functions  at  the  ionic  nuclei. 
When  an  electron  actually  overlaps  the  nucleus,  however,  there  is  a  direct  magnetic 
coupling of their moments proportional to $\mathbf{m}_e \cdot \mathbf{m}_N$.34 If the conduction electron gas
had  no  net  paramagnetic  moment,  this  coupling  would  yield  no  net  shift  in  the  nuclear 
resonance  since  electrons  of  all  spin  orientations  would  be  found,  with  equal  prob¬ 
ability,  at  the  nuclear  position.35  However,  the  same  field  in  which  the  nuclei  precess 
also  produces  the  Pauli  paramagnetic  imbalance  in  the  electronic  spin  populations. 


31  In  practical  nuclear  magnetic  resonance  experiments  the  resonance  is  observed  by  fixing  the 
frequency  of  the  perturbing  radio-frequency  (RF)  field,  and  varying  the  applied  magnetic  field. 

32 An excellent introduction to nuclear magnetic resonance is given by C. Slichter, Principles of
Magnetic Resonance, Harper and Row, New York, 1963.

33  Since  isolated  nuclei  are  hard  to  work  with,  one  generally  deals  only  with  relative  shifts.  One 
knows  that  the  shift  in  metals  is  more  important  because  it  differs  from  the  shift  in  salts  of  the  same  metal 
by  far  more  than  the  shift  from  one  salt  to  another. 

34  Known,  variously,  as  the  hyperfine  interaction,  the  Fermi  interaction,  or  the  contact  interaction. 

35 This picture assumes, of course, that the nucleus experiences the average electron field; i.e., that
each nucleus interacts with many electron spins in the time it takes to complete a single period of precession.
Since this time is typically $10^{-6}$ second in a strong field, the condition is very well satisfied, for a conduction
electron moves with velocity $v_F$ (of order $10^8$ cm/sec) and therefore takes about $10^{-21}$ second to cross
an atomic nucleus (whose radius is of order $10^{-13}$ cm).
There is therefore a net electronic moment leading to an effective field at the nucleus
that  is  proportional  to  the  conduction  electron  spin  susceptibility. 

The  shift  produced  by  this  field,  known  as  the  Knight  shift,  is  measured  by  noting 
the  difference  in  resonant  frequency  between  the  metallic  element  in  (for  example)  a 
nonparamagnetic  salt,  and  in  the  metallic  state.  Unfortunately  the  Knight  shift  is 
proportional  not  only  to  the  Pauli  susceptibility,  but  also  to  the  magnitude  squared  of 
the  conduction  electron  wave  function  at  the  ionic  nucleus.  It  is  therefore  necessary  to 
have  some  estimate  of  this  (usually  from  a  calculation)  to  extract  the  Pauli  suscep¬ 
tibility  from  the  measured  Knight  shift. 


ELECTRON  DIAMAGNETISM  IN  DOPED  SEMICONDUCTORS 

Doped  semiconductors  provide  an  example  of  a  conducting  material  in  which  the 
conduction  electron  diamagnetism  can  be  substantially  larger  than  the  paramag¬ 
netism.  One  first  measures  the  susceptibility  of  the  intrinsic  material  at  very  low 
temperatures,  where  it  is  almost  entirely  due  to  the  diamagnetism  of  the  ion  cores. 
This  contribution  will  also  be  present  in  the  doped  material,  and  by  subtracting  it 
from  the  total  susceptibility  one  can  extract  the  contribution  to  the  susceptibility  of 
the  doped  material  due  to  the  carriers  introduced  by  the  doping.36 

Consider the case in which the carriers go into bands with spherical symmetry,
so that $\mathcal{E}(\mathbf{k}) = \hbar^2 k^2/2m^*$. (For concreteness we consider donor impurities, and measure
$\mathbf{k}$ with respect to the conduction band minimum.) According to (31.69), the
paramagnetic susceptibility is proportional to the density of levels.37 Since this is
proportional to $m$ for free electrons, the Pauli susceptibility of the carriers will be reduced38
by a factor $m^*/m$. On the other hand, the Landau susceptibility is enhanced by a
factor $m/m^*$, since the coupling of the electron orbital motion to the fields is
proportional to $e(\mathbf{v}/c) \times \mathbf{H}$, which is inversely proportional to $m^*$. As a result,

$$\frac{\chi_{\text{Landau}}}{\chi_{\text{Pauli}}} \sim \left(\frac{m}{m^*}\right)^2. \tag{31.73}$$

Thus  there  are  semiconductors  in  which  the  electron  Landau  diamagnetism  com¬ 
pletely  dominates  the  Pauli  spin  paramagnetism,  and  can  therefore  be  directly 
extracted,  by  measuring  the  amount  by  which  the  susceptibility  exceeds  that  of  the 
undoped  material.39 
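
To get a feeling for the numbers, the sketch below combines the scaling argument of this section with the free electron result (31.72). It assumes, purely for illustration, that the carrier Pauli term is the free electron value times $m^*/m$ and the carrier Landau term is the free electron value times $m/m^*$, at fixed carrier density; the function names are hypothetical and the model ignores band structure details beyond the effective mass.

```python
def landau_over_pauli(mstar_over_m):
    """Ratio chi_Landau / chi_Pauli for carriers of effective mass m*.

    Free electrons have chi_Landau = -(1/3) chi_Pauli; the Pauli term scales
    as m*/m and the Landau term as m/m*, so the ratio scales as (m/m*)^2.
    """
    return -(1.0 / 3.0) / mstar_over_m ** 2

def carrier_chi_over_free_pauli(mstar_over_m):
    """Total carrier susceptibility in units of the free electron Pauli value."""
    return mstar_over_m - (1.0 / 3.0) / mstar_over_m

if __name__ == "__main__":
    for ratio in (1.0, 0.5, 0.1):
        print(f"m*/m = {ratio}: chi_Landau/chi_Pauli = {landau_over_pauli(ratio):.2f}, "
              f"total/chi_Pauli(free) = {carrier_chi_over_free_pauli(ratio):.2f}")
```

In this simple model the net carrier susceptibility changes sign from paramagnetic to diamagnetic once $m^*/m$ falls below $1/\sqrt{3}$, and for $m^*/m = 0.1$ the Landau term exceeds the Pauli term by a factor of about 33.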

This  completes  our  survey  of  those  magnetic  properties  of  solids  that  can  be 
understood  without  taking  into  explicit  account  the  interactions  between  the  sources 
of  magnetic  moment.  In  Chapter  32  we  shall  turn  to  the  theory  underlying  such 
interactions,  and  in  Chapter  33  we  shall  return  to  the  examination  of  further  magnetic 
properties  of  solids,  which  depend  on  them  crucially. 


36  The  change  in  the  diamagnetic  susceptibility  due  to  the  different  closed-shell  structure  of  the  donor 
ions  gives  a  very  small  correction. 

37  These  considerations  can  be  shown  to  remain  valid  even  if  the  conduction  band  electrons  are  not 
degenerate. 

38  The  ratio  m*/m  is  typically  0.1  or  smaller. 

39  For  a  review,  see  R.  Bowers,  J.  Phys.  Chem.  Solids  8,  206  (1959). 
PROBLEMS


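1. The classical definition of the magnetic moment of a particle of charge $-e$, due to its orbital
motion, was given by Ampère as the average over the orbit of

$$-\frac{e}{2c}(\mathbf{r} \times \mathbf{v}). \tag{31.74}$$

Show that our definition, $\mathbf{m} = -\partial E/\partial \mathbf{H}$, reduces to this form by showing from (31.15) that

$$\mathbf{m} = -\frac{e}{2mc} \sum_i \mathbf{r}_i \times \left(\mathbf{p}_i - \frac{e}{2c}\,\mathbf{r}_i \times \mathbf{H}\right) \tag{31.75}$$

and

$$\mathbf{v}_i = \frac{\partial \mathcal{H}}{\partial \mathbf{p}_i} = \frac{1}{m}\left(\mathbf{p}_i - \frac{e}{2c}\,\mathbf{r}_i \times \mathbf{H}\right). \tag{31.76}$$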

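2. The Pauli spin matrices satisfy the simple identity

$$(\mathbf{a} \cdot \boldsymbol{\sigma})(\mathbf{b} \cdot \boldsymbol{\sigma}) = \mathbf{a} \cdot \mathbf{b} + i(\mathbf{a} \times \mathbf{b}) \cdot \boldsymbol{\sigma}, \tag{31.77}$$

provided all the components of $\mathbf{a}$ and $\mathbf{b}$ commute with those of $\boldsymbol{\sigma}$. If the components of $\mathbf{a}$ commute
among themselves, $\mathbf{a} \times \mathbf{a} = 0$. Since the components of $\mathbf{p}$ so commute, in the absence of a magnetic
field we could equally well write the kinetic energy of a spin-$\tfrac{1}{2}$ particle in the form $(\boldsymbol{\sigma} \cdot \mathbf{p})^2/2m$.
However when a field is present, the components of $\mathbf{p} + e\mathbf{A}/c$ no longer commute among
themselves. Show, as a result, that (31.77) implies:

$$\frac{1}{2m}\left[\boldsymbol{\sigma} \cdot \left(\mathbf{p} + \frac{e\mathbf{A}}{c}\right)\right]^2 = \frac{1}{2m}\left(\mathbf{p} + \frac{e\mathbf{A}}{c}\right)^2 + \frac{e\hbar}{2mc}\,\boldsymbol{\sigma} \cdot \mathbf{H}, \tag{31.78}$$

which thus gives both the spin and orbital contributions to the magnetic part of the Hamiltonian
in one compact formula (provided that $g_0 = 2$).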


3. (a) Show that Hund's rules for a shell of angular momentum $l$ containing $n$ electrons can
be summarized in the formulas:

$$S = \tfrac{1}{2}\left[(2l + 1) - |2l + 1 - n|\right],$$
$$L = S\,|2l + 1 - n|,$$
$$J = |2l - n|\,S. \tag{31.79}$$

(b) Verify that the two ways of counting the degeneracy of a given LS-multiplet give the
same answer; i.e., verify that

$$(2L + 1)(2S + 1) = \sum_{J = |L - S|}^{L + S} (2J + 1). \tag{31.80}$$

(c) Show that the total splitting of an LS-multiplet due to the spin-orbit interaction $\lambda(\mathbf{L} \cdot \mathbf{S})$ is

$$E_{J_{\max}} - E_{J_{\min}} = \begin{cases} \lambda S(2L + 1), & L > S, \\ \lambda L(2S + 1), & S > L, \end{cases} \tag{31.81}$$

and that the splitting between successive J-multiplets within the LS-multiplet is

$$E_{J+1} - E_J = \lambda(J + 1). \tag{31.82}$$

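4. (a) The angular momentum commutation relations are summarized in the vector operator
identities

$$\mathbf{L} \times \mathbf{L} = i\mathbf{L}, \quad \mathbf{S} \times \mathbf{S} = i\mathbf{S}. \tag{31.83}$$

Deduce from these identities and the fact that all components of $\mathbf{L}$ commute with all components
of $\mathbf{S}$ that

$$[\mathbf{L} + g_0\mathbf{S},\ \hat{\mathbf{n}} \cdot \mathbf{J}] = i\hat{\mathbf{n}} \times (\mathbf{L} + g_0\mathbf{S}), \tag{31.84}$$

for any (c-number) unit vector $\hat{\mathbf{n}}$.

(b) A state $|0\rangle$ with zero total angular momentum satisfies

$$J_x|0\rangle = J_y|0\rangle = J_z|0\rangle = 0. \tag{31.85}$$

Deduce from (31.84) that

$$\langle 0|(\mathbf{L} + g_0\mathbf{S})|0\rangle = 0, \tag{31.86}$$

even though $\mathbf{L}^2$ and $\mathbf{S}^2$ need not vanish in the state $|0\rangle$, and $(\mathbf{L} + g_0\mathbf{S})|0\rangle$ need not be zero.

(c) Deduce the Wigner-Eckart theorem (Eq. (31.34)) in the special case $J = 1/2$, from the
commutation relations (31.84).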

5. Suppose that within the set of $(2L + 1)(2S + 1)$ lowest-lying ionic states the crystal field
can be represented in the form $aL_x^2 + bL_y^2 + cL_z^2$, with $a$, $b$, and $c$ all different. Show in the
special case $L = 1$ that if the crystal field is the dominant perturbation (compared with spin-orbit
coupling), then it will yield a $(2S + 1)$-fold degenerate set of ground states in which every matrix
element of every component of $\mathbf{L}$ vanishes.

6. The susceptibility of a simple metal has a contribution $\chi_{\text{c.e.}}$ from the conduction electrons and
a contribution $\chi_{\text{ion}}$ from the diamagnetic response of the closed-shell core electrons. Taking the
conduction electron susceptibility to be given by the free electron values of the Pauli paramagnetic
and Landau diamagnetic susceptibilities, show that

$$\frac{\chi_{\text{ion}}}{\chi_{\text{c.e.}}} = -\frac{1}{3}\,\frac{Z_c}{Z_v}\,\langle (k_F r)^2 \rangle, \tag{31.87}$$

where $Z_v$ is the valence, $Z_c$ is the number of core electrons, and $\langle r^2 \rangle$ is the mean square ionic
radius defined in (31.26).


7. Consider an ion with a partially filled shell of angular momentum $J$, and $Z$ additional electrons
in filled shells. Show that the ratio of the Curie's law paramagnetic susceptibility to the Larmor
diamagnetic susceptibility is

$$\frac{\chi_{\text{par}}}{\chi_{\text{dia}}} = -\frac{2J(J + 1)}{Z k_B T}\,\frac{\hbar^2}{m\langle r^2 \rangle}, \tag{31.88}$$

and deduce from this the numerical estimate on page 656.
8. Show that the magnetization of a nondegenerate electron gas is precisely given by the result
(31.44) for independent moments (with $J$ set equal to 1/2), by inserting the low-density expansion
of the Fermi function, $f \approx e^{-\beta(\mathcal{E} - \mu)}$, into (31.59).

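9. If one writes the free energy (31.5) in the form

$$e^{-\beta F} = \sum_n e^{-\beta E_n} = \sum_n \langle n|e^{-\beta\mathcal{H}}|n\rangle = \mathrm{Tr}\, e^{-\beta\mathcal{H}}, \tag{31.89}$$

then it is easy to deduce Curie's law directly at high temperature without going through the algebra
of Brillouin functions, for when $\mathcal{H} \ll k_B T$, we can expand $e^{-\beta\mathcal{H}} = 1 - \beta\mathcal{H} + (\beta\mathcal{H})^2/2 - \cdots$.
Evaluate the free energy to second order in the field, using the fact that

$$\mathrm{Tr}(J_\mu J_\nu) = \tfrac{1}{3}\,\delta_{\mu\nu}\,\mathrm{Tr}\,\mathbf{J}^2,$$

and extract the high-temperature susceptibility (31.47).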


10. Show that for an ideal paramagnet whose free energy has the form (31.52), the specific heat
at constant field is simply related to the susceptibility by

$$c_H = T\left(\frac{\partial s}{\partial T}\right)_H = \frac{H^2\chi}{T}, \tag{31.90}$$

or, in the regime where Curie's law is valid,

$$c_H = \frac{N}{3V}\,k_B\,J(J + 1)\left(\frac{g\mu_B H}{k_B T}\right)^2. \tag{31.91}$$

Estimating the lattice vibrational contribution to the specific heat by Eq. (23.27), show that the
lattice contribution drops below that of the spins at a temperature $T_0$ of order

$$T_0 \approx \left(\frac{N}{N_i}\right)^{1/5}\left(\frac{g\mu_B H}{k_B\Theta_D}\right)^{2/5}\Theta_D. \tag{31.92}$$

(Here $N_i$ is the total number of ions, and $N$ is the number of paramagnetic ions.) What is a typical
size for $g\mu_B H/k_B\Theta_D$ in a field of $10^4$ gauss?


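11. Show that if $T$ is small compared with the Fermi temperature, the temperature-dependent
correction to the Pauli susceptibility (31.69) is given by

$$\chi(T) = \chi(0)\left[1 - \frac{\pi^2}{6}(k_B T)^2\left(\left(\frac{g'}{g}\right)^2 - \frac{g''}{g}\right)\right], \tag{31.93}$$

where $g$, $g'$, and $g''$ are the density of levels and its derivatives at the Fermi energy. Show that for
free electrons this reduces to

$$\chi(T) = \chi(0)\left[1 - \frac{\pi^2}{12}\left(\frac{k_B T}{\mathcal{E}_F}\right)^2\right]. \tag{31.94}$$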

12. Because of electron-electron interactions, the shift in an electron's energy due to the
interaction of its spin magnetic moment with a field $H$ will have an additional term expressing the
change in the distribution of electrons with which a given electron interacts. In the Hartree-Fock
approximation (see, for example, Eq. (17.19)) this will be of the form

$$\mathcal{E}_\pm(\mathbf{k}) = \mathcal{E}_0(\mathbf{k}) \pm \mu_B H - \int \frac{d\mathbf{k}'}{(2\pi)^3}\, v(|\mathbf{k} - \mathbf{k}'|)\, f(\mathcal{E}_\pm(\mathbf{k}')). \tag{31.95}$$

Show that when $k_B T \ll \mathcal{E}_F$ this leads to an integral equation for $\mathcal{E}_+ - \mathcal{E}_-$ that has the solution:

$$[\mathcal{E}_+(\mathbf{k}) - \mathcal{E}_-(\mathbf{k})]_{k = k_F} = \frac{2\mu_B H}{1 - v_0\, g(\mathcal{E}_F)}, \tag{31.96}$$

where $v_0$ is an average of $v$ over all solid angles,

$$v_0 = \frac{1}{2}\int_{-1}^{1} dx\; v\!\left(\sqrt{2k_F^2(1 - x)}\right). \tag{31.97}$$

By what factor is the Pauli susceptibility now modified?


32

Electron Interactions and Magnetic Structure


Electrostatic Origins of Magnetic Interactions

Magnetic Properties of a Two-Electron System

Failure of the Independent Electron Approximation

Spin Hamiltonians

Direct,  Super,  Indirect,  and  Itinerant  Exchange 
Magnetic  Interactions  in  the  Free  Electron  Gas 
The  Hubbard  Model 
Local  Moments 

The  Kondo  Theory  of  the  Resistance  Minimum 




The  simple  theory  of  paramagnetism  in  solids,  described  in  Chapter  31,  assumes  that 
the  discrete  sources  of  magnetic  moment  (e.g.,  the  ionic  shells  of  nonzero  angular 
momentum  in  insulators,  or  the  conduction  electrons  in  simple  metals)  do  not  interact 
with  one  another.  We  have  seen  that  this  assumption  must  be  dropped  in  order,  for 
example,  to  predict  the  lowest  temperatures  attainable  by  adiabatic  demagnetization, 
or  to  estimate  accurately  the  Pauli  spin  paramagnetism  of  metallic  conduction 
electrons. 

There  are,  however,  more  spectacular  consequences  of  magnetic  interactions.1 
Some  solids,  known  as  ferromagnets,  have  a  nonvanishing  magnetic  moment,  or 
“spontaneous  magnetization,”  even  in  the  absence  of  a  magnetic  field.2  If  there  were 
no  magnetic  interactions,  in  the  absence  of  a  field  the  individual  magnetic  moments 
would  be  thermally  disordered,  would  point  in  random  directions,  and  could  not  sum 
to  a  net  moment  for  the  solid  as  a  whole  (Figure  32.1a).  The  net  parallel  orientation 
of the moments in a ferromagnet (Figure 32.1b) must be due to interactions between
them.  In  other  solids,  known  as  antiferromagnets,  although  there  is  no  net  total 
moment  in  the  absence  of  a  field,  there  is  a  far  from  random  spatial  pattern  of  the 
individual  magnetic  moments,  due  to  magnetic  interactions  favoring  antiparallel 
orientations  of  neighboring  moments  (Figure  32.1c). 

The  theory  of  the  origin  of  magnetic  interactions  is  one  of  the  less  well  developed 
of  the  fundamental  areas  of  solid  state  physics.  The  problem  is  best  understood  in 

Figure 32.1
Typical distribution of directions for the local magnetic moments when no magnetic field is
present (a) in a solid with inconsequential magnetic interactions, (b) in a ferromagnetic solid
below its critical temperature, and (c) in an antiferromagnetic solid below its critical temperature.
Cases (b) and (c) illustrate magnetically ordered states.


1 We use the term “magnetic interaction” to describe any dependence of the energy of two or more
magnetic moments on their relative directions. We shall see that the most important contribution to this
energy dependence is usually electrostatic rather than magnetic in origin. The nomenclature can be
misleading if one forgets that “magnetic” refers only to the effects of the interactions, and not necessarily their
sources.

2 The spontaneous magnetization decreases with increasing temperature, and vanishes above a
certain critical temperature (see Chapter 33). When we use the term “ferromagnet” we shall mean a
ferromagnetic material below its critical temperature. We use the term “antiferromagnet” with the same
understanding.

insulators, where the magnetic ions are well separated, though even here the theory
is quite complex. To keep things as simple as possible we shall illustrate some of
the basic physics applicable to insulators (and, with considerable modifications and
embellishments, to metals as well) only in the simple case of a single hydrogen
molecule, which the charitable reader will be asked to regard as a solid with N = 2
rather than $O(10^{23})$. We shall then indicate how the ideas suggested by the hydrogen
molecule are generalized to real solids containing large numbers of atoms. Finally,
we shall illustrate some of the further complexities one encounters in the theory of
magnetic moments and their interactions in metals.

Readers  who  wish  to  be  spared  the  rather  uncomfortably  compromised  exposition 
forced  on  us  by  the  difficulty  and  incompleteness  of  the  subject  might  merely  want 
to  note  the  following  two  major  points,  which  the  remainder  of  the  chapter  will 
serve to illustrate in rather more detail:

1. One's first expectation might be that the magnetic interactions between discrete
moments arise from their magnetic fields, either directly through magnetic
dipole-dipole interactions, or, less directly, through spin-orbit coupling. These, however,
are not often the dominant magnetic interaction. By far the most important
source of magnetic interaction is the ordinary electrostatic electron-electron
interaction. Indeed, to a first approximation many theories of magnetism
ignore dipole-dipole and spin-orbit coupling altogether, retaining only Coulomb
interactions.

2.  To  explain  magnetic  ordering  in  solids,  in  the  great  majority  of  cases  it  is  necessary 
to  go  well  beyond  the  independent  electron  approximation,  upon  which  band 
theory,  with  its  impressive  successes  in  accounting  for  nonmagnetic  properties, 
is  based.  It  is  rarely  enough  merely  to  introduce  electron-electron  interactions 
into  band  theory  in  the  form  of  self-consistent  fields.  Indeed,  the  development 
of a tractable model of a magnetic metal, capable of describing both the
characteristic electron spin correlations and the electronic transport properties
predicted by simple band theory, remains one of the major unsolved problems
of modern solid state theory.


ESTIMATE  OF  MAGNETIC  DIPOLAR  INTERACTION  ENERGIES 

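Before explaining how magnetic interactions can arise from purely electrostatic
couplings, we estimate the direct dipolar interaction energy of two magnetic dipoles $\mathbf{m}_1$
and $\mathbf{m}_2$, separated by $\mathbf{r}$:

$$U = \frac{1}{r^3}\left[\mathbf{m}_1 \cdot \mathbf{m}_2 - 3(\mathbf{m}_1 \cdot \hat{\mathbf{r}})(\mathbf{m}_2 \cdot \hat{\mathbf{r}})\right]. \qquad (32.1)$$

Atomic magnetic dipole moments have a magnitude $m_1 \approx m_2 \approx g\mu_B \approx e\hbar/mc$ (pages
646 and 654). Hence the size of U (ignoring its angular dependence) will be

$$U \approx \frac{(g\mu_B)^2}{r^3} \approx \left(\frac{e^2}{\hbar c}\right)^2 \left(\frac{a_0}{r}\right)^3 \frac{e^2}{a_0} \approx \frac{1}{(137)^2} \left(\frac{a_0}{r}\right)^3 \text{Ry}. \qquad (32.2)$$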

In a magnetic solid, moments are typically about 2 Å apart, and hence U is no more
than $10^{-4}$ eV. This is minute compared with electrostatic energy differences between
atomic  states,  which  are  typically  a  fraction  of  an  electron  volt.  Therefore,  if  we  can 
find  a  reason  why  the  electrostatic  energy  of  a  pair  of  magnetic  ions  (or  electrons) 
depends  on  the  direction  of  their  moments  (and  such  a  reason  is  supplied  by  the 
Pauli  exclusion  principle,  as  we  shall  see  below),  then  we  should  expect  that  source 
of  magnetic  interaction  to  be  far  more  important  than  the  dipolar  interaction.3 
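
The size of the estimate (32.2) is easy to check numerically. The following is a minimal sketch, assuming rounded values of the physical constants and treating (32.2) purely as an order-of-magnitude formula; the function name `dipolar_energy_ev` is ours and appears nowhere in the text.

```python
# Order-of-magnitude check of (32.2): U ~ (1/137)^2 (a0/r)^3 Ry.
# Constants are rounded; the function name is illustrative only.
ALPHA = 1.0 / 137.036          # fine structure constant e^2/(hbar c)
RYDBERG_EV = 13.606            # 1 Ry in eV
BOHR_RADIUS_ANGSTROM = 0.529   # a0 in angstroms
K_B_EV_PER_K = 8.617e-5        # Boltzmann constant in eV/K


def dipolar_energy_ev(r_angstrom: float) -> float:
    """Scale of the dipolar energy for moments ~ e*hbar/(m c) a distance r apart."""
    return ALPHA**2 * (BOHR_RADIUS_ANGSTROM / r_angstrom) ** 3 * RYDBERG_EV


u = dipolar_energy_ev(2.0)     # moments about 2 angstroms apart
print(f"U ~ {u:.1e} eV, equivalent temperature ~ {u / K_B_EV_PER_K:.2f} K")
```

With these assumptions the energy comes out in the range of $10^{-5}$ eV, which corresponds to a temperature below 1 K, in line with the bound quoted above.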

One  can  also  generally  rule  out  spin-orbit  coupling  as  a  major  source  of  magnetic 
interaction.  To  be  sure,  it  is  quite  important  in  determining  the  total  magnetic  moment 
of  individual  atoms,  and  is  therefore  a  significant  intraatomic  source  of  magnetic 
interaction. Even in this case, however, the first two Hund rules (page 650) are
determined purely by electrostatic energy considerations. Only the third rule, giving the
final splitting within the LS-multiplet, is based on spin-orbit coupling. In those
paramagnetic insulators in which crystal field splitting quenches the orbital angular
momentum (page 657) even this consequence of spin-orbit coupling is superseded
by  purely  electrostatic  effects. 

MAGNETIC  PROPERTIES  OF  A  TWO-ELECTRON  SYSTEM: 
SINGLET  AND  TRIPLET  STATES 

To  illustrate  how  the  Pauli  principle  can  lead  to  magnetic  effects  even  when  there 
are  no  spin-dependent  terms  in  the  Hamiltonian,  we  consider  a  two-electron  system 
with a spin-independent Hamiltonian. Because H does not depend on spin, the general
stationary state $\Psi$ will be the product of a purely orbital stationary state whose wave
function $\psi(\mathbf{r}_1, \mathbf{r}_2)$ satisfies the orbital Schrödinger equation

$$H\psi = -\frac{\hbar^2}{2m}(\nabla_1^2 + \nabla_2^2)\psi + V(\mathbf{r}_1, \mathbf{r}_2)\psi = E\psi, \qquad (32.3)$$

with any linear combination of the four spin states4

$$|\uparrow\uparrow\rangle, \quad |\uparrow\downarrow\rangle, \quad |\downarrow\uparrow\rangle, \quad |\downarrow\downarrow\rangle. \qquad (32.4)$$

We can choose these linear combinations to have definite values of the total spin S,
and its component $S_z$ along an axis. The appropriate linear combinations can be
tabulated as follows:5


3 A strong hint that the dipolar interaction is far too weak is provided by the ferromagnetic critical
temperatures in iron, cobalt, and nickel, which are many hundreds of degrees K. If the spins were aligned
by magnetic dipolar interactions, one would expect ferromagnetic alignment to be thermally obliterated
above a few degrees K ($1\ \text{K} \sim 10^{-4}$ eV). On the other hand, in solids with widely separated magnetic
moments the dipolar interactions may dominate those of electrostatic origin. Dipolar interactions are also
of crucial importance in accounting for the phenomenon of ferromagnetic domains (page 718).

4 These symbols denote spin states with both electrons in levels of definite $s_z$. In the state $|\uparrow\downarrow\rangle$, for
example, electron 1 has $s_z = 1/2$, and electron 2, $s_z = -1/2$.

5  See,  for  example,  D.  Park,  Introduction  to  the  Quantum  Theory,  McGraw-Hill,  New  York,  1964, 
pp.  154-156. 
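
STATE                                                                            S     S_z

$\frac{1}{\sqrt{2}}(|\uparrow\downarrow\rangle - |\downarrow\uparrow\rangle)$    0     0
$|\uparrow\uparrow\rangle$                                                       1     1
$\frac{1}{\sqrt{2}}(|\uparrow\downarrow\rangle + |\downarrow\uparrow\rangle)$    1     0
$|\downarrow\downarrow\rangle$                                                   1     -1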

Note that the one state with S = 0 (known as the singlet state) changes sign when
the spins of the two electrons are interchanged, while the three states with S = 1
(known as the triplet states) do not. The Pauli exclusion principle requires that the
total wave function $\Psi$ change sign under the simultaneous interchange of both space
and spin coordinates. Since the total wave function is the product of its spin and
orbital parts, it follows that solutions to the orbital Schrödinger equation (32.3) that
do not change sign under interchange of $\mathbf{r}_1$ and $\mathbf{r}_2$ (symmetric solutions) must describe
states with S = 0, while solutions that do change sign (antisymmetric solutions) must
go with S = 1.6, 7 There is thus a strict correlation between the spatial symmetry of
the solution to the (spin-independent) orbital Schrödinger equation and the total
spin: Symmetric solutions require singlet spin states; and antisymmetric, triplets.
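
The symmetry properties of the four spin states can be verified by brute force. The sketch below is only an illustration, assuming the basis ordering $|\uparrow\uparrow\rangle, |\uparrow\downarrow\rangle, |\downarrow\uparrow\rangle, |\downarrow\downarrow\rangle$ and units with $\hbar = 1$; the helper names (`ket`, `exchange_sign`, `total_spin_squared`) are ours.

```python
import numpy as np

# Single-spin basis vectors and spin-1/2 operators (hbar = 1).
up = np.array([1.0, 0.0])
down = np.array([0.0, 1.0])
sx = np.array([[0, 1], [1, 0]], dtype=complex) / 2
sy = np.array([[0, -1j], [1j, 0]], dtype=complex) / 2
sz = np.array([[1, 0], [0, -1]], dtype=complex) / 2
I2 = np.eye(2, dtype=complex)


def ket(a, b):
    """Two-spin product state |a b> in the basis |uu>, |ud>, |du>, |dd>."""
    return np.kron(a, b)


singlet = (ket(up, down) - ket(down, up)) / np.sqrt(2)
triplets = [
    ket(up, up),
    (ket(up, down) + ket(down, up)) / np.sqrt(2),
    ket(down, down),
]

# Operator interchanging the two spins: it swaps |ud> and |du>.
SWAP = np.array([[1, 0, 0, 0],
                 [0, 0, 1, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1]], dtype=float)

# Total spin squared, S^2 = (S1 + S2)^2.
S_TOTAL_SQ = sum(
    (np.kron(s, I2) + np.kron(I2, s)) @ (np.kron(s, I2) + np.kron(I2, s))
    for s in (sx, sy, sz)
)


def exchange_sign(state):
    """Expectation value of the spin interchange: -1 (odd) or +1 (even)."""
    return float(state @ SWAP @ state)


def total_spin_squared(state):
    """Expectation value of S^2, equal to S(S+1) in a state of definite S."""
    return float((state.conj() @ S_TOTAL_SQ @ state).real)


for name, state in [("singlet", singlet)] + [("triplet", t) for t in triplets]:
    print(name, exchange_sign(state), total_spin_squared(state))
```

The singlet comes out odd under interchange with $S(S+1) = 0$, and each triplet state even with $S(S+1) = 2$, as in the table.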

If $E_s$ and $E_t$ are the lowest eigenvalues of (32.3) associated with the singlet
(symmetric) and triplet (antisymmetric) solutions, then the ground state will have spin
zero or one, depending only on whether $E_s$ is less than or greater than $E_t$, a question,
we stress again, that is settled completely by an examination of the spin-independent
Schrödinger equation (32.3).

As it happens, for two-electron systems there is an elementary theorem that the
ground-state wave function for (32.3) must be symmetric. Thus $E_s$ must be less than
$E_t$, and the ground state must have zero total spin. However, the theorem holds only
for two-electron systems,8 and it is therefore important to find a way to estimate
$E_s - E_t$ that can be generalized to the analogous problem for an N-atom solid. We
continue to use the two-electron system to illustrate this method (in spite of the
theorem assuring us that the singlet state has the lowest energy) because it reveals
most simply the inadequacy of the independent electron approximation in magnetic
problems.


6 All solutions to (32.3) can be taken to be symmetric or antisymmetric, because of the symmetry of
V (which contains all electrostatic interactions among the two electrons and two protons fixed at $\mathbf{R}_1$ and
$\mathbf{R}_2$). See Problem 1.

7 See Problem 2.

8 In one spatial dimension it has been proved that the ground state of any number of electrons with
arbitrary spin-independent interactions must have zero total spin (E. Lieb and D. Mattis, Phys. Rev. 125,
164 (1962)). The theorem cannot be generalized to three dimensions (where, for example, Hund's rules
(see Chapter 31) provide many counterexamples).

CALCULATION OF THE SINGLET-TRIPLET SPLITTING:
FAILURE  OF  THE  INDEPENDENT  ELECTRON  APPROXIMATION 


The singlet-triplet energy splitting measures the extent to which the antiparallel
(S = 0) spin alignment of two electrons is more favorable than the parallel (S = 1).
Since $E_s - E_t$ is the difference between eigenvalues of a Hamiltonian containing only
electrostatic interactions, this energy should be of the order of electrostatic energy
differences, and therefore quite capable of being the dominant source of magnetic
interaction, even when explicitly spin-dependent interactions are added to the
Hamiltonian. We shall describe some approximate methods for calculating $E_s - E_t$. Our
aim is not to extract numerical results (though the methods are used for that purpose),
but to illustrate in the very simple two-electron case the very subtle (especially
when N is large) inadequacies of the independent electron approximation in dealing
with electron spin correlations.

Suppose, then, we begin by trying to solve the two-electron problem (32.3) in the
independent electron approximation; i.e., we ignore the electron-electron Coulomb
interaction in $V(\mathbf{r}_1, \mathbf{r}_2)$, retaining only the interaction of each electron with the two
ions (which we take to be fixed at $\mathbf{R}_1$ and $\mathbf{R}_2$). The two-electron Schrödinger equation
(32.3) then assumes the form


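$$(h_1 + h_2)\psi(\mathbf{r}_1, \mathbf{r}_2) = E\psi(\mathbf{r}_1, \mathbf{r}_2), \qquad (32.5)$$

where9

$$h_i = -\frac{\hbar^2}{2m}\nabla_i^2 - \frac{e^2}{|\mathbf{r}_i - \mathbf{R}_1|} - \frac{e^2}{|\mathbf{r}_i - \mathbf{R}_2|}, \qquad i = 1, 2. \qquad (32.6)$$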


Because the Hamiltonian in (32.5) is a sum of one-electron Hamiltonians, the solution
can be constructed from solutions of the one-electron Schrödinger equation:

$$h\psi(\mathbf{r}) = \mathcal{E}\psi(\mathbf{r}). \qquad (32.7)$$

If $\psi_0(\mathbf{r})$ and $\psi_1(\mathbf{r})$ are the two solutions to (32.7) of lowest energy, with energies $\mathcal{E}_0 < \mathcal{E}_1$,
then the symmetric solution of lowest energy to the approximate two-electron
Schrödinger equation (32.5) is

$$\psi_s(\mathbf{r}_1, \mathbf{r}_2) = \psi_0(\mathbf{r}_1)\psi_0(\mathbf{r}_2), \qquad E_s = 2\mathcal{E}_0, \qquad (32.8)$$

and the lowest antisymmetric solution is

$$\psi_t(\mathbf{r}_1, \mathbf{r}_2) = \psi_0(\mathbf{r}_1)\psi_1(\mathbf{r}_2) - \psi_0(\mathbf{r}_2)\psi_1(\mathbf{r}_1), \qquad E_t = \mathcal{E}_0 + \mathcal{E}_1. \qquad (32.9)$$

The singlet-triplet energy splitting is then

$$E_s - E_t = \mathcal{E}_0 - \mathcal{E}_1, \qquad (32.10)$$

which is consistent with the general theorem $E_s < E_t$ for two-electron systems.

In arriving at the ground-state energy $2\mathcal{E}_0$, we have merely followed the steps of


9  The  following  discussion  would  apply  unchanged  if  one  were  to  approximate  the  electron-electron 
interaction  by  a  self-consistent  field  which  modified  the  bare  electron-ion  Coulomb  interaction.  (See 
page  192.) 

band theory, specialized to the case of an N = 2 “solid,” first solving the one-electron
problem (32.7), and then filling the lowest N/2 one-electron levels with two electrons
(of opposite spin) per level. In spite of this pleasing familiarity, the wave function (32.8)
is manifestly a very bad approximation to the ground state of the exact Schrödinger
equation (32.3) when the protons are very far apart, for in that case it fails quite
disastrously to deal with the electron-electron Coulomb interaction. This becomes
evident when one examines the structure of the one-electron wave functions $\psi_0(\mathbf{r})$
and $\psi_1(\mathbf{r})$. For well-separated protons these solutions to (32.7) are given to an excellent
approximation by the tight-binding method (Chapter 10) specialized to the case in
which N = 2. The tight-binding method takes the one-electron stationary-state wave
functions of the solid to be linear combinations of atomic stationary-state wave
functions centered at the lattice points R. When N = 2 the correct linear combinations
are10

$$\psi_0(\mathbf{r}) = \phi_1(\mathbf{r}) + \phi_2(\mathbf{r}),$$
$$\psi_1(\mathbf{r}) = \phi_1(\mathbf{r}) - \phi_2(\mathbf{r}), \qquad (32.11)$$

where $\phi_i(\mathbf{r})$ is the ground-state electronic wave function for a single hydrogen atom
whose proton is fixed at $\mathbf{R}_i$. If the one-electron levels have this form (which is
essentially exact for well-separated protons), then the two-electron wave functions (32.8)
and (32.9) (given by the independent electron approximation) become

$$\psi_s(\mathbf{r}_1, \mathbf{r}_2) = \phi_1(\mathbf{r}_1)\phi_2(\mathbf{r}_2) + \phi_2(\mathbf{r}_1)\phi_1(\mathbf{r}_2) + \phi_1(\mathbf{r}_1)\phi_1(\mathbf{r}_2) + \phi_2(\mathbf{r}_1)\phi_2(\mathbf{r}_2) \qquad (32.12)$$

and

$$\psi_t(\mathbf{r}_1, \mathbf{r}_2) = 2[\phi_2(\mathbf{r}_1)\phi_1(\mathbf{r}_2) - \phi_1(\mathbf{r}_1)\phi_2(\mathbf{r}_2)]. \qquad (32.13)$$

Equation (32.12) gives an excellent approximation to the ground state of the
Schrödinger equation (32.5), in which electron-electron interactions are ignored.
However, it gives a very bad approximation to the original Schrödinger equation (32.3),
in which electron-electron interactions are retained. To see this, note that the first
and second terms in (32.12) are quite different from the third and fourth. In the first
two terms each electron is localized in a hydrogenic orbit in the neighborhood of a
different nucleus. When the two protons are far apart the interaction energy of the
two electrons is small, and the description of the molecule as two slightly perturbed
atoms (implied by the first two terms in (32.12)) is quite good. However, in each of
the last two terms in (32.12) both electrons are localized in hydrogenic orbits about
the same proton. Their interaction energy is therefore considerable no matter how
far apart the protons are. Thus the last two terms in (32.12) describe the hydrogen
molecule as an H$^-$ ion and a bare proton, a highly inaccurate picture when
electron-electron interactions are allowed for.11


10 See Problem 3. If we choose phases so that the $\phi_i$ are real and positive (which can be done for
the hydrogen atom ground state), then the linear combination with the positive sign will have the lower
energy since it has no nodes.

11 This failure of the independent electron approximation to describe accurately the hydrogen
molecule is analogous to the failure of the independent electron approximation treated in our discussion of
the tight-binding method in Chapter 10. The problem does not arise in the case of a filled band (or, in
the molecular analogue, two nearby helium atoms) because a more accurate wave function also must place
two electrons in each localized orbital.

The ground state (32.12) of the independent electron approximation therefore gives
a 50 percent probability of both electrons being together on the same ion. The
independent electron triplet state (32.13) does not suffer from this defect. Consequently,
when we introduce electron-electron interactions into the Hamiltonian, the triplet
(32.13) will surely give a lower mean energy than the singlet (32.12), when the protons
are far enough apart.
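
The 50 percent figure can be checked by simple counting. The sketch below is a minimal illustration, assuming the protons are far enough apart that the overlap of $\phi_1$ and $\phi_2$ can be neglected, so that the two atomic orbitals may be treated as orthonormal basis vectors; a two-electron orbital state is then a 2 x 2 array of coefficients of $\phi_a(\mathbf{r}_1)\phi_b(\mathbf{r}_2)$. The function name `double_occupancy` is ours.

```python
import numpy as np

# Atomic orbitals as orthonormal basis vectors (overlap neglected: an
# assumption valid only for well-separated protons).
phi1 = np.array([1.0, 0.0])
phi2 = np.array([0.0, 1.0])

# Tight-binding one-electron levels (32.11).
psi0 = phi1 + phi2
psi1 = phi1 - phi2


def double_occupancy(coeffs):
    """Probability that both electrons are on the same proton.

    coeffs[a, b] is the coefficient of phi_a(r1) phi_b(r2); the diagonal
    entries are the terms with both electrons in the same atomic orbital.
    """
    weights = np.abs(coeffs) ** 2
    return float(np.trace(weights) / weights.sum())


independent_singlet = np.outer(psi0, psi0)                          # (32.8)
independent_triplet = np.outer(psi0, psi1) - np.outer(psi1, psi0)   # (32.9)
two_atom_singlet = np.outer(phi1, phi2) + np.outer(phi2, phi1)      # first two terms of (32.12)

print(double_occupancy(independent_singlet))   # half the weight is "ionic"
print(double_occupancy(independent_triplet))
print(double_occupancy(two_atom_singlet))
```

Within this counting the independent electron singlet has half its weight in the terms that place both electrons on one proton, while the triplet, and the singlet built from the first two terms of (32.12) alone, have none.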

This does not mean, however, that the true ground state is a triplet. A symmetric
state which never places two electrons on the same proton, and is therefore of much
lower energy than the independent electron ground state, is given by taking only the
first two terms in (32.12):

$$\psi_s(\mathbf{r}_1, \mathbf{r}_2) = \phi_1(\mathbf{r}_1)\phi_2(\mathbf{r}_2) + \phi_2(\mathbf{r}_1)\phi_1(\mathbf{r}_2). \qquad (32.14)$$

The  theory  that  takes  its  approximations  to  the  singlet  and  triplet  ground  states  of 
the  full  Hamiltonian  (32.3)  to  be  proportional  to  (32.14)  and  (32.13)  is  known  as  the 
Heitler-London  approximation.12  Evidently  the  Heitler-London  singlet  state  (32.14) 
is  far  more  accurate  for  widely  separated  protons  than  the  independent  electron 
singlet  state  (32.12).  When  appropriately  generalized,  it  should  be  more  suitable 
for  discussing  magnetic  ions  in  an  insulating  crystal. 

On  the  other  hand,  when  the  protons  are  very  close  together  the  independent 
electron  approximation  (32.8)  is  closer  to  the  true  ground  state  than  the  Heitler- 
London  approximation  (32.14),  as  is  easily  seen  in  the  extreme  case  in  which  the  two 
protons  actually  coincide.  The  independent  electron  approximation  starts  with  two 
one-electron  wave  functions  appropriate  to  a  single  doubly  charged  nucleus,  whereas 
the  Heitler-London  approximation  works  with  one-electron  wave  functions  for  a 
singly  charged  nucleus.  These  are  far  too  extended  in  space  to  form  a  good  starting 
point  for  the  description  of  what  is  now  not  a  hydrogen  molecule,  but  a  helium  atom. 

The  foregoing  analysis  was  intended  primarily  to  emphasize,  through  the  simple 
example  of  a  two-electron  system,  that  one  cannot  apply  the  concepts  of  band  theory, 
based  as  it  is  on  the  independent  electron  approximation,  to  account  for  magnetic 
interactions  in  insulating  crystals.  As  for  the  Heitler-London  method  itself,  it  too  has 
shortcomings,  for  although  it  gives  very  accurate  singlet  and  triplet  energies  for  large 
spatial  separations,13  its  prediction  for  the  very  small  singlet-triplet  energy  splitting 
is  considerably  less  certain  when  the  ions  are  far  apart.  The  method  is  therefore 
quite  treacherous  to  use  uncritically.14  We  nevertheless  give  below  the  form  of  the 


12 In the context of molecular physics, the description based on the independent electron ground
state (32.12) is known as the Hund-Mulliken approximation, or the method of molecular orbitals. Other
terminology is associated with the fact that the Heitler-London approximation to the ground state can be
written as a linear combination of two independent electron approximation two-electron states:

$$\psi_s(\mathbf{r}_1, \mathbf{r}_2) = \psi_0(\mathbf{r}_1)\psi_0(\mathbf{r}_2) - \psi_1(\mathbf{r}_1)\psi_1(\mathbf{r}_2),$$

a simple example of a state of affairs referred to as “configuration mixing.” The Heitler-London states
$\psi_s$ and $\psi_t$ are known as the “bonding” and “anti-bonding” states.

13 Unlike the independent electron approximation.

14 A thorough critique of the Heitler-London method has been given by C. Herring, “Direct Exchange
Between Well Separated Atoms,” in Magnetism, vol. 2B, G. T. Rado and H. Suhl, eds., Academic Press,
New York, 1965.

Heitler-London result for $E_s - E_t$, because it is both a starting point for more refined
treatments and the source of a nomenclature that pervades much of the subject of
magnetism:

The Heitler-London approximation uses the singlet and triplet wave functions
(32.14) and (32.13) to estimate the singlet-triplet splitting as

$$E_s - E_t = \frac{(\psi_s, H\psi_s)}{(\psi_s, \psi_s)} - \frac{(\psi_t, H\psi_t)}{(\psi_t, \psi_t)}, \qquad (32.15)$$

where H is the full Hamiltonian (32.3). In the limit of large spatial separations this
splitting can be shown (Problem 4) to reduce simply to

$$\frac{1}{2}(E_s - E_t) = \int d\mathbf{r}_1\, d\mathbf{r}_2\, [\phi_1(\mathbf{r}_1)\phi_2(\mathbf{r}_2)] \left( \frac{e^2}{|\mathbf{r}_1 - \mathbf{r}_2|} + \frac{e^2}{|\mathbf{R}_1 - \mathbf{R}_2|} - \frac{e^2}{|\mathbf{r}_1 - \mathbf{R}_1|} - \frac{e^2}{|\mathbf{r}_2 - \mathbf{R}_2|} \right) [\phi_2(\mathbf{r}_1)\phi_1(\mathbf{r}_2)]. \qquad (32.16)$$


Because this is a matrix element between two states that differ solely through the
exchange of the coordinates of the two electrons, the singlet-triplet energy difference
is referred to as an exchange splitting or, when viewed as a source of magnetic
interaction, an exchange interaction.15

Since the atomic orbital $\phi_i(\mathbf{r})$ is strongly localized in the neighborhood of $\mathbf{r} = \mathbf{R}_i$,
the factors $\phi_1(\mathbf{r}_1)\phi_2(\mathbf{r}_1)$ and $\phi_1(\mathbf{r}_2)\phi_2(\mathbf{r}_2)$ in the integrand of (32.16) ensure that the
singlet-triplet energy splitting will fall off rapidly with the distance $|\mathbf{R}_1 - \mathbf{R}_2|$ between
protons.
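
To get a feeling for how quickly such products of orbitals centered on different protons die away, one can look at the overlap integral of two hydrogen 1s orbitals, for which there is a standard closed form, $e^{-R/a_0}\,[1 + R/a_0 + \frac{1}{3}(R/a_0)^2]$ (a textbook result we quote here as an assumption; it is not derived in this chapter). The sketch below tabulates the overlap and its square. It illustrates only the falloff of the overlap factor, not the value of the integral (32.16) itself, whose large-separation behavior is a more delicate matter; the function name `overlap_1s` is ours.

```python
import math


def overlap_1s(separation: float) -> float:
    """Overlap integral of two hydrogen 1s orbitals.

    `separation` is the proton-proton distance in units of the Bohr radius a0.
    Uses the standard closed form exp(-R) * (1 + R + R^2 / 3).
    """
    r = separation
    return math.exp(-r) * (1.0 + r + r * r / 3.0)


# Each of the two factors phi_1 * phi_2 in (32.16) carries one power of the
# overlap, so the square is a rough guide to the scale of the falloff.
for separation in (1.0, 2.0, 4.0, 8.0):
    s = overlap_1s(separation)
    print(f"R = {separation:4.1f} a0   overlap = {s:.4f}   overlap^2 = {s * s:.2e}")
```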


THE  SPIN  HAMILTONIAN  AND  THE  HEISENBERG  MODEL 

There  is  a  way  to  express  the  dependence  of  the  spin  of  a  two-electron  system  on  the 
singlet-triplet  energy  splitting,  which,  though  unnecessarily  complicated  in  this  simple 
case, is of fundamental importance in analyzing the energetics of the spin
configurations of real insulating solids. One first notes that when the two protons are far apart,
the ground state describes two independent hydrogen atoms and is therefore fourfold
degenerate (since each electron can have two spin orientations). One next considers
the protons to be brought a bit closer together, so that there is a splitting ($E_s \neq E_t$)
of  the  fourfold  degeneracy  due  to  interactions  between  the  atoms,  which  is,  however, 
small  compared  with  all  other  excitation  energies  of  the  two-electron  system.  Under 
such  conditions  these  four  states  will  play  the  dominant  role  in  determining  many 
important  properties  of  the  molecule,16  and  one  often  simplifies  the  analysis  by 
ignoring  the  higher  states  altogether,  representing  the  molecule  as  a  simple  four-state 
system.  If  we  do  represent  a  general  state  of  the  molecule  as  a  linear  combination  of  the 
four  lowest  states,  it  is  convenient  to  have  an  operator,  known  as  the  spin  Hamiltonian, 


15 One should not, however, be lulled by this nomenclature into forgetting that underlying the
exchange interaction are nothing but electrostatic interaction energies, and the Pauli exclusion principle.

16 For example, the thermal equilibrium properties when $k_B T$ is comparable to $E_s - E_t$, but small
enough that no states other than the four are thermally excited.

whose eigenvalues are the same as those of the original Hamiltonian within the
four-state manifold, and whose eigenfunctions give the spin of the corresponding states.

To construct the spin Hamiltonian for a two-electron system, note that each
individual electron spin operator satisfies $\mathbf{S}_i^2 = \frac{1}{2}(\frac{1}{2} + 1) = \frac{3}{4}$, so that the total spin
$\mathbf{S}$ satisfies

$$\mathbf{S}^2 = (\mathbf{S}_1 + \mathbf{S}_2)^2 = \tfrac{3}{2} + 2\,\mathbf{S}_1 \cdot \mathbf{S}_2. \qquad (32.17)$$

Since $\mathbf{S}^2$ has the eigenvalue $S(S + 1)$ in states of spin S, it follows from (32.17) that
the operator $\mathbf{S}_1 \cdot \mathbf{S}_2$ has eigenvalue $-\frac{3}{4}$ in the singlet (S = 0) state and $+\frac{1}{4}$ in the
triplet (S = 1) states. Consequently the operator

$$\mathcal{H}^{\text{spin}} = \tfrac{1}{4}(E_s + 3E_t) - (E_s - E_t)\,\mathbf{S}_1 \cdot \mathbf{S}_2 \qquad (32.18)$$

has eigenvalue $E_s$ in the singlet state, and $E_t$ in each of the three triplet states, and is
the desired spin Hamiltonian.
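
That (32.18) has the advertised spectrum can be confirmed by writing it as a 4 x 4 matrix. The following is a minimal sketch, assuming units with $\hbar = 1$ and purely illustrative values for $E_s$ and $E_t$ (they are not taken from any calculation in the text); the function names are ours.

```python
import numpy as np

# Spin-1/2 operators (hbar = 1).
sx = np.array([[0, 1], [1, 0]], dtype=complex) / 2
sy = np.array([[0, -1j], [1j, 0]], dtype=complex) / 2
sz = np.array([[1, 0], [0, -1]], dtype=complex) / 2


def s1_dot_s2():
    """S1 . S2 as a 4 x 4 matrix on the two-spin product space."""
    return sum(np.kron(s, s) for s in (sx, sy, sz))


def spin_hamiltonian(e_singlet, e_triplet):
    """The operator (32.18) for given singlet and triplet energies."""
    constant = 0.25 * (e_singlet + 3.0 * e_triplet) * np.eye(4)
    return constant - (e_singlet - e_triplet) * s1_dot_s2()


# Illustrative energies only (arbitrary units), with the singlet lower.
E_S, E_T = -1.0, -0.7

dot_levels = np.linalg.eigvalsh(s1_dot_s2())
spin_levels = np.linalg.eigvalsh(spin_hamiltonian(E_S, E_T))
print(dot_levels)    # one level at -3/4, three at +1/4
print(spin_levels)   # one level at E_S, three at E_T
```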

By redefining the zero of energy we may omit the constant $(E_s + 3E_t)/4$ common
to all four states, and write the spin Hamiltonian as

$$\mathcal{H}^{\text{spin}} = -J\,\mathbf{S}_1 \cdot \mathbf{S}_2, \qquad J = E_s - E_t. \qquad (32.19)$$

Since $\mathcal{H}^{\text{spin}}$ is the scalar product of the vector spin operators $\mathbf{S}_1$ and $\mathbf{S}_2$, it will favor
parallel spins if J is positive and antiparallel if J is negative.17 Note that in contrast
to the magnetic dipolar interaction (32.1), the coupling in the spin Hamiltonian
depends only on the relative orientation of the two spins, but not on their directions
with respect to $\mathbf{R}_1 - \mathbf{R}_2$. This is a general consequence of the spin independence of
the original Hamiltonian and (one should note) holds without any assumption about
its spatial symmetry. One must include terms that break rotational symmetry in spin
space (such as dipolar interactions or spin-orbit coupling) in the original Hamiltonian,
in order to produce a spin Hamiltonian with anisotropic coupling.18

When N is large, rather than merely reexpressing some known results (as when
N = 2), the spin Hamiltonian contains in highly compact form some exceedingly
complex information about the low-lying levels.19 When N ions of spin S are widely
separated,20 the ground state will be $(2S + 1)^N$-fold degenerate. The spin Hamiltonian
describes the splitting of this vastly degenerate ground state when the ions are
somewhat closer together, but still far enough apart that the splittings are small compared
with any other excitation energies. One can (in a variety of ways) construct an operator
function of the $\mathbf{S}_i$ whose eigenvalues give the split levels. What is remarkable,
however, is that for many cases of interest the form of the spin Hamiltonian is simply that


17 Since J is positive or negative depending on whether $E_t$ or $E_s$ is lower, this simply restates the fact
that the spins are parallel in the triplet state and antiparallel in the singlet.

18 Such anisotropy is of critical importance in understanding the existence of easy and hard directions
of magnetization and plays a role in the theory of domain formation. (See page 720.)

19 In general, this information is not easily extracted, even from a spin Hamiltonian. In contrast to
the case N = 2 one does not know the low-lying levels from the start, and finding a spin Hamiltonian is
only half the problem. There remains the highly nontrivial problem of finding the eigenvalues of that spin
Hamiltonian. (See, for example, pages 701-709.)

20 For simplicity we assume that for each ion J = S (i.e., L = 0). This restriction is not essential to
the development of a spin Hamiltonian.

for the two-spin case, summed over all pairs of ions:

$$\mathcal{H}^{\text{spin}} = -\sum J_{ij}\, \mathbf{S}_i \cdot \mathbf{S}_j. \qquad (32.20)$$

We shall not go into the question of when (32.20) can be justified, which is quite a
complex matter.21 However, one should note:

1. For only products of pairs of spin operators to appear in (32.20) it is necessary
that  all  magnetic  ions  be  far  enough  apart  that  the  overlap  of  their  electronic 
wave  functions  is  very  small. 

2.  When  the  angular  momentum  of  each  ion  contains  an  orbital  as  well  as  a  spin 
part,  the  coupling  in  the  spin  Hamiltonian  may  depend  on  the  absolute  as  well  as 
the  relative  spin  orientations. 

The  spin  Hamiltonian  (32.20)  is  known  as  the  Heisenberg22  Hamiltonian,  and  the 
Jij  are  known  as  the  exchange  coupling  constants  (or  parameters  or  coefficients). 
Extracting  information  even  from  the  Heisenberg  Hamiltonian  is,  in  general,  so 
difficult  a  task,  that  it  by  itself  is  taken  as  the  starting  point  for  many  quite  profound 
investigations  of  magnetism  in  solids.  One  must  remember,  however,  that  much  subtle 
physics  and  quite  complex  approximations  must  be  delved  into  before  one  can  even 
arrive  at  a  Heisenberg  Hamiltonian. 
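
For a handful of spins the Heisenberg Hamiltonian can simply be written out as a matrix and diagonalized, which also makes plain why this brute-force route is closed for large N: the matrix has dimension $2^N$ for spin 1/2. The sketch below is an illustration only, assuming spin-1/2 ions, units with $\hbar = 1$, and a sum in (32.20) that counts each pair of ions once (the text does not fix this convention here); the function names and the four-ion ring are our own choices.

```python
from functools import reduce

import numpy as np

# Spin-1/2 operators (hbar = 1).
sx = np.array([[0, 1], [1, 0]], dtype=complex) / 2
sy = np.array([[0, -1j], [1j, 0]], dtype=complex) / 2
sz = np.array([[1, 0], [0, -1]], dtype=complex) / 2
I2 = np.eye(2, dtype=complex)


def site_operator(op, site, n_sites):
    """Operator `op` acting on one ion, identity on all the others."""
    factors = [op if k == site else I2 for k in range(n_sites)]
    return reduce(np.kron, factors)


def heisenberg(couplings):
    """H = -sum over pairs i < j of couplings[i][j] * S_i . S_j.

    Assumption: each pair of ions is counted once.
    """
    n = len(couplings)
    hamiltonian = np.zeros((2**n, 2**n), dtype=complex)
    for i in range(n):
        for j in range(i + 1, n):
            if couplings[i][j] == 0:
                continue
            for s in (sx, sy, sz):
                hamiltonian -= couplings[i][j] * (
                    site_operator(s, i, n) @ site_operator(s, j, n)
                )
    return hamiltonian


# N = 2 with J = -1 reduces to (32.19): a singlet below three triplet states.
pair_levels = np.linalg.eigvalsh(heisenberg([[0, -1], [0, 0]]))

# Four ions on a ring with nearest-neighbor coupling J = -1 (antiparallel favored).
ring = [[0, -1, 0, -1],
        [0, 0, -1, 0],
        [0, 0, 0, -1],
        [0, 0, 0, 0]]
ring_levels = np.linalg.eigvalsh(heisenberg(ring))
print(pair_levels)
print(len(ring_levels), ring_levels[0])
```

Even for the four-ion ring the lowest level is not simply the energy of an alternating up-down arrangement of the spins, a first hint of the difficulties alluded to above.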

DIRECT EXCHANGE, SUPEREXCHANGE,
INDIRECT EXCHANGE, AND ITINERANT EXCHANGE

The  magnetic  interaction  we  have  just  described  is  known  as  direct  exchange,  because 
it  arises  from  the  direct  Coulomb  interaction  among  electrons  from  the  two  ions.  It 
often  happens  that  two  magnetic  ions  are  separated  by  a  nonmagnetic  ion  (i.e.,  one 
with  all  electronic  shells  closed).  It  is  then  possible  for  the  magnetic  ions  to  have  a 
magnetic interaction mediated by the electrons in their common nonmagnetic
neighbors, which is more important than their direct exchange interaction. This type of
magnetic  interaction  is  called  superexchange.  (See  Figure  32.2.) 

Yet another source of magnetic interaction can occur between electrons in the
partially filled f-shells in the rare earth metals. In addition to their direct exchange
coupling, the f-electrons are coupled through their interactions with the conduction
electrons. This mechanism (in a sense the metallic analogue of superexchange in
insulators) is known as indirect exchange. It can be stronger than the direct exchange
coupling, since the f-shells generally overlap very little.

There  are  also  important  exchange  interactions  in  metals  among  the  conduction 
electrons themselves, often referred to as itinerant exchange.23  To emphasize the great
generality  of  exchange  interactions,  we  give  below  a  brief  discussion  of  itinerant 
exchange  in  the  case  farthest  away  from  the  well-localized  electrons  for  which  the 
Heitler-London  theory  of  direct  exchange  was  devised :  the  free  electron  gas. 


21  A  very  thorough  discussion  is  given  by  C.  Herring  (see  footnote  14). 

22  In  the  older  literature  it  is  known  as  the  Heisenberg-Dirac  Hamiltonian. 

23  A discussion of itinerant exchange is given by C. Herring in Magnetism, vol. 4, G. T. Rado and
H. Suhl, eds., Academic Press, New York, 1966.
Figure 32.2

Schematic illustrations of (a) direct exchange, in which the magnetic ions interact because
their charge distributions overlap; (b) superexchange, in which magnetic ions with
nonoverlapping charge distributions interact because both have overlap with the same
nonmagnetic ion; and (c) indirect exchange, in which in the absence of overlap a magnetic
interaction is mediated by interactions with the conduction electrons.
MAGNETIC INTERACTIONS IN THE FREE ELECTRON GAS


The  theory  of  magnetism  in  a  free  electron  gas  is  hopelessly  inadequate  as  an  approach 
to  the  problem  of  magnetism  in  real  metals.  However,  the  subject  is  not  without 
interest  because  (a)  it  provides  another  simple  model  in  which  magnetic  structure  is 
implied  in  the  absence  of  explicitly  spin-dependent  interactions,  (b)  its  complexity 
indicates  the  magnitude  of  the  problem  one  faces  in  real  metals,  and  (c)  a  correct 
(currently  nonexistent)  general  theory  of  magnetism  in  metals  will  undoubtedly  have 
to  find  a  way  of  dealing  simultaneously  with  both  the  localized  (as  described  above) 
and  itinerant  (as  described  below)  aspects  of  exchange. 

F.  Bloch24  first  pointed  out  that  the  Hartree-Fock  approximation  can  predict 
ferromagnetism  in  a  gas  of  electrons  interacting  only  through  their  mutual  Coulomb 
interactions. Within this approximation we showed in Chapter 17 that if every
one-electron level with wave vector less than $k_F$ is occupied by two electrons of opposite
spin, then the ground-state energy of N free electrons is (Eq. (17.23)):


$$E = N\left[\frac{3}{5}(k_F a_0)^2 - \frac{3}{2\pi}(k_F a_0)\right] \text{Ry}. \tag{32.21}$$


The  first  term  in  (32.21)  is  the  total  kinetic  energy,  and  the  second,  known  as  the 
exchange  energy,  is  the  Hartree-Fock  approximation  to  the  effect  of  electron-electron 
Coulomb  interactions. 

In  deriving  (32.21),  however,  we  assumed  that  every  occupied  one-electron  orbital 
level  would  be  occupied  by  two  electrons  of  opposite  spin.  A  more  general  possibility, 
leading to a net spin imbalance, would be to fill each one-electron level with k less
than some $k_\uparrow$ with spin-up electrons, and each with $k < k_\downarrow$ with spin-down electrons.


24  Z.  Physik  57,  545  (1929). 
Since (see Eq. (17.15)) the exchange interaction in Hartree-Fock theory is only between
electrons of the same spin, we will have an equation of the form (32.21) for each spin
population:

$$E_\uparrow = N_\uparrow\left[\frac{3}{5}(k_\uparrow a_0)^2 - \frac{3}{2\pi}(k_\uparrow a_0)\right] \text{Ry}, \qquad E_\downarrow = N_\downarrow\left[\frac{3}{5}(k_\downarrow a_0)^2 - \frac{3}{2\pi}(k_\downarrow a_0)\right] \text{Ry}, \tag{32.22}$$


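where the total energy and total number of electrons are

$$E = E_\uparrow + E_\downarrow, \qquad \frac{N}{V} = \frac{N_\uparrow}{V} + \frac{N_\downarrow}{V} = \frac{k_\uparrow^3}{6\pi^2} + \frac{k_\downarrow^3}{6\pi^2} = \frac{k_F^3}{3\pi^2}. \tag{32.23}$$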


Equation (32.21) is the form E takes if $N_\uparrow = N_\downarrow = N/2$, but we may now ask whether
one cannot get a lower energy by dropping this assumption. If so, then the ground
state will have a nonvanishing magnetization density,


$$M = -g\mu_B \frac{N_\uparrow - N_\downarrow}{V}, \tag{32.24}$$


and the electron gas will be ferromagnetic.

For simplicity, we consider only the opposite extreme,25 taking $N_\uparrow = N$ and $N_\downarrow = 0$.
Then E will be $E_\uparrow$, and $k_\uparrow$ will be $2^{1/3} k_F$ (according to (32.23)), and therefore


$$E = N\left[\frac{3}{5}\, 2^{2/3} (k_F a_0)^2 - \frac{3}{2\pi}\, 2^{1/3} (k_F a_0)\right] \text{Ry}. \tag{32.25}$$


Compared with the nonmagnetic case (32.21), the positive kinetic energy in (32.25)
is larger by $2^{2/3}$, and the magnitude of the negative exchange energy is larger by $2^{1/3}$.
Hence the energy of the fully magnetized state is lower than that of the unmagnetized
state when exchange energy dominates kinetic energy. This happens for small $k_F$, i.e.,
at low densities. As the density decreases, the transition from nonmagnetic to fully
magnetic ground states occurs when the energies (32.21) and (32.25) become equal,
i.e., when


$$k_F a_0 = \frac{5}{2\pi}\, \frac{1}{2^{1/3} + 1}, \tag{32.26}$$


or  (see  Eq.  (2.22))  when 


$$\frac{r_s}{a_0} = \frac{2\pi}{5}\left(\frac{9\pi}{4}\right)^{1/3} (2^{1/3} + 1) = 5.45. \tag{32.27}$$


Cesium is the only metallic element with so low a conduction electron density that
$r_s/a_0$ exceeds this value, but there are metallic compounds26 with $r_s/a_0 > 5.45$. In none


25  It can be shown that values of $N_\uparrow$ and $N_\downarrow$ between the extremes $N_\uparrow = N_\downarrow$ and $N_\uparrow$ (or $N_\downarrow$) $= N$,
$N_\downarrow$ (or $N_\uparrow$) $= 0$, give a higher energy than one or the other limiting case.

26  For example, the metal amines. See J. J. Lagowski and M. J. Sienko, eds., Metal Ammonia Solutions,
Butterworth, London, 1970.
of these cases, however, are the materials ferromagnetic, even though their band
structures  are  reasonably  well  described  by  the  free  electron  model. 
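The crossover estimate (32.26)-(32.27) is easy to check numerically. The sketch below evaluates the Hartree-Fock energy per electron for an arbitrary spin polarization, using the same construction that leads to (32.22) and (32.23). The polarization variable `zeta` and all function names are introduced here for illustration and do not appear in the text.

```python
import numpy as np


def energy_per_electron(x, zeta):
    """Hartree-Fock energy per electron, in rydbergs.

    x    : k_F * a_0 for the unpolarized gas at the same density
    zeta : (N_up - N_down) / N, so zeta = 0 is nonmagnetic and
           zeta = 1 is fully magnetized (notation assumed here)
    """
    # Each spin population obeys (32.21) with k_up = k_F (1 + zeta)^(1/3)
    # and k_down = k_F (1 - zeta)^(1/3), weighted by N_up/N and N_down/N.
    kinetic = 0.6 * x**2 * 0.5 * ((1 + zeta) ** (5 / 3) + (1 - zeta) ** (5 / 3))
    exchange = 3 / (2 * np.pi) * x * 0.5 * ((1 + zeta) ** (4 / 3) + (1 - zeta) ** (4 / 3))
    return kinetic - exchange


def critical_kf_a0():
    """Value of k_F a_0 at which the two extreme states are degenerate, Eq. (32.26)."""
    return 5 / (2 * np.pi) / (2 ** (1 / 3) + 1)


def rs_over_a0(x):
    """Convert k_F a_0 to r_s / a_0 using k_F r_s = (9 pi / 4)^(1/3)."""
    return (9 * np.pi / 4) ** (1 / 3) / x


x_c = critical_kf_a0()
print("k_F a_0 at crossover:", x_c)
print("r_s / a_0 at crossover:", rs_over_a0(x_c))  # close to the 5.45 of (32.27)
print("E(zeta=0) - E(zeta=1):", energy_per_electron(x_c, 0.0) - energy_per_electron(x_c, 1.0))
```

Scanning `zeta` between 0 and 1 at the crossover density gives energies above the common value of the two extremes, in line with the statement of footnote 25.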

The simple criterion (32.26) for low-density ferromagnetism is invalidated by further
theoretical considerations as well:

1.  Even  within  the  Hartree-Fock  approximation  there  are  still  more  complicated 
choices  of  one-electron  levels  that  lead  to  a  lower  energy  than  either  the  fully 
magnetized  or  nonmagnetic  solutions.  These  solutions,  called  spin  density  waves, 
were  discovered  by  Overhauser27  and  give  an  antiferromagnetic  ground  state, 
at  densities  in  the  neighborhood  of  that  given  by  (32.27). 

2.  The  Hartree-Fock  approximation  is  improved  if  one  lets  the  electrons  screen 
the  exchange  interaction  (page  344),  and  thereby  reduce  its  spatial  range.  This 
improvement  alters  the  Hartree-Fock  prediction  quite  drastically.  For  example, 
in the extreme short-range case of a delta function potential, it predicts ferromagnetism
at high densities and a nonmagnetic state at low densities.

3.  At  very  low  densities  the  true  ground  state  of  the  free  electron  gas  bears  no 
resemblance  to  any  of  the  forms  described  above.  In  the  low  density  limit  the 
free  electron  gas  can  be  shown  to  crystallize,  taking  on  a  configuration  (the 
Wigner  crystal)  whose  description  is  quite  outside  the  reach  of  the  independent 
electron  approximation.28 

Thus  the  best  Hartree-Fock  ground  state  is  by  no  means  obvious  and,  worse  still, 
simple  attempts  to  improve  Hartree-Fock  theory  can  drastically  alter  its  predictions. 
It  is  currently  thought  that  the  free  electron  gas  is  probably  not  ferromagnetic  at  any 
density,  but  a  rigorous  proof  of  this  is  lacking.  Certainly  ferromagnetism  has  only 
been observed experimentally in metals whose free ions contain partially filled d- or
f-shells, a state of affairs hopelessly beyond the range of competence of the free
electron  model.  To  account  for  magnetic  ordering  in  metals,  itinerant  exchange 
interactions  must  be  combined  with  specific  features  of  the  band  structure29  and/or 
the  kinds  of  atomic  considerations  leading  to  Hund’s  rules. 


27  A.  W.  Overhauser,  Phys.  Rev.  Lett.  4,  462  (1960);  Phys.  Rev.  128,  1437  (1962);  see  also  C.  Herring 
(footnote  23).  Introducing  screening  eliminates  the  spin  density  wave.  However,  certain  special  features  of 
the  band  structure  of  chromium  make  it  possible  to  resurrect  the  spin  density  wave  by  introducing  band 
structure  into  the  theory  in  a  fairly  simple  way,  and  it  is  currently  believed  that  such  a  theory  accounts  for 
the  antiferromagnetism  of  chromium.  See,  for  example,  T.  M.  Rice,  Phys.  Rev.  B2,  3619  (1970)  and  references 
cited therein.

28  E. Wigner, Trans. Farad. Soc. 34, 678 (1938).

29  A widely held view of the ferromagnetism of nickel, for example, simply combines the free electron
picture of itinerant ferromagnetism with the band theory. Thus the energy bands of nickel are calculated
in the usual way except that one allows for a self-consistent exchange field (frequently taken to be simply a
constant) that can differ for electrons of opposite spins when the two spin populations differ. By suitably
choosing the exchange field one can construct a ground state for nickel (which has a total of ten electrons
in 3d and 4s levels in the atomic state) in which one d-band is filled (five electrons per atom) with spin-up
electrons. The second d-band contains spin-down electrons, but is displaced (relative to the first) upward
in energy across the Fermi level, so that it is somewhat short of being filled (containing only 4.4 electrons
per atom). The missing 0.6 electron per atom resides in a free electron band, with randomly oriented spins.
Since the population of the spin-up band exceeds that of the spin-down band by 0.6 electron per atom, the
solid has a net magnetization. See E. C. Stoner, Rept. Prog. Phys. 11, 43 (1947) for a review of early work
along these lines, and C. Herring (footnote 23) for a more up-to-date survey.
THE HUBBARD MODEL

J.  Hubbard30  has  proposed  a  highly  oversimplified  model  that  attempts  to  come  to 
grips  with  these  problems,  containing  the  bare  minimum  of  features  necessary  to 
yield  both  bandlike  and  localized  behavior  in  suitable  limits.  In  the  Hubbard  model 
the  vast  set  of  bound  and  continuum  electron  levels  of  each  ion  is  reduced  to  a  single 
localized  orbital  level.  The  states  of  the  model  are  given  by  specifying  the  four  possible 
configurations  of  each  ion  (its  level  can  either  be  empty,  contain  one  electron  with 
either  of  two  spins,  or  two  electrons  of  opposite  spin).  The  Hamiltonian  for  the 
Hubbard  model  contains  two  types  of  terms:  (a)  a  term  diagonal  in  these  states,  that 
is just a positive energy U times the number of doubly occupied ionic levels (plus an
unimportant energy $\mathcal{E}$ times the number of electrons); and (b) a term off-diagonal in
these  states  that  has  nonvanishing  matrix  elements  t  between  just  those  pairs  of  states 
that  differ  only  by  a  single  electron  having  been  moved  (without  change  in  spin)  from 
a  given  ion  to  one  of  its  neighbors.  The  first  set  of  terms,  in  the  absence  of  the  second, 
would  favor  local  magnetic  moments,  since  it  would  suppress  the  possibility  of  a 
second  electron  (with  oppositely  directed  spin)  at  singly  occupied  sites.  The  second 
set  of  terms  in  the  absence  of  the  first,  can  be  shown  to  lead  to  a  conventional  band 
spectrum and one-electron Bloch levels in which each electron is distributed throughout
the entire crystal. When both sets of terms are present, even this simple model has
proved  too  difficult  for  exact  analysis,  although  much  interesting  information  has 
been  extracted  in  special  cases.  If,  for  example,  the  total  number  of  electrons  is  equal 
to the total number of sites, then in the limit of negligible intrasite repulsion ($t \gg U$)
one has an ordinary half-filled metallic band. In the opposite limit ($U \gg t$), however,
one can derive an antiferromagnetic Heisenberg spin Hamiltonian (having an exchange
constant $|J| = 4t^2/U$) to describe the low-lying excitations. No one, however,
has  given  a  rigorous  solution  to  how  the  model  changes  from  a  nonmagnetic  metal  to 
an  antiferromagnetic  insulator  as  t/U  is  varied. 
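The origin of the exchange constant $4t^2/U$ can be seen in the smallest possible case, two sites with two electrons (the situation treated in Problem 5). The sketch below is an illustration under stated assumptions rather than a derivation for the lattice: it takes the singlet-sector matrix quoted in Problem 5, Eq. (32.38), uses the fact that hopping without spin flip cannot lower the energy of two parallel spins on different sites (so the triplet stays at twice the single-site energy), and compares the resulting singlet-triplet gap with $4t^2/U$. Function names and parameter values are hypothetical.

```python
import numpy as np


def singlet_block(eps, t, U):
    """Two-site, two-electron singlet Hamiltonian in the basis of Eq. (32.35)."""
    r2t = np.sqrt(2.0) * t
    return np.array([
        [2 * eps, -r2t, -r2t],
        [-r2t, 2 * eps + U, 0.0],
        [-r2t, 0.0, 2 * eps + U],
    ])


def singlet_triplet_gap(t, U, eps=0.0):
    """Return E_triplet - E_singlet for the two-site model."""
    e_singlet = np.linalg.eigvalsh(singlet_block(eps, t, U))[0]
    # Parallel spins on different sites cannot hop onto the same level
    # (exclusion principle), so the triplet energy is unaffected by t.
    e_triplet = 2 * eps
    return e_triplet - e_singlet


t = 1.0  # illustrative units (assumption)
for U in (5.0, 20.0, 100.0):
    print(U, singlet_triplet_gap(t, U), 4 * t**2 / U)
```

The singlet always lies lower, which corresponds to antiferromagnetic coupling, and the gap approaches $4t^2/U$ as $U/t$ grows.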


LOCALIZED  MOMENTS  IN  ALLOYS 

We  have  pointed  out  the  perils  in  treating  magnetism  in  conductors  from  a  purely 
itinerant  point  of  view.  On  the  other  hand,  in  metals  for  which  the  magnetic  ions  have 
a  partially  filled  d-shell  in  the  free  atom,  one’s  natural  inclination  might  be  to  go  to 
the  other  extreme,  treating  the  d-electrons  with  the  same  techniques  used  in  magnetic 
insulators — i.e.,  with  the  concept  of  direct  exchange  (supplemented  by  the  indirect 
exchange  that  can  also  arise  in  metals).31  This  approach  is  also  perilous.  An  ion  that 
in  isolation  has  a  magnetic  electronic  shell  may  retain  only  a  fraction  of  its  magnetic 
moment,  or  none  at  all,  when  placed  in  a  metallic  environment.  This  is  very  well 
illustrated  by  the  properties  of  dilute  magnetic  alloys. 

When  small  amounts  of  transition  metal  elements  are  dissolved  in  a  nonmagnetic 
(often  free  electronlike)  metal,  the  resulting  alloy  may  or  may  not  display  a  localized 


30  J. Hubbard, Proc. Roy. Soc. A276, 238 (1963); A277, 237 (1964); A281, 401 (1964). The model is
applied  to  the  hydrogen  molecule  in  Problem  5. 

31  With the f-shells in rare earth metals this is probably a reasonable procedure.
moment (Table 32.1).32  The moment of the free ion is determined by Hund’s rules
(pages  650-652),  which  in  turn  are  based  on  considerations  of  the  intraionic  Coulomb 
(and,  to  a  lesser  degree,  spin-orbit)  interactions.  A  theory  of  localized  moments  in 
dilute  magnetic  alloys  must  determine  how  these  considerations  are  modified  when 
the  ion  is  not  free,  but  embedded  in  a  metal.33 


Table 32.1

PRESENCE OR ABSENCE OF LOCALIZED MOMENTS WHEN TRANSITION METAL IMPURITIES ARE DISSOLVED IN NONMAGNETIC HOSTS (a)

                        HOST
IMPURITY      Au      Cu      Ag      Al

Ti            No      -       -       No
V             ?       -       -       No
Cr            Yes     Yes     Yes     No
Mn            Yes     Yes     Yes     ?
Fe            Yes     Yes     -       No
Co            ?       ?       -       No
Ni            No      No      -       No

(a) Presence is indicated by a “Yes” entry, absence by “No.” A question
mark indicates that the situation is uncertain. For some combinations
of host and impurity there are metallurgical difficulties in achieving
reproducible dilute alloys, due chiefly to problems of insolubility. These
account for the majority of blank entries (shown as "-").

Source: A. J. Heeger, Solid State Physics 23, F. Seitz and D. Turnbull, eds.,
Academic Press, New York, 1969.


Even  if  we  were  to  ignore  all  interactions  between  the  electrons  on  the  magnetic 
ion, and the electrons and ions of the host metal, there would still be a simple mechanism
by which the net moment of the ion could be altered. Depending on the position
of  the  levels  of  the  ion  relative  to  the  metallic  Fermi  level,  electrons  might  leave  the 
ion  for  the  conduction  band  of  the  host,  or  drop  from  the  conduction  band  into 
lower-lying  ionic  levels,  thereby  altering  or,  in  some  cases,  even  eliminating  the 
moment  of  the  ion.  In  addition  to  this,  because  levels  of  the  ion  are  degenerate  with 
the  continuum  of  conduction  band  levels  in  the  host,  there  will  be  a  mixing  of  levels, 
in  which  the  ionic  ones  become  spatially  less  localized,  while  nearby  conduction  band 
levels  have  their  charge  distributions  altered  in  the  neighborhood  of  the  magnetic 


32  The  criterion  for  a  localized  moment  is  a  term  in  the  magnetic  susceptibility  inversely  proportional 
to  the  temperature,  with  a  coefficient  proportional  to  the  density  of  magnetic  ions  as  predicted  by  Curie’s 
law,  Eq.  (31.47). 

33  In Chapter 31 we considered the analogous problem arising when the ion was embedded in an
insulator,  and  found  that  effects  of  the  crystal  field  could  easily  be  more  important  than  the  Hund’s  rules 
couplings,  with  a  corresponding  alteration  in  the  net  magnetic  moment. 
ion. This, in turn, will radically alter the intra-ionic energetics that determine the net
spin  configuration  of  those  electrons  that  are  localized  near  the  ion.  For  example, 
as  the  ionic  levels  become  less  and  less  localized,  the  intra-ionic  Coulomb  repulsion 
will  become  of  progressively  less  importance. 

In  short,  the  problem  is  one  of  considerable  complexity.  Some  insight  has  been 
provided  by  a  model  due  to  P.  W.  Anderson,34  in  which  all  the  levels  of  the  magnetic 
ion  are  replaced  by  a  single  localized  level  (as  the  ions  are  treated  in  the  Hubbard 
model),  and  the  coupling  between  localized  and  band  levels  is  reduced  to  a  bare 
minimum.  It  is  a  measure  of  the  complexity  of  problems  in  which  both  localized  and 
band  features  play  an  important  role  that  even  the  highly  oversimplified  Anderson 
model  has  failed  to  yield  an  exact  solution,  in  spite  of  its  having  been  subject  to  an 
onslaught  of  theoretical  analysis. 

THE  KONDO  THEORY  OF  THE  RESISTANCE  MINIMUM 

The  existence  of  localized  moments  in  dilute  alloys  that  couple  to  the  conduction 
electrons  has  important  consequences  for  the  electrical  conductivity.  The  magnetic 
impurities  act  as  scattering  centers,  and  if  they  are  the  predominant  type  of  impurity 
or  lattice  imperfection,  then  at  low  enough  temperatures  the  scattering  they  cause 
will  be  the  primary  source  of  electrical  resistance.35  In  Chapter  16  we  found  that 
nonmagnetic  scatterers  lead  to  a  temperature  independent  term  toward  which  the 
resistivity  monotonically  drops  with  decreasing  temperature  (the  so-called  residual 
resistivity).  In  magnetic  alloys,  however,  it  has  been  known  since  193036  that  instead 
of  dropping  monotonically  the  resistivity  has  a  rather  shallow  minimum  occurring 
at a low (O(10 K)) temperature that depends weakly on the concentration of magnetic
impurities  (Figure  32.3). 

It  was  not  until  1963  that  this  minimum  was  shown  by  J.  Kondo37  to  arise  from 
some  unexpected  features  of  the  scattering  of  conduction  electrons  that  arise  only 
when  the  scattering  center  has  a  magnetic  moment.  In  such  a  case  the  exchange 
interaction  between  the  conduction  electrons  and  the  local  moment  leads  to  scattering 
events  in  which  the  electronic  spin  is  flipped  (with  a  compensating  change  of  spin  on 
the  local  moment).  Prior  to  Kondo’s  analysis,  this  scattering  was  treated  only  to 
leading  order  in  perturbation  theory  and  was  found  not  to  differ  qualitatively  from 
nonmagnetic  scattering,  such  as  we  described  in  Chapter  16.  Kondo  discovered, 
however,  that  in  all  higher  orders  of  perturbation  theory  the  magnetic  scattering 
cross  section  is  divergent,  yielding  an  infinite  resistivity. 

The  divergence  depends  critically  on  there  being  a  sharp  cutoff  in  the  conduction 
electron  wave  vector  distribution,  as  there  is  at  T  =  0.  Subsequent  analysis  by  Kondo 
and  others  has  shown  that  the  thermal  rounding  of  the  electron  distribution  removes 


34  P. W. Anderson, Phys. Rev. 124, 41 (1961). See also A. J. Heeger, Solid State Physics, vol. 23, F. Seitz
and D. Turnbull, eds., Academic Press, New York, 1969, p. 293.

35  Recall that the contribution due to phonon scattering decreases as $T^5$.

36  W. Meissner and B. Voigt, Ann. Phys. 7, 761, 892 (1930).

37  J. Kondo, Prog. Theoret. Phys. 32, 37 (1964); Solid State Physics, vol. 23, F. Seitz and D. Turnbull,
eds., Academic Press, New York, 1969, p. 183.
Figure 32.3

The resistance minimum for various dilute alloys of iron in copper. ($R_0$ is the resistivity at 0°C.)
The position of the minimum depends on the concentration of iron. (From J. P. Franck et al.,
Proc. Roy. Soc. A263, 494 (1961).)

the  divergence,  yielding  instead  a  term  in  the  impurity  contribution  to  the  resistivity 
that  increases  with  decreasing  temperature.  It  is  the  balancing  of  this  term  against  the 
phonon  contribution,  which  decreases  as  the  temperature  falls,  that  results  in  the 
resistance  minimum. 
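The balance just described can be made concrete with a schematic model. The sketch below assumes a resistivity of the form $\rho(T) = \rho_0 + aT^5 - c\,b \ln T$, where the logarithmic form of the impurity term is the standard result associated with Kondo's calculation but is not stated in this passage, and where $\rho_0$, a, b, the impurity concentration c, and all numerical values are purely illustrative assumptions.

```python
import numpy as np


def resistivity(T, rho0, a, b, c):
    """Schematic resistivity: residual + phonon (T^5) + spin-flip (-c b ln T) terms."""
    return rho0 + a * T**5 - c * b * np.log(T)


def t_min(a, b, c):
    """Temperature of the minimum, from d(rho)/dT = 5 a T^4 - c b / T = 0."""
    return (c * b / (5.0 * a)) ** 0.2


# Illustrative parameters only (assumptions, arbitrary units).
a, b = 1e-7, 1.0
for c in (0.01, 0.02, 0.04):
    print(c, t_min(a, b, c))
```

In this model the position of the minimum varies only as the one-fifth power of the impurity concentration, which is one way to understand why it depends weakly on concentration.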

The  foregoing  has  offered  little  more  than  a  glimpse  into  the  difficult,  subtle,  often 
fascinating  questions  that  one  encounters  in  almost  any  attempt  to  account  for 
magnetic  interactions.  It  is,  however,  only  half  the  problem.  Even  if  one  is  given  a 
suitably  simplified  model  of  the  important  magnetic  interactions  (for  example,  the 
Heisenberg Hamiltonian (32.20)), one must still face the task of extracting information
of  physical  interest  from  the  model.  In  general  this  turns  out  to  be  as  difficult,  subtle, 
and  fascinating  as  the  problem  of  deriving  the  model  in  the  first  place.  A  view  of 
this  aspect  of  magnetism  is  given  in  Chapter  33. 

PROBLEMS 


1.  Symmetry  of  Two-Electron  Orbital  Wave  Functions 

Prove that the stationary states of the orbital Schrödinger equation for a two-electron system
with a symmetric potential (i.e., Eq. (32.3) with $V(\mathbf{r}_1, \mathbf{r}_2) = V(\mathbf{r}_2, \mathbf{r}_1)$) can be chosen to be either
symmetric or antisymmetric. (The proof is quite analogous to the first proof of Bloch’s theorem
in Chapter 8.)


2.  Proof  That  the  Two-Electron  Ground  State  of  a  Spin-Independent  Hamiltonian 
Is  a  Singlet 

(a)  The mean energy of a two-electron system with Hamiltonian (32.3) in the state $\psi$ can
be written (after an integration by parts in the kinetic energy term) in the form:

$$E = \int d\mathbf{r}_1\, d\mathbf{r}_2 \left[\frac{\hbar^2}{2m}\left\{|\nabla_1 \psi|^2 + |\nabla_2 \psi|^2\right\} + V(\mathbf{r}_1, \mathbf{r}_2)\,|\psi|^2\right]. \tag{32.28}$$

Show that the lowest value (32.28) assumes over all normalized antisymmetric differentiable wave
functions $\psi$ that vanish at infinity is the triplet ground-state energy $E_t$, and that when symmetric
functions are used the lowest value is the singlet ground-state energy $E_s$.

(b)  Using (i) the result of (a), (ii) the fact that the triplet ground state $\psi_t$ can be taken to be
real when V is real, and (iii) the fact that $|\psi_t|$ is symmetric, deduce that $E_s \le E_t$.


3.  Symmetry of One-Electron Orbital Wave Functions for the Hydrogen Molecule
Prove  (in  much  the  same  way  as  in  Problem  1)  that  if  a  one-electron  potential  has  a  plane  of 
mirror  symmetry,  then  the  stationary  one-electron  levels  can  be  chosen  so  that  they  are  either 
invariant  or  change  sign  under  reflection  in  that  plane.  (This  establishes  that  Eq.  (32.11)  gives 
the  correct  linear  combinations  of  atomic  orbitals  for  the  two-proton  potential.) 


4.  Heitler-London  Singlet-Triplet  Splitting 

Derive the Heitler-London estimate (32.16) for the difference in singlet and triplet ground-state
energies for the hydrogen molecule. (In showing that (32.15) reduces to (32.16) for well-separated
protons, it is essential to take into account the following points: (a) The one-electron wave
functions $\phi_1$ and $\phi_2$ out of which (32.13) and (32.14) are constructed are exact ground-state wave
functions for a single electron in a hydrogen atom at $\mathbf{R}_1$ and $\mathbf{R}_2$, respectively. (b) The criterion
for well-separated protons is that they be far apart compared with the range of a one-electron
hydrogenic wave function. (c) The electrostatic field outside of a spherically symmetric distribution
of charge is precisely the field one would have if all the charge were concentrated in a single
point charge at the center of the sphere. It is also convenient to include in the Hamiltonian the
(constant) interaction energy $e^2/|\mathbf{R}_1 - \mathbf{R}_2|$ of the two protons.)

5.  Hubbard  Model  of  the  Hydrogen  Molecule 

The Hubbard model represents an atom at $\mathbf{R}$ by a single orbital electronic level $|\mathbf{R}\rangle$. If the level
is empty (no electron on the atom) the energy is zero, if one electron (of either spin) is in the level
the energy is $\mathcal{E}$, and if two electrons (of necessarily opposite spins) are in the level the energy
is $2\mathcal{E} + U$, the additional positive energy U representing the intra-atomic Coulomb repulsion
between the two localized electrons. (The exclusion principle prevents more than two electrons
from occupying the level.)

The Hubbard model for a two-atom molecule consists of two such orbital levels, $|\mathbf{R}\rangle$ and
$|\mathbf{R}'\rangle$, representing electrons localized at $\mathbf{R}$ and $\mathbf{R}'$, respectively. For simplicity, one takes the two
levels to be orthogonal:

$$\langle \mathbf{R} | \mathbf{R}' \rangle = 0. \tag{32.29}$$

We first consider the problem of two “protons” and one electron (i.e., H$_2^+$). If the one-electron
Hamiltonian h were diagonal in $|\mathbf{R}\rangle$ and $|\mathbf{R}'\rangle$, the stationary levels would describe a hydrogen
atom and a proton. We know, however, that if the protons are not too far apart, there will be
a probability for the electron tunneling from one to the other, which leads to an ionized hydrogen
molecule. We represent this amplitude for tunneling by an off-diagonal term in the one-electron
Hamiltonian:

$$\langle \mathbf{R} | h | \mathbf{R}' \rangle = \langle \mathbf{R}' | h | \mathbf{R} \rangle = -t, \tag{32.30}$$

where we may choose the phases of $|\mathbf{R}\rangle$ and $|\mathbf{R}'\rangle$ to make the number t real and positive. This,
in conjunction with the diagonal terms

$$\langle \mathbf{R} | h | \mathbf{R} \rangle = \langle \mathbf{R}' | h | \mathbf{R}' \rangle = \mathcal{E}, \tag{32.31}$$

defines the one-electron problem.

(a)  Show that the one-electron stationary levels are

$$\frac{1}{\sqrt{2}}\left(|\mathbf{R}\rangle \mp |\mathbf{R}'\rangle\right) \tag{32.32}$$

with corresponding eigenvalues

$$\mathcal{E} \pm t. \tag{32.33}$$

As a first approach to the two-electron problem (the hydrogen molecule) we make the
independent electron approximation for the singlet (spatially symmetric) ground state, putting both
electrons into the one-electron level of lowest energy, to get a total energy of $2(\mathcal{E} - t)$. This
ignores entirely the interaction energy U arising when two electrons are found on the same proton.
The crudest way to improve upon the estimate $2(\mathcal{E} - t)$ is to add the energy U, multiplied by
the probability of actually finding two electrons on the same proton, when the molecule is in
the ground state of the independent electron approximation.

(b)  Show that this probability is $\frac{1}{2}$, so that the improved independent electron estimate of
the ground-state energy is

$$E_{ie} = 2(\mathcal{E} - t) + \tfrac{1}{2}U. \tag{32.34}$$

(This result is just the Hartree (or self-consistent field) approximation, applied to the Hubbard
model. See Chapters 11 and 17.)

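The full set of singlet (spatially symmetric) states of the two-electron problem are:

$$\Phi_0 = \frac{1}{\sqrt{2}}\left(|\mathbf{R}\rangle|\mathbf{R}'\rangle + |\mathbf{R}'\rangle|\mathbf{R}\rangle\right), \qquad \Phi_1 = |\mathbf{R}\rangle|\mathbf{R}\rangle, \qquad \Phi_2 = |\mathbf{R}'\rangle|\mathbf{R}'\rangle, \tag{32.35}$$

where $|\mathbf{R}\rangle|\mathbf{R}'\rangle$ has electron 1 on the ion at $\mathbf{R}$, and electron 2 on the ion at $\mathbf{R}'$, etc.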
(c)  Show that the approximate ground state wave function in the independent electron
approximation can be written in terms of the states (32.35) as

$$\Phi_{ie} = \frac{1}{\sqrt{2}}\Phi_0 + \frac{1}{2}(\Phi_1 + \Phi_2). \tag{32.36}$$

The matrix elements of the full two-electron Hamiltonian,

$$H = h_1 + h_2 + V_{12}, \tag{32.37}$$

in the space of singlet states are $H_{ij} = (\Phi_i, H\Phi_j)$, where

$$\begin{pmatrix} H_{00} & H_{01} & H_{02} \\ H_{10} & H_{11} & H_{12} \\ H_{20} & H_{21} & H_{22} \end{pmatrix} = \begin{pmatrix} 2\mathcal{E} & -\sqrt{2}\,t & -\sqrt{2}\,t \\ -\sqrt{2}\,t & 2\mathcal{E} + U & 0 \\ -\sqrt{2}\,t & 0 & 2\mathcal{E} + U \end{pmatrix}. \tag{32.38}$$


Note that the diagonal elements in the states $\Phi_1$ and $\Phi_2$ that place two electrons on the same
proton contain the extra Coulomb repulsion U; the Coulomb repulsion is not present in the
diagonal element in the state $\Phi_0$, since in $\Phi_0$ the electrons are on different protons. This appearance
of U is the only effect of the electron-electron interaction $V_{12}$. Note also that the one-electron
tunneling amplitude t connects only states in which a single electron has been moved from one
proton to the other (it would take a further two-body interaction to give a nonvanishing matrix
element between states in which the positions of two electrons are changed). Convince yourself
that the factor of $\sqrt{2}$ in (32.38) is correct.

The Heitler-London approximation to the singlet ground state is just $\Phi_0$, and therefore the
Heitler-London estimate of the ground-state energy is just $H_{00}$, so

$$E_{HL} = 2\mathcal{E}. \tag{32.39}$$


(d)  Show that the exact ground-state energy of the Hamiltonian (32.38) is

$$E = 2\mathcal{E} + \tfrac{1}{2}U - \sqrt{4t^2 + \tfrac{1}{4}U^2}. \tag{32.40}$$

Plot this energy, the independent electron approximation (32.34) to the ground-state energy, and
the Heitler-London approximation to the ground-state energy as functions of U (for fixed $\mathcal{E}$ and
t). Comment on the behavior for large and small U/t and why it is physically reasonable. How
do these three energies compare when U = 2t?

(e) Show that the exact ground state of the Hamiltonian (32.38) is (to within a normalization
constant)

$$\Phi = \frac{1}{\sqrt{2}}\,\Phi_0 + \left(\sqrt{1 + \left(\frac{U}{4t}\right)^2} - \frac{U}{4t}\right)\frac{1}{2}(\Phi_1 + \Phi_2). \tag{32.41}$$

What is the probability in this state of finding two electrons on the same ion? Plot your answer
as a function of $U$ (for fixed $\mathcal{E}$ and $t$) and comment on its behavior for small and large $U/t$.
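
A numerical cross-check of parts (c) through (e) can be set up in a few lines. The sketch below is only illustrative: it assumes NumPy, and the function names `hubbard_h2` and `compare_energies` are our own. It builds the matrix (32.38), diagonalizes it, and compares the lowest eigenvalue with the closed form (32.40), with the expectation value of the matrix in the state (32.36), and with the Heitler-London estimate (32.39). It also returns the weight of the exact ground state on $\Phi_1$ and $\Phi_2$.

```python
import numpy as np

def hubbard_h2(eps, t, U):
    """Matrix (32.38) in the singlet basis (Phi_0, Phi_1, Phi_2)."""
    r2t = np.sqrt(2.0) * t
    return np.array([
        [2 * eps, -r2t, -r2t],
        [-r2t, 2 * eps + U, 0.0],
        [-r2t, 0.0, 2 * eps + U],
    ])

def compare_energies(eps, t, U):
    """Energies in the three approximations, plus the double-occupancy weight."""
    H = hubbard_h2(eps, t, U)
    evals, evecs = np.linalg.eigh(H)
    ground = evecs[:, 0]                            # normalized lowest eigenvector
    phi_ie = np.array([1 / np.sqrt(2), 0.5, 0.5])   # state (32.36)
    return {
        "exact": evals[0],
        "formula": 2 * eps + U / 2 - np.sqrt(4 * t**2 + U**2 / 4),  # (32.40)
        "independent": phi_ie @ H @ phi_ie,         # expectation value in (32.36)
        "heitler_london": H[0, 0],                  # (32.39)
        "p_double": ground[1] ** 2 + ground[2] ** 2,
    }

if __name__ == "__main__":
    # eps = 0 and t = 1 are arbitrary choices that only set the zero and unit of energy.
    for U in (0.0, 2.0, 20.0):
        print(U, compare_energies(eps=0.0, t=1.0, U=U))
```

Scanning `U` at fixed `eps` and `t` gives the data for the plots requested in (d) and (e).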
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In  the  preceding  two  chapters  we  were  primarily  concerned  with  how  magnetic 
moments can be established in solids, and how electron-electron Coulomb interactions,
together with the Pauli exclusion principle, can give rise to effective interactions
among moments. In this chapter we shall start from the existence of such interacting
moments, without further inquiry into the theory of their origin. We shall survey the
types of magnetic structure such moments are observed to yield, and shall examine
some typical problems encountered in trying to deduce the behavior of these structures,
even when one’s starting point is not the fundamental Hamiltonian of the
solid, but a “simple” phenomenological model of interacting moments.

In  describing  observed  properties  of  magnetic  structures  we  shall  avoid  committing 
ourselves  to  a  particular  model  of  the  underlying  magnetic  interactions.  In  most  of  our 
theoretical  analysis,  however,  we  shall  use  the  spin  Hamiltonian  (32.20)  of  Heisenberg. 
As  it  turns  out,  even  starting  from  the  Heisenberg  model,  it  is  an  exceedingly  difficult 
task  to  deduce  the  magnetic  properties  of  the  solid  as  the  temperature  and  applied  field 
vary.  No  systematic  solution  has  been  found  even  to  this  simplified  model  problem, 
although  much  partial  information  has  been  extracted  in  a  variety  of  important  cases. 
We  shall  discuss  the  following  representative  subjects: 

1.  The  types  of  magnetic  ordering  that  have  been  observed. 

2.  The  theory  of  the  very  low-lying  states  of  magnetically  ordered  systems. 

3.  The  theory  of  high-temperature  magnetic  properties. 

4.  The  critical  region  of  temperatures  in  which  magnetic  ordering  disappears. 

5.  A  very  crude  phenomenological  theory  of  magnetic  ordering  (mean  field  theory). 

6. Some important consequences of magnetic dipolar interactions in ferromagnetically
ordered solids.

TYPES  OF  MAGNETIC  STRUCTURE 

We  shall  use  the  language  appropriate  to  solids  in  which  the  magnetic  ions  are 
localized  at  lattice  sites,  indicating  below  how  the  discussion  may  be  generalized  to 
itinerant  electron  magnetism. 

If  there  were  no  magnetic  interactions,  individual  magnetic  moments  would,  in 
the  absence  of  a  field,  be  thermally  disordered  at  any  temperature,  and  the  vector 
moment  of  each  magnetic  ion  would  average  to  zero.1  In  some  solids,  however, 
individual  magnetic  ions  have  nonvanishing  average  vector  moments  below  a  critical 
temperature  Tc.  Such  solids  are  called  magnetically  ordered. 

The  individual  localized  moments  in  a  magnetically  ordered  solid  may  or  may 
not  add  up  to  a  net  magnetization  density  for  the  solid  as  a  whole.  If  they  do,  the 
microscopic  magnetic  ordering  is  revealed  by  the  existence  of  a  macroscopic  bulk 
magnetization density (even in the absence of an applied field) known as the spontaneous
magnetization, and the ordered state is described as ferromagnetic.

More  common  is  the  case  in  which  the  individual  local  moments  sum  to  zero  total 
moment,  and  no  spontaneous  magnetization  is  present  to  reveal  the  microscopic 
ordering.  Such  magnetically  ordered  states  are  called  antiferromagnetic. 


1  This was demonstrated in Chapter 31. Equation (31.44) gives M = 0 when H = 0 at any T.
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In  the  simplest  ferromagnets  all  the  local  moments  have  the  same  magnitude  and 
average  direction.  The  simplest  antiferromagnetic  state  arises  when  the  local  moments 
fall  on  two  interpenetrating  sublattices  of  identical  structure.2  Within  each  sublattice 
the  moments  have  the  same  magnitude  and  average  direction,  but  the  net  moments 
of  the  two  sublattices  are  oppositely  directed,  summing  to  zero  total  moment  (see 
Figure  33.1). 


Figure 33.1

Some simple antiferromagnetic spin arrangements. (a) Antiferromagnetic ordering on a
body-centered cubic lattice. Spins of the same kind form two interpenetrating simple cubic
lattices. (b) Antiferromagnetic ordering on a simple cubic lattice. Spins of the same kind
form two interpenetrating face-centered cubic lattices.


The  term  “ferromagnetic”  is  also  used  in  a  more  restrictive  sense,  when  one 
distinguishes  among  the  varieties  of  ferromagnetic  states  that  can  occur  when  there 
are  many  (not  necessarily  identical)  magnetic  ions  per  primitive  cell.  In  such  contexts 
the  term  “ferromagnetic”  is  often  reserved  for  those  magnetic  structures  in  which  all 
the  local  moments  have  a  positive  component  along  the  direction  of  spontaneous 
magnetization.  Solids  possessing  a  spontaneous  magnetization  that  fail  to  satisfy  this 
criterion  are  called  ferrimagnets.3  In  a  simple  ferrimagnet  the  exchange  coupling 
between  nearest  neighbors  may  favor  antiparallel  alignment,  but  because  neighboring 
magnetic  ions  are  not  identical  their  moments  will  not  cancel,  leaving  a  net  moment 
for  the  solid  as  a  whole. 

Some  of  the  many  types  of  magnetic  ordering  are  illustrated  schematically  in 
Figure  33.2.  Many  magnetic  structures  are  so  complex  that  it  is  better  to  describe 
them  explicitly  rather  than  using  one  of  the  above  three  categories. 

Similar  distinctions  can  be  made  for  magnetically  ordered  metals,  even  though  the 
concept  of  a  localized  magnetic  ion  may  be  inapplicable.  One  specifies  the  ordering 
in terms of the spin density, which is defined so that at any point $\mathbf{r}$ and along any
direction $z$, $s_z(\mathbf{r}) = \frac{1}{2}[n_\uparrow(\mathbf{r}) - n_\downarrow(\mathbf{r})]$, where $n_\uparrow(\mathbf{r})$ and $n_\downarrow(\mathbf{r})$ are the contributions of the
two spin populations to the electronic density when the spins are resolved into
components along the z-axis. In a magnetically ordered metal the local spin density
fails to vanish. In a ferromagnetic metal $\int d\mathbf{r}\, s_z(\mathbf{r})$ is also nonvanishing for some
direction $z$, while it is zero for any choice of $z$ in an antiferromagnetic metal, even
though $s_z(\mathbf{r})$ itself does not vanish.

The  magnetic  structures  observed  in  metals  can  also  be  complex.  Antiferromagnetic 
chromium,  for  example,  has  a  nonvanishing  periodic  spin  density,  whose  period  is 


2  For example, a simple cubic lattice can be viewed as two interpenetrating face-centered cubic
lattices; a body-centered cubic lattice can be viewed as two interpenetrating simple cubic lattices. A
face-centered cubic lattice, however, cannot be so represented.

3  After  the  ferrites.  See  W.  P.  Wolf,  Repts.  Prog.  Phys.  24,  212  (1961)  for  a  review. 
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unrelated  to  the  periodicity  of  the  lattice  under  normal  conditions,  being  determined 
instead  by  the  Fermi  surface  geometry. 

Some  examples  of  magnetically  ordered  solids  are  listed  in  Tables  33.1  to  33.3. 


OBSERVATION  OF  MAGNETIC  STRUCTURES 

The  magnetic  ordering  of  a  solid  with  a  spontaneous  magnetization  is  clearly  revealed 
by  the  resulting  macroscopic  magnetic  field.4  However  the  magnetic  ordering  in 
antiferromagnetic  solids  yields  no  macroscopic  field,  and  must  be  diagnosed  by 
more  subtle  means.  Low-energy  neutrons  are  an  excellent  probe  of  the  local  moments, 
for  the  neutron  has  a  magnetic  moment  that  couples  to  the  electronic  spin  in  the 
solid.  This  leads  to  peaks  in  the  elastic  neutron  scattering  cross  section  in  addition  to 
those  due  to  the  nonmagnetic  Bragg  reflection  of  the  neutrons  by  the  ionic  nuclei  (see 
page  473).  The  magnetic  reflections  can  be  distinguished  from  the  nonmagnetic  ones 
because  they  weaken  and  disappear  as  the  temperature  rises  through  the  critical 


4  A caveat: This is often masked by a domain structure. See pages 718-722.
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Table  33.1 

SELECTED FERROMAGNETS, WITH CRITICAL TEMPERATURES Tc AND SATURATION
MAGNETIZATION M0

MATERIAL      Tc (K)      M0 (gauss)^a
Fe            1043        1752
Co            1388        1446
Ni            627         510
Gd            293         1980
Dy            85          3000
CrBr3         37          270
Au2MnAl       200         323
Cu2MnAl       630         726
Cu2MnIn       500         613
EuO           77          1910
EuS           16.5        1184
MnAs          318         870
MnBi          670         675
GdCl3         2.2         550

a  At T = 0 (K).

Source: F. Keffer, Handbuch der Physik, vol. 18, pt. 2, Springer, New York, 1966;
P. Heller, Rep. Progr. Phys. 30 (pt. II), 731 (1967).

Table  33.2 


SELECTED ANTIFERROMAGNETS, WITH CRITICAL TEMPERATURES Tc

MATERIAL      Tc (K)      MATERIAL      Tc (K)
MnO           122         KCoF3         125
FeO           198         MnF2          67.34
CoO           291         FeF2          78.4
NiO           600         CoF2          37.7
RbMnF3        54.5        MnCl2         2
KFeF3         115         VS            1040
KMnF3         88.3        Cr            311

Source: F. Keffer, Handbuch der Physik, vol. 18, pt. 2, Springer, New York, 1966.

temperature  at  which  the  ordering  vanishes,  and  also  by  the  way  they  vary  with 
applied  magnetic  fields.5  (See  Figure  33.3.) 

Nuclear  magnetic  resonance6  offers  another  way  to  probe  the  microscopic  spin 
structure.  The  ionic  nuclei  feel  the  dipolar  magnetic  fields  of  nearby  electrons.  Nuclear 
magnetic  resonance  in  magnetically  ordered  solids  can  therefore  be  observed  even 


5  A comprehensive survey of both theoretical and experimental aspects of neutron scattering from
magnetically ordered solids can be found in Y. A. Izyumov and R. P. Ozerov, Magnetic Neutron Diffraction,
Plenum Press, New York, 1970.

6  See  page  665. 


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Table  33.3 

SELECTED FERRIMAGNETS, WITH CRITICAL TEMPERATURES Tc AND SATURATION
MAGNETIZATION M0

MATERIAL              Tc (K)      M0 (gauss)^a
Fe3O4 (magnetite)     858         510
CoFe2O4               793         475
NiFe2O4               858         300
CuFe2O4               728         160
MnFe2O4               573         560
Y3Fe5O12 (YIG)        560         195

a  At T = 0 (K).

Source: F. Keffer, Handbuch der Physik, vol. 18, pt. 2, Springer, New York, 1966.
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Figure  33.3 

(a) Neutron Bragg peaks in manganese vanadite (MnV2O4), an antiferromagnet with Tc = 56 K.
The  intensity  of  the  peaks  decreases  as  T  rises  to  Tc.  (b)  Intensity  of  the  (220)  and  (111)  peaks  vs. 
temperature.  Above  Tc  the  temperature  dependence  is  very  slight.  (From  R.  Plumier,  Proceedings 
of  the  International  Conference  on  Magnetism,  Nottingham,  1964.) 
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in  the  absence  of  applied  fields,  the  field  at  the  nucleus  (and  hence  the  resonance 
frequency) being entirely due to the ordered moments. Thus nuclear magnetic resonance
can be used, for example, to measure the macroscopically inaccessible net
magnetization  of  each  antiferromagnetic  sublattice  (see,  for  example,  Figure  33.4). 


THERMODYNAMIC  PROPERTIES  AT  THE  ONSET  OF 
MAGNETIC  ORDERING 

The  critical  temperature  Tc  above  which  magnetic  ordering  vanishes  is  known  as 
the Curie temperature in ferromagnets (or ferrimagnets) and the Néel temperature
(often written $T_N$) in antiferromagnets. As the critical temperature is approached
from below, the spontaneous magnetization (or, in antiferromagnets, the sublattice
magnetization) drops continuously to zero. The observed magnetization just below
$T_c$ is well described by a power law,

$$M(T) \sim (T_c - T)^{\beta}, \tag{33.1}$$

where $\beta$ is typically between 0.33 and 0.37 (see Figure 33.4).

The  onset  of  ordering  is  also  signaled  as  the  temperature  drops  to  Tc  from  above, 
most  notably  by  the  zero-field  susceptibility.  In  the  absence  of  magnetic  interactions 
the  susceptibility  varies  inversely  with  T  at  all  temperatures  (Curie’s  law,  page  656). 
In  a  ferromagnet,  however,  the  susceptibility  is  observed  to  diverge  as  T  drops  to 
$T_c$, following the power law:

$$\chi(T) \sim (T - T_c)^{-\gamma}, \tag{33.2}$$

where $\gamma$ is typically between 1.3 and 1.4 (see Figure 33.5). In an antiferromagnet the
susceptibility rises to a maximum a little above $T_c$, and then drops toward $T_c$, with
a large maximum in its slope at the critical point (see Figure 33.6).

There  is  also  a  characteristic  singularity  in  the  zero-field  specific  heat  at  a  magnetic 
critical  point: 

$$c(T) \sim (T - T_c)^{-\alpha}. \tag{33.3}$$

The singularity is not nearly so strong as in the susceptibility, the exponent $\alpha$ being
of order 0.1 or less.7
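
Exponents such as those in (33.1) and (33.2) are commonly read off as slopes on a log-log plot, as in Figure 33.5. The following sketch illustrates the idea on synthetic data only. The amplitudes, the temperature windows, and the function name `fit_exponent` are our own arbitrary choices, and the "data" are exact power laws generated with exponents picked from the ranges quoted above.

```python
import numpy as np

def fit_exponent(reduced_t, values):
    """Slope of log(values) versus log(reduced_t) from a least-squares line."""
    slope, _intercept = np.polyfit(np.log(reduced_t), np.log(values), 1)
    return slope

# Synthetic data only: amplitudes and temperature windows are arbitrary.
Tc = 1043.0                                   # K, the critical temperature quoted for Fe
T_below = np.linspace(Tc - 20.0, Tc - 0.5, 40)
T_above = np.linspace(Tc + 0.5, Tc + 20.0, 40)
M = 2.0 * (Tc - T_below) ** 0.35              # form (33.1) with beta = 0.35
chi = 3.0 * (T_above - Tc) ** (-1.33)         # form (33.2) with gamma = 1.33

beta_fit = fit_exponent(Tc - T_below, M)      # slope is +beta
gamma_fit = -fit_exponent(T_above - Tc, chi)  # slope is -gamma

if __name__ == "__main__":
    print(beta_fit, gamma_fit)
```

With real measurements the fitted slope depends on the assumed $T_c$ and on the window chosen, so this is a starting point and not a substitute for a careful critical-point analysis.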

The critical region of temperatures is probably the most difficult to handle theoretically.
We shall comment further on the theory of the critical region below, but shall
first turn our attention to the regimes of low ($T \ll T_c$) and high ($T \gg T_c$) temperatures,
which are more amenable to analysis.


7  A variety of other critical exponents can also be defined and measured. Excellent reviews of magnetic
and other critical points have been given by M. E. Fisher, Rep. Progr. Phys. 30 (pt. II), 615 (1967); P.
Heller, Rep. Progr. Phys. 30 (pt. II), 731 (1967); and L. P. Kadanoff et al., Rev. Mod. Phys. 39, 395 (1967).
A theory of the critical point that permits the numerical calculation of critical exponents has been developed
by K. G. Wilson. It is based on the methods of the renormalization group and has been given an
elementary review by S. Ma, Rev. Mod. Phys. 45, 589 (1973). See also M. E. Fisher, Rev. Mod. Phys. 46,
597 (1974).


Figure  33.4 

(a) The temperature dependence of the zero-field 19F nuclear magnetic resonance frequency in
the antiferromagnet MnF2. The resonance frequency vanishes at the antiferromagnetic critical
temperature Tc = 67.336 K. (From P. Heller and G. B. Benedek, Phys. Rev. Lett. 8, 428 (1962).)

(b) Temperature dependence of the cube of the zero-field 19F nuclear magnetic resonance
frequency in MnF2 in the immediate vicinity of Tc (note that the temperature scale is much expanded
compared with that in (a)). If the frequency is proportional to the sublattice magnetization, this
demonstrates that the magnetization vanishes as $(T_c - T)^{1/3}$ to very high accuracy.
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Figure  33.5 

The susceptibility of iron (with a small amount of dissolved tungsten) above the
critical temperature Tc = 1043 K. A power law is very closely obeyed in this range,
and the slope gives $\chi \sim (T - T_c)^{-1.33}$. (From J. E. Noakes et al., J. Appl. Phys.
37, 1264 (1966). Note that $\log_{10} \chi = 0.4343 \ln \chi$.)


ZERO-TEMPERATURE  PROPERTIES:  GROUND  STATE  OF  THE 
HEISENBERG  FERROMAGNET 

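We consider a set of magnetic ions at Bravais lattice sites $\mathbf{R}$, whose low-lying excitations
can be described by a ferromagnetic Heisenberg Hamiltonian (Eq. (32.20)),8

$$\mathcal{H} = -\frac{1}{2}\sum_{\mathbf{R},\mathbf{R}'} \mathbf{S}(\mathbf{R})\cdot\mathbf{S}(\mathbf{R}')\,J(\mathbf{R}-\mathbf{R}') - g\mu_B H \sum_{\mathbf{R}} S_z(\mathbf{R}),$$

$$J(\mathbf{R}-\mathbf{R}') = J(\mathbf{R}'-\mathbf{R}) \geq 0. \tag{33.4}$$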

We describe this Hamiltonian as ferromagnetic because a positive exchange interaction
$J$ favors parallel spin alignment. Effects due to magnetic dipolar coupling
between  the  moments  are  not  included  in  the  interaction  J,  but  can  be  taken  into 
account  by  suitable  definition  of  the  field  H  (whose  direction  we  take  to  define  the 
z-axis)  that  acts  on  the  local  spins.  We  shall  elaborate  on  this  below  (p.  722),  but 


8  It is the widespread practice to refer to the operators in the Heisenberg Hamiltonian as spin
operators, even though the spin operator for each ion here represents its total angular momentum which,
in general, has both a spin and an orbital part. It is also the usual practice to take these fictitious spins to
be parallel to the magnetic moment of the ion, rather than to its total angular momentum; i.e., the term
in $H$ in (33.4) is preceded by a minus sign (for positive $g\mu_B$), when $\mathbf{H}$ is along the positive z-axis.


702  Chapter  33  Magnetic  Ordering 




Figure  33.6 

Characteristic temperature dependence of the susceptibility of an antiferromagnet near the
critical temperature. Below Tc the susceptibility depends very much on whether the field is
applied parallel or perpendicular to the direction of sublattice magnetization. Note that if
the antiferromagnet were perfectly isotropic this would not be the case: Whatever the
direction of the applied field, the sublattice magnetization would rotate into the energetically
most favorable orientation with respect to the field (presumably perpendicular) and there
would be only one susceptibility ($\chi_\perp$). The orientation dependence below Tc is due to
crystalline anisotropy. The anisotropy is also responsible for the slight difference in
$\chi_\parallel$ and $\chi_\perp$ above Tc, where parallel and perpendicular now refer to the axis along which
(because of anisotropy) the sublattice magnetization below Tc prefers to lie. (See
M. E. Fisher, Phil. Mag. 7, 1731 (1962).)


merely  note  for  now  that  H  is  the  local  field  (in  the  sense  of  Chapter  27)  acting  on 
each  magnetic  ion,  which  is  not  necessarily  equal  to  the  externally  applied  field. 

If  we  were  to  regard  the  spins  appearing  in  the  Hamiltonian  (33.4)  as  classical 
vectors,  we  should  expect  the  state  of  lowest  energy  to  be  the  one  with  all  spins 
aligned  along  the  z-axis,  parallel  to  the  magnetic  field  and  to  each  other.  This  suggests, 
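as a candidate for the quantum-mechanical ground state $|0\rangle$, one that is an eigenstate
of $S_z(\mathbf{R})$ for every $\mathbf{R}$ with the maximum eigenvalue, $S$:

$$|0\rangle = \prod_{\mathbf{R}} |S\rangle_{\mathbf{R}}, \tag{33.5}$$

where

$$S_z(\mathbf{R})\,|S\rangle_{\mathbf{R}} = S\,|S\rangle_{\mathbf{R}}. \tag{33.6}$$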




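To verify that $|0\rangle$ is indeed an eigenstate of $\mathcal{H}$, we rewrite the Hamiltonian (33.4)
in terms of the operators

$$S_\pm(\mathbf{R}) = S_x(\mathbf{R}) \pm i S_y(\mathbf{R}), \tag{33.7}$$

which have the property9

$$S_\pm(\mathbf{R})\,|S_z\rangle_{\mathbf{R}} = \sqrt{(S \mp S_z)(S + 1 \pm S_z)}\;|S_z \pm 1\rangle_{\mathbf{R}}. \tag{33.8}$$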

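Separating the terms in $S_z$ from those containing $S_+$ or $S_-$, we may write

$$\mathcal{H} = -\frac{1}{2}\sum_{\mathbf{R},\mathbf{R}'} J(\mathbf{R}-\mathbf{R}')\,S_z(\mathbf{R})S_z(\mathbf{R}') - g\mu_B H \sum_{\mathbf{R}} S_z(\mathbf{R}) - \frac{1}{2}\sum_{\mathbf{R},\mathbf{R}'} J(\mathbf{R}-\mathbf{R}')\,S_-(\mathbf{R}')S_+(\mathbf{R}). \tag{33.9}$$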

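Since $S_+(\mathbf{R})|S_z\rangle_{\mathbf{R}} = 0$ when $S_z = S$, it follows that when $\mathcal{H}$ acts on $|0\rangle$, only the
terms in $S_z$ contribute to the result. But $|0\rangle$ is constructed to be an eigenstate of each
$S_z(\mathbf{R})$ with eigenvalue $S$, and therefore

$$\mathcal{H}|0\rangle = E_0|0\rangle, \tag{33.10}$$

where

$$E_0 = -\frac{1}{2}S^2\sum_{\mathbf{R},\mathbf{R}'} J(\mathbf{R}-\mathbf{R}') - N g\mu_B H S. \tag{33.11}$$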

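Thus $|0\rangle$ is indeed an eigenstate of $\mathcal{H}$. To prove that $E_0$ is the energy of the ground
state, we consider any other eigenstate $|0'\rangle$ of $\mathcal{H}$ with eigenvalue $E_0'$. Since

$$E_0' = \langle 0'|\mathcal{H}|0'\rangle, \tag{33.12}$$

it follows that when all the $J(\mathbf{R}-\mathbf{R}')$ are positive, $E_0'$ has the lower bound

$$-\frac{1}{2}\sum_{\mathbf{R},\mathbf{R}'} J(\mathbf{R}-\mathbf{R}')\,\max\langle \mathbf{S}(\mathbf{R})\cdot\mathbf{S}(\mathbf{R}')\rangle - g\mu_B H \sum_{\mathbf{R}} \max\langle S_z(\mathbf{R})\rangle, \tag{33.13}$$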

where $\max\langle X\rangle$ is the largest diagonal matrix element that the operator $X$ can assume
(in any state whatsoever). In Problem 1 it is shown that10

$$\langle \mathbf{S}(\mathbf{R})\cdot\mathbf{S}(\mathbf{R}')\rangle \leq S^2, \quad \mathbf{R} \neq \mathbf{R}',$$

$$\langle S_z(\mathbf{R})\rangle \leq S. \tag{33.14}$$

Combining these inequalities with the bound (33.13) for $E_0'$ and comparing the
resulting inequality with the form (33.11) of $E_0$, we conclude that $E_0'$ cannot be less
than $E_0$, and therefore $E_0$ must be the energy of the ground state.
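
The first inequality in (33.14) can be checked numerically for a pair of spins. The sketch below is our own illustration, assuming NumPy, with hypothetical helper names `spin_matrices` and `dot_product_extremes`. It builds spin-$S$ matrices from the matrix elements (33.8), forms $\mathbf{S}(\mathbf{R})\cdot\mathbf{S}(\mathbf{R}')$ for two distinct sites as a Kronecker product, and returns its smallest and largest eigenvalues. Since every diagonal matrix element of a Hermitian operator lies between its extreme eigenvalues, these are the quantities $\min\langle\cdot\rangle$ and $\max\langle\cdot\rangle$ of the text.

```python
import numpy as np

def spin_matrices(S):
    """Return (Sz, S+, S-) for spin S in the basis m = S, S-1, ..., -S."""
    m = S - np.arange(int(round(2 * S)) + 1)
    Sz = np.diag(m)
    Sp = np.zeros((len(m), len(m)))
    for col in range(1, len(m)):
        # <m+1| S+ |m> = sqrt((S - m)(S + 1 + m)), the upper sign in (33.8)
        Sp[col - 1, col] = np.sqrt((S - m[col]) * (S + 1 + m[col]))
    return Sz, Sp, Sp.T

def dot_product_extremes(S):
    """Smallest and largest eigenvalues of S(R).S(R') for two distinct sites."""
    Sz, Sp, Sm = spin_matrices(S)
    SS = np.kron(Sz, Sz) + 0.5 * (np.kron(Sp, Sm) + np.kron(Sm, Sp))
    w = np.linalg.eigvalsh(SS)
    return w[0], w[-1]

if __name__ == "__main__":
    for S in (0.5, 1.0, 1.5, 2.0):
        print(S, dot_product_extremes(S))
```

The largest eigenvalue comes out as $S^2$ and the smallest as $-S(S+1)$, which is the asymmetry between the ferromagnetic and antiferromagnetic cases.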


9  See, for example, A. Messiah, Quantum Mechanics, Wiley, New York, p. 512, 1962.

10  These results may seem “obvious” to one’s classical intuition, but one should reflect on the
fact that $\min\langle \mathbf{S}(\mathbf{R})\cdot\mathbf{S}(\mathbf{R}')\rangle$ is not $-S^2$, but $-S(S+1)$ (Problem 1). And, of course, if $\mathbf{R} = \mathbf{R}'$, then
$\max\langle \mathbf{S}(\mathbf{R})\cdot\mathbf{S}(\mathbf{R}')\rangle$ is not $S^2$, but $S(S+1)$.




ZERO-TEMPERATURE  PROPERTIES: 

GROUND  STATE  OF  THE  HEISENBERG  ANTIFERROMAGNET 

Finding the ground state of the Heisenberg antiferromagnet is an unsolved problem
except in the special case of a one-dimensional array of spin 1/2 ions with coupling
only between nearest neighbors.11 The difficulty is illustrated by the case in which
the spins reside on two sublattices, each spin interacting only with those on the
other sublattice. In the absence of an applied field the Hamiltonian is

$$\mathcal{H} = \frac{1}{2}\sum_{\mathbf{R},\mathbf{R}'} |J(\mathbf{R}-\mathbf{R}')|\,\mathbf{S}(\mathbf{R})\cdot\mathbf{S}(\mathbf{R}'). \tag{33.15}$$

An obvious guess for the ground state is to put each sublattice into a ferromagnetic
ground state of the form (33.5), with oppositely directed sublattice magnetizations.
If the spins were classical vectors this would take maximum advantage of the
antiferromagnetic coupling between sublattices, and would yield a ground-state energy

$$E_0 = -\frac{1}{2}\sum_{\mathbf{R},\mathbf{R}'} |J(\mathbf{R}-\mathbf{R}')|\,S^2. \tag{33.16}$$

In contrast to the ferromagnetic case, however, the terms $S_-(\mathbf{R})S_+(\mathbf{R}')$ in the
Hamiltonian (33.9), when acting on such a state, do not always give zero, but also yield
a state in which a spin in the “up” sublattice has had its z-component reduced by
unity and a spin in the “down” sublattice is correspondingly raised. Thus the state
is not an eigenstate.

All that can easily be established is that (33.16) is an upper bound to the true
ground-state energy (Problem 2). A lower bound can also be found (Problem 2)
leading to the inequality:

$$-\frac{1}{2}S(S+1)\sum_{\mathbf{R},\mathbf{R}'} |J(\mathbf{R}-\mathbf{R}')| \leq E_0 \leq -\frac{1}{2}S^2\sum_{\mathbf{R},\mathbf{R}'} |J(\mathbf{R}-\mathbf{R}')|. \tag{33.17}$$

In the limit of large spin (when the spins become, in effect, classical vectors) the
ratio of these bounds approaches unity. However, they are far from restrictive when
$S$ is small. In the one-dimensional nearest-neighbor spin 1/2 chain, for example,
the bounds give $-0.25NJ \geq E_0 \geq -0.75NJ$, while Bethe’s exact result is
$E_0 = -NJ[\ln 2 - (1/4)] = -0.443NJ$. Thus more elaborate analysis is needed to estimate
accurately the antiferromagnetic ground-state energy.
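
For a feel of how loose the bounds (33.17) are, one can diagonalize a short spin 1/2 ring by brute force. The sketch below is illustrative only. It assumes NumPy, the helper names are ours, and a ring of eight sites is far from the infinite chain treated by Bethe, so only rough agreement should be expected. The Hamiltonian is (33.15) with nearest-neighbor coupling, written as a sum over bonds (each bond appears twice in the double sum, which cancels the factor 1/2).

```python
import numpy as np

sz = np.array([[0.5, 0.0], [0.0, -0.5]])
sp = np.array([[0.0, 1.0], [0.0, 0.0]])   # S_+ for spin 1/2
sm = sp.T                                 # S_-

def site_op(op, i, n):
    """Embed a single-site operator at site i of an n-site system."""
    out = np.array([[1.0]])
    for j in range(n):
        out = np.kron(out, op if j == i else np.eye(2))
    return out

def heisenberg_ring(n, J=1.0):
    """H = J * sum over nearest-neighbor bonds of S_i . S_{i+1} on a ring, n >= 3."""
    H = np.zeros((2 ** n, 2 ** n))
    for i in range(n):
        j = (i + 1) % n
        H += J * (site_op(sz, i, n) @ site_op(sz, j, n)
                  + 0.5 * (site_op(sp, i, n) @ site_op(sm, j, n)
                           + site_op(sm, i, n) @ site_op(sp, j, n)))
    return H

def ground_energy_per_site(n, J=1.0):
    """Lowest eigenvalue of the ring Hamiltonian divided by the number of sites."""
    return np.linalg.eigvalsh(heisenberg_ring(n, J))[0] / n

if __name__ == "__main__":
    bethe = -(np.log(2.0) - 0.25)         # Bethe's value of E_0 / (N J)
    for n in (4, 6, 8):
        print(n, ground_energy_per_site(n), "bounds: -0.75, -0.25; Bethe:", bethe)
```

The finite-ring energies per site fall well inside the interval allowed by (33.17) and lie close to Bethe's value.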


LOW-TEMPERATURE  BEHAVIOR  OF  THE  HEISENBERG 
FERROMAGNET:  SPIN  WAVES 

We  can  not  only  exhibit  the  exact  ground  state  of  the  Heisenberg  ferromagnet,  but 
can  also  find  some  of  its  low-lying  excited  states.  The  knowledge  of  these  states 
underlies  the  theory  of  the  low-temperature  properties  of  the  Heisenberg  ferromagnet. 


11  H. A. Bethe, Z. Physik 71, 205 (1931).
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At  zero  temperature  the  ferromagnet  is  in  its  ground  state  (33.5),  the  mean  “spin” 
of each ion is $S$, and the magnetization density (known as the saturation magnetization)
is

$$M = g\mu_B \frac{N}{V} S. \tag{33.18}$$

When $T \neq 0$, we must weight the mean magnetization of all states with the Boltzmann
factor $e^{-E/k_BT}$. Very near $T = 0$, only low-lying states will have appreciable weight.
To construct some of these low-lying states we examine a state12 $|\mathbf{R}\rangle$ differing from the
ground state $|0\rangle$ only in that the spin at site $\mathbf{R}$ has had its z-component reduced from
$S$ to $S - 1$:

$$|\mathbf{R}\rangle = \frac{1}{\sqrt{2S}}\,S_-(\mathbf{R})|0\rangle. \tag{33.19}$$

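The state $|\mathbf{R}\rangle$ remains an eigenstate of the terms containing $S_z$ in the Hamiltonian
(33.9). Because, however, the spin at $\mathbf{R}$ does not assume its maximum z-component,
$S_+(\mathbf{R})|\mathbf{R}\rangle$ will not vanish, and $S_-(\mathbf{R}')S_+(\mathbf{R})$ simply shifts the site at which the spin
is reduced from $\mathbf{R}$ to $\mathbf{R}'$. Thus13

$$S_-(\mathbf{R}')S_+(\mathbf{R})|\mathbf{R}\rangle = 2S|\mathbf{R}'\rangle. \tag{33.20}$$

If in addition we note that

$$S_z(\mathbf{R}')|\mathbf{R}\rangle = S|\mathbf{R}\rangle, \quad \mathbf{R}' \neq \mathbf{R},$$
$$S_z(\mathbf{R}')|\mathbf{R}\rangle = (S - 1)|\mathbf{R}\rangle, \quad \mathbf{R}' = \mathbf{R}, \tag{33.21}$$

then it follows that

$$\mathcal{H}|\mathbf{R}\rangle = E_0|\mathbf{R}\rangle + g\mu_B H|\mathbf{R}\rangle + S\sum_{\mathbf{R}'} J(\mathbf{R}-\mathbf{R}')\big[|\mathbf{R}\rangle - |\mathbf{R}'\rangle\big], \tag{33.22}$$

where $E_0$ is the ground-state energy (33.11).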

Although $|\mathbf{R}\rangle$ is therefore not an eigenstate of $\mathcal{H}$, $\mathcal{H}|\mathbf{R}\rangle$ is a linear combination
of $|\mathbf{R}\rangle$ and other states with only a single lowered spin. Because $J$ depends on $\mathbf{R}$ and
$\mathbf{R}'$ only in the translationally invariant combination $\mathbf{R} - \mathbf{R}'$, it is straightforward to
find linear combinations of these states that are eigenstates.14 Let

$$|\mathbf{k}\rangle = \frac{1}{\sqrt{N}}\sum_{\mathbf{R}} e^{i\mathbf{k}\cdot\mathbf{R}}|\mathbf{R}\rangle. \tag{33.23}$$

Equation (33.22) implies that

$$\mathcal{H}|\mathbf{k}\rangle = E_{\mathbf{k}}|\mathbf{k}\rangle,$$
$$E_{\mathbf{k}} = E_0 + g\mu_B H + S\sum_{\mathbf{R}} J(\mathbf{R})\big(1 - e^{i\mathbf{k}\cdot\mathbf{R}}\big). \tag{33.24}$$

12  Exercise: Verify that $|\mathbf{R}\rangle$ is normalized to unity.

13  Exercise: Verify that the numerical factor $2S$ is correct.

14  The analysis that follows closely parallels the discussion in Chapter 22 of normal modes in a
harmonic crystal. In particular, the state $|\mathbf{k}\rangle$ can be formed for just $N$ distinct wave vectors lying in the
first Brillouin zone, if we invoke the Born-von Karman boundary condition. Since values of $\mathbf{k}$ that differ
by a reciprocal lattice vector lead to identical states, it suffices to consider only these $N$ values. The reader
should also verify, using the appropriate identities from Appendix F, that the states $|\mathbf{k}\rangle$ are orthonormal:
$\langle \mathbf{k}|\mathbf{k}'\rangle = \delta_{\mathbf{k}\mathbf{k}'}$.
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Taking  advantage  of  the  symmetry,  J(  —  R)  =  J(R),  we  can  write  the  excitation  energy 
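$\mathcal{E}(\mathbf{k})$ of the state $|\mathbf{k}\rangle$ (i.e., the amount by which its energy exceeds that of the ground
state) as

$$\mathcal{E}(\mathbf{k}) = E_{\mathbf{k}} - E_0 = 2S\sum_{\mathbf{R}} J(\mathbf{R})\sin^2\left(\tfrac{1}{2}\mathbf{k}\cdot\mathbf{R}\right) + g\mu_B H. \tag{33.25}$$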

To give a physical interpretation of the state $|\mathbf{k}\rangle$ we note the following:

1. Since $|\mathbf{k}\rangle$ is a superposition of states in each of which the total spin is diminished
from its saturation value $NS$ by one unit, the total spin in the state $|\mathbf{k}\rangle$ itself has
the value $NS - 1$.

2. The probability of the lowered spin being found at a particular site $\mathbf{R}$ in the state
$|\mathbf{k}\rangle$ is $|\langle \mathbf{k}|\mathbf{R}\rangle|^2 = 1/N$; i.e., the lowered spin is distributed with equal probability
among all the magnetic ions.

3. We define the transverse spin correlation function in the state $|\mathbf{k}\rangle$ to be the
expectation value of

$$\mathbf{S}_\perp(\mathbf{R})\cdot\mathbf{S}_\perp(\mathbf{R}') = S_x(\mathbf{R})S_x(\mathbf{R}') + S_y(\mathbf{R})S_y(\mathbf{R}'). \tag{33.26}$$

A straightforward evaluation (Problem 4) gives

$$\langle \mathbf{k}|\mathbf{S}_\perp(\mathbf{R})\cdot\mathbf{S}_\perp(\mathbf{R}')|\mathbf{k}\rangle = \frac{2S}{N}\cos[\mathbf{k}\cdot(\mathbf{R}-\mathbf{R}')], \quad \mathbf{R} \neq \mathbf{R}'. \tag{33.27}$$

Thus on the average each spin has a small transverse component, perpendicular to
the direction of magnetization, of size $(2S/N)^{1/2}$; the orientations of the transverse
components of two spins separated by $\mathbf{R} - \mathbf{R}'$ differ by an angle $\mathbf{k}\cdot(\mathbf{R}-\mathbf{R}')$.

The  microscopic  magnetization  in  the  state  |k>  suggested  by  these  facts  is  pictured 
in  Figure  33.7.  One  describes  the  state  |k>  as  a  state  containing  a  spin  wave  (or 
“magnon”)  of  wave  vector  k  and  energy  £(k)  (Eq.  (33.25)). 
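
The statement that $|\mathbf{k}\rangle$ is an exact eigenstate can be tested by brute force on a small system. The sketch below is our own illustration, not part of the original analysis. It assumes NumPy, a spin 1/2 ring of $n$ sites with unit lattice constant, nearest-neighbor exchange $J$, and zero field, and all function names are hypothetical. It builds the state (33.23) from (33.19), then measures how far $\mathcal{H}|\mathbf{k}\rangle$ is from $[E_0 + \mathcal{E}(\mathbf{k})]|\mathbf{k}\rangle$, with $E_0$ from (33.11) and $\mathcal{E}(\mathbf{k})$ from (33.25). A second function evaluates the transverse correlation (33.26) for comparison with (33.27).

```python
import numpy as np

sz = np.array([[0.5, 0.0], [0.0, -0.5]])
sp = np.array([[0.0, 1.0], [0.0, 0.0]])   # S_+ for spin 1/2
sm = sp.T                                 # S_-

def site_op(op, i, n):
    """Embed a single-site operator at site i of an n-site system."""
    out = np.array([[1.0]])
    for j in range(n):
        out = np.kron(out, op if j == i else np.eye(2))
    return out

def ferro_ring(n, J=1.0):
    """(33.4) with nearest-neighbor J on a ring and H = 0: -J * sum over bonds."""
    H = np.zeros((2 ** n, 2 ** n))
    for i in range(n):
        j = (i + 1) % n
        H -= J * (site_op(sz, i, n) @ site_op(sz, j, n)
                  + 0.5 * (site_op(sp, i, n) @ site_op(sm, j, n)
                           + site_op(sm, i, n) @ site_op(sp, j, n)))
    return H

def magnon_state(n, m):
    """Return k = 2*pi*m/n and the state |k> of (33.23) for S = 1/2."""
    vac = np.zeros(2 ** n)
    vac[0] = 1.0                              # all spins up: the state |0>
    k = 2 * np.pi * m / n
    ket = np.zeros(2 ** n, dtype=complex)
    for R in range(n):
        # |R> = S_-(R)|0> / sqrt(2S), and sqrt(2S) = 1 for S = 1/2
        ket += np.exp(1j * k * R) * (site_op(sm, R, n) @ vac)
    return k, ket / np.sqrt(n)

def eigen_residual(n, m, J=1.0):
    """Norm of H|k> - (E_0 + E(k))|k>; zero if |k> is an eigenstate."""
    k, ket = magnon_state(n, m)
    e0 = -n * J / 4                           # (33.11) with S = 1/2, H = 0
    eps_k = 2 * J * np.sin(k / 2) ** 2        # (33.25) with neighbors at R = +1, -1
    return np.linalg.norm(ferro_ring(n, J) @ ket - (e0 + eps_k) * ket)

def transverse_correlation(n, m, R1, R2):
    """<k| Sx(R1)Sx(R2) + Sy(R1)Sy(R2) |k> for R1 != R2."""
    _, ket = magnon_state(n, m)
    op = 0.5 * (site_op(sp, R1, n) @ site_op(sm, R2, n)
                + site_op(sm, R1, n) @ site_op(sp, R2, n))
    return np.vdot(ket, op @ ket).real

if __name__ == "__main__":
    n = 6
    print([eigen_residual(n, m) for m in range(n)])
    print(transverse_correlation(n, 1, 0, 2), np.cos(2 * np.pi / n * 2) / n)
```

The residuals vanish to rounding error for every allowed $k$, and the correlation reproduces $(2S/N)\cos[k(R - R')]$.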




Figure  33.7 

Schematic  representations  of  the  orientations  in  a  row  of  spins  in  (a)  the  ferromagnetic 
ground  state  and  (b)  a  spin  wave  state. 

These  one-spin-wave  states  are  exact  eigenstates  of  the  Heisenberg  Hamiltonian. 
To calculate the low-temperature properties, one often makes the assumption that
additional many-spin-wave eigenstates with excitation energies $\mathcal{E}(\mathbf{k}_1) + \mathcal{E}(\mathbf{k}_2) + \cdots + \mathcal{E}(\mathbf{k}_{N_0})$
can be constructed by superposing $N_0$ spin waves with wave vectors $\mathbf{k}_1, \ldots, \mathbf{k}_{N_0}$.
On  the  basis  of  the  analogy  with  phonons  in  a  harmonic  crystal  (where  many-phonon 
states  are  exact  stationary  states,  just  as  one-phonon  states  are)  this  appears  to  be  a 
reasonable  assumption.  However,  in  the  case  of  spin  waves  it  is  only  an  approximation. 
Spin  waves  do  not  rigorously  obey  the  superposition  principle.  Nevertheless,  it  has 


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been  shown  that  this  approximation  correctly  reproduces  the  dominant  term  in  the 
low-temperature  spontaneous  magnetization.  We  therefore  pursue  the  approximation 
further,  using  it  to  calculate  M(T),  with  the  warning  that  if  one  wishes  to  go  beyond 
the  leading  correction  to  the  T  =  0  result,  one  must  resort  to  a  far  more  sophisticated 
analysis. 

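If the low-lying excited states of the ferromagnet have excitation energies of the
form

$$\sum_{\mathbf{k}} \mathcal{E}(\mathbf{k})\,n_{\mathbf{k}}, \quad n_{\mathbf{k}} = 0, 1, 2, \ldots, \tag{33.28}$$

then the mean number of spin waves with wave vector $\mathbf{k}$ at temperature $T$ will be
given by15

$$n(\mathbf{k}) = \langle n_{\mathbf{k}}\rangle = \frac{1}{e^{\mathcal{E}(\mathbf{k})/k_BT} - 1}. \tag{33.29}$$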

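Since the total spin is reduced from its saturation value $NS$ by one unit per spin
wave, the magnetization at temperature $T$ satisfies

$$M(T) = M(0)\left[1 - \frac{1}{NS}\sum_{\mathbf{k}} n(\mathbf{k})\right], \tag{33.30}$$

or

$$M(T) = M(0)\left[1 - \frac{V}{NS}\int \frac{d\mathbf{k}}{(2\pi)^3}\,\frac{1}{e^{\mathcal{E}(\mathbf{k})/k_BT} - 1}\right]. \tag{33.31}$$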

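The spontaneous magnetization is evaluated from (33.31) using the form (33.25)
assumed by the $\mathcal{E}(\mathbf{k})$, in the limit of vanishing magnetic field:

$$\mathcal{E}(\mathbf{k}) = 2S\sum_{\mathbf{R}} J(\mathbf{R})\sin^2\left(\tfrac{1}{2}\mathbf{k}\cdot\mathbf{R}\right). \tag{33.32}$$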

At very low temperatures we can evaluate (33.31) in the same way as we extracted
the low-temperature lattice specific heat in Chapter 23. As $T \to 0$, only spin waves
with vanishingly small excitation energies will contribute appreciably to the integral.
Since we have taken all the exchange constants $J(\mathbf{R})$ to be positive, the energy of a
spin wave is vanishingly small only in the limit as $k \to 0$, when it becomes:

$$\mathcal{E}(\mathbf{k}) \approx \frac{S}{2}\sum_{\mathbf{R}} J(\mathbf{R})(\mathbf{k}\cdot\mathbf{R})^2. \tag{33.33}$$


We may insert this form into (33.31) for all $\mathbf{k}$, for when the approximation (33.33)
ceases to be valid, both the exact and approximate $\mathcal{E}(\mathbf{k})$ will be so large that they
will give a negligibly small contribution to the integral as $T \to 0$. For the same reason
we may extend the integration over the first Brillouin zone to all of k-space with
negligible error at low temperatures. If, finally, we make the change of variables
$\mathbf{k} = (k_BT)^{1/2}\mathbf{q}$, we arrive at the result:

$$M(T) = M(0)\left[1 - \frac{V}{NS}(k_BT)^{3/2}\int \frac{d\mathbf{q}}{(2\pi)^3}\,\frac{1}{\exp\left[\frac{S}{2}\sum_{\mathbf{R}} J(\mathbf{R})(\mathbf{q}\cdot\mathbf{R})^2\right] - 1}\right]. \tag{33.34}$$

15 See the analogous discussion for phonons on pages 453-454.

This asserts that as the temperature rises from T = 0, the spontaneous magnetization
should deviate from its saturation value by an amount proportional to $T^{3/2}$,
a result known as the Bloch $T^{3/2}$ law. The $T^{3/2}$ law is well confirmed by experiment16
(Figure 33.8). The result (33.34) has also been shown17 to be the exact leading term
in the low-temperature expansion of the deviation of the spontaneous magnetization
from saturation.

Another implication of (33.34) has also been rigorously verified. In one and two
spatial dimensions the integral in (33.34) diverges at small q. One generally interprets
this to mean that at any nonzero temperature so many spin waves are excited that
the magnetization is completely eliminated. This conclusion, that there can be no
spontaneous magnetization in the one- and two-dimensional isotropic Heisenberg
models, has been proved directly without making the spin wave approximation.18

Spin waves are not peculiar to the isotropic Heisenberg ferromagnet. There is a
spin wave theory of the low-lying excitations of the antiferromagnet which is rather
more complex, as one might guess from the fact that even the antiferromagnetic
ground state is unknown. The theory predicts a spin wave excitation energy that, in
contrast to the ferromagnetic case, is linear in k at long wavelengths.19

Spin wave theories have also been constructed for itinerant models of magnetism.
Quite generally, one expects spin waves to exist whenever there is a direction associated
with the local order that can vary spatially in a continuous manner at a cost
in energy that becomes small as the wavelength of the variation becomes very long.

We noted above that elastic magnetic neutron scattering can reveal magnetic

Figure 33.8

The ratio of the spontaneous magnetization at temperature T to its saturation (T = 0)
value as a function of $(T/T_c)^{3/2}$ for ferromagnetic gadolinium ($T_c$ = 293 K). The
linearity of the curve accords with the Bloch $T^{3/2}$ law. (After F. Holtzberg et al.,
J. Appl. Phys. 35, 1033 (1964).)


16 In isotropic ferromagnets; if there is significant anisotropy in the exchange coupling, the spin
wave excitation energy will not vanish for small k, and the $T^{3/2}$ law fails. See Problem 5.

17 F. Dyson, Phys. Rev. 102, 1230 (1956). Dyson also calculated several of the higher-order corrections.
That his calculation was something of a tour de force is indicated by the fact that prior to his work there
were almost as many disagreeing "corrections" to the $T^{3/2}$ term as there were published papers on the
subject.

18 The proof (N. D. Mermin and H. Wagner, Phys. Rev. Lett. 17, 1133 (1966)) is based on an argument
of P. C. Hohenberg. See N. D. Mermin, J. Phys. Soc. Japan 26 Supplement, 203 (1969) for a review and
for other applications of the method in solids.

19 An elementary phenomenological analysis is given by F. Keffer et al., Am. J. Phys. 21, 250 (1953).

structure, just as elastic nonmagnetic neutron scattering can reveal the spatial arrangement
of the ions. The analogy extends to inelastic scattering: Inelastic magnetic
neutron scattering reveals the spin wave spectrum in the same way as inelastic
nonmagnetic neutron scattering reveals the phonon spectrum. Thus there are
"one-spin-wave" peaks in the magnetic part of the inelastic scattering cross section, in
which the changes in neutron energy and wave vector are given by the excitation
energy and wave vector of a spin wave. Observation of these peaks confirms the $k^2$
dependence of the spin wave excitation energy in ferromagnets (and also the linear
dependence on k in antiferromagnets). (See Figure 33.9.)
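The crossover from the full exchange dispersion (33.32) to its quadratic long-wavelength form (33.33) can be made concrete with a small numerical sketch. The model below is an assumption chosen only for illustration (a simple cubic lattice with a single nearest-neighbor exchange constant J, lattice constant a, and S = 1/2); none of the function names come from the text.

```python
import math

# Assumed illustrative model: simple cubic lattice, nearest-neighbor exchange only.
NEIGHBORS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]


def spin_wave_energy(k, S=0.5, J=1.0, a=1.0):
    """Full dispersion, Eq. (33.32): 2 S sum_R J(R) sin^2(k.R / 2)."""
    return 2 * S * sum(
        J * math.sin(0.5 * a * sum(ki * ri for ki, ri in zip(k, R))) ** 2
        for R in NEIGHBORS
    )


def spin_wave_energy_small_k(k, S=0.5, J=1.0, a=1.0):
    """Long-wavelength form, Eq. (33.33): (S/2) sum_R J(R) (k.R)^2."""
    return 0.5 * S * sum(
        J * (a * sum(ki * ri for ki, ri in zip(k, R))) ** 2 for R in NEIGHBORS
    )


if __name__ == "__main__":
    # The two forms agree as k -> 0 and separate near the zone boundary.
    for kx in (0.01, 0.1, 1.0, 3.0):
        k = (kx, 0.0, 0.0)
        print(kx, spin_wave_energy(k), spin_wave_energy_small_k(k))
```

For this assumed model the quadratic form reduces to $S J a^2 k^2$, which is the parabola seen at small wave vector in a ferromagnet; an anisotropy gap of the kind mentioned in the caption to Figure 33.9 is not included.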


Figure 33.9

Characteristic spin wave spectra as measured by inelastic neutron scattering in (a) a
ferromagnet and (b) an antiferromagnet. (a) Spin wave spectrum for three crystallographic
directions in an alloy of cobalt with 8 percent iron. (R. N. Sinclair and B. N. Brockhouse,
Phys. Rev. 120, 1638 (1960).) The curve is parabolic, as expected for a ferromagnet, with
a gap at q = 0 due to anisotropy (see Problem 5). (b) Spin wave spectrum for two
crystallographic directions in MnF2. (G. G. Low et al., J. Appl. Phys. 35, 998 (1964).)
The curve exhibits the linear small-q behavior characteristic of an antiferromagnet. The
gap at q = 0 is again due to anisotropy.
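As a numerical illustration of the Bloch law, note that when the exponent in (33.34) is isotropic, $\tfrac{S}{2}\sum_{\mathbf{R}} J(\mathbf{R})(\mathbf{q}\cdot\mathbf{R})^2 = Dq^2$, the integral can be done term by term after expanding the Bose factor in a geometric series, giving $\zeta(3/2)/(4\pi D)^{3/2}$. The sketch below checks this against direct numerical integration and also shows the small-q divergence of the corresponding two-dimensional integral. It assumes a simple cubic lattice with nearest-neighbor exchange J (so that $D = SJa^2$ and $V/N = a^3$); this model and all function names are illustrative assumptions, not part of the text.

```python
import math


def zeta(s, n_terms=1000):
    """Riemann zeta for s > 1: partial sum plus an Euler-Maclaurin tail estimate."""
    n = n_terms
    head = sum(j ** -s for j in range(1, n))
    tail = n ** (1 - s) / (s - 1) + 0.5 * n ** -s + s * n ** (-s - 1) / 12
    return head + tail


def bloch_integral_closed_form(D):
    """Integral of d^3q/(2 pi)^3 / (exp(D q^2) - 1) = zeta(3/2) / (4 pi D)^(3/2)."""
    return zeta(1.5) / (4 * math.pi * D) ** 1.5


def bloch_integral_numeric(D, n=100_000):
    """The same integral by the midpoint rule in the radial variable q."""
    q_max = math.sqrt(60.0 / D)  # beyond this the integrand is of order exp(-60)
    h = q_max / n
    total = 0.0
    for i in range(n):
        q = (i + 0.5) * h
        total += q * q / math.expm1(D * q * q)
    return total * h * 4 * math.pi / (2 * math.pi) ** 3


def magnetization_deficit(t, S):
    """1 - M(T)/M(0) from (33.34) for the assumed cubic model, t = k_B T / J."""
    # With J = a = 1 as units, D = S and V/N = 1, so only t and S remain.
    return t ** 1.5 * bloch_integral_closed_form(S) / S


def two_dimensional_integral(D, q_min):
    """Integral of d^2q/(2 pi)^2 / (exp(D q^2) - 1) over |q| > q_min (closed form)."""
    u_min = D * q_min ** 2
    return -math.log(-math.expm1(-u_min)) / (4 * math.pi * D)


if __name__ == "__main__":
    print(bloch_integral_numeric(1.0), bloch_integral_closed_form(1.0))
    for t in (0.05, 0.1, 0.2):
        print(t, magnetization_deficit(t, S=0.5))
    # In two dimensions the result grows without bound as the cutoff is removed.
    for q_min in (1.0, 1e-3, 1e-6):
        print(q_min, two_dimensional_integral(1.0, q_min))
```

The two-dimensional result grows like $\ln(1/q_{\min})$, which is the divergence that signals the absence of spontaneous magnetization in the two-dimensional isotropic model.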


HIGH-TEMPERATURE  SUSCEPTIBILITY 

Except in artificially simplified models, no one has succeeded in calculating the
zero-field susceptibility $\chi(T)$ of the Heisenberg model in closed form when magnetic
interactions are present. It has, however, been possible to compute many terms in
the expansion of the susceptibility in inverse powers of the temperature. The leading
term is inversely proportional to T, independent of the exchange constants, and by
itself gives the Curie's law susceptibility (page 656) characteristic of noninteracting
moments. Subsequent terms give corrections to Curie's law.


The high-temperature expansion starts from the exact identity20

$$\chi(T) = \frac{1}{V}\,\frac{(g\mu_B)^2}{k_B T}\left\langle \Big[\sum_{\mathbf{R}} S_z(\mathbf{R})\Big]^2 \right\rangle_{H=0}. \tag{33.35}$$

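Here the angular brackets denote an equilibrium average in the absence of an applied
field:

$$\langle X \rangle_{H=0} = \frac{\operatorname{Tr}\, X e^{-\beta \mathcal{H}_0}}{\operatorname{Tr}\, e^{-\beta \mathcal{H}_0}}, \tag{33.36}$$

where

$$\mathcal{H}_0 = -\frac{1}{2}\sum_{\mathbf{R},\mathbf{R}'} J(\mathbf{R} - \mathbf{R}')\,\mathbf{S}(\mathbf{R})\cdot\mathbf{S}(\mathbf{R}'). \tag{33.37}$$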


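It is convenient to express the mean square z-component of spin in the form:

$$\left\langle \Big[\sum_{\mathbf{R}} S_z(\mathbf{R})\Big]^2 \right\rangle = \sum_{\mathbf{R},\mathbf{R}'} \Gamma(\mathbf{R}, \mathbf{R}'), \tag{33.38}$$

where $\Gamma$ is the spin correlation function,

$$\Gamma(\mathbf{R}, \mathbf{R}') = \langle S_z(\mathbf{R}) S_z(\mathbf{R}') \rangle_{H=0}. \tag{33.39}$$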

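The leading term in the susceptibility at high temperatures is found by evaluating
$\Gamma$ in the limit as $T \to \infty$ (i.e., $e^{-\mathcal{H}_0/k_B T} \to 1$). In the limit of infinite temperature,
interactions are inconsequential (formally, $e^{-J/k_B T} \to 1$ is both the high-temperature
and the zero-interaction limit) and therefore spins at different sites are completely
uncorrelated. Thus21

$$\langle S_z(\mathbf{R}) S_z(\mathbf{R}') \rangle_0 = \langle S_z(\mathbf{R}) \rangle_0 \langle S_z(\mathbf{R}') \rangle_0 = 0, \qquad \mathbf{R} \neq \mathbf{R}', \tag{33.40}$$

but

$$\langle S_z(\mathbf{R}) S_z(\mathbf{R}) \rangle_0 = \tfrac{1}{3}\langle (\mathbf{S}(\mathbf{R}))^2 \rangle_0 = \tfrac{1}{3} S(S + 1). \tag{33.41}$$

Combining these, we have

$$\langle S_z(\mathbf{R}) S_z(\mathbf{R}') \rangle_0 = \tfrac{1}{3} S(S + 1)\, \delta_{\mathbf{R},\mathbf{R}'}. \tag{33.42}$$

The leading correction to the behavior of $\Gamma$ for $T \to \infty$ is given by retaining the
first term in the expansion of the statistical weight:

$$e^{-\beta \mathcal{H}_0} = 1 - \beta \mathcal{H}_0 + O(\beta \mathcal{H}_0)^2. \tag{33.43}$$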


20 If the spins in the Hamiltonian (33.4) were classical vectors, this would follow directly from
the definition (31.6). The fact that they are operators does not invalidate the derivation, provided that
the component of the total spin along the field commutes with the Hamiltonian.

21 We introduce the notation $\langle X \rangle_0 = \lim_{T \to \infty} \langle X \rangle$. Note that $\langle X \rangle_0 = \operatorname{Tr} X / \operatorname{Tr} 1$.

Inserting this into (33.39) we find that

$$\Gamma(\mathbf{R}, \mathbf{R}') \approx \frac{\tfrac{1}{3} S(S + 1)\, \delta_{\mathbf{R},\mathbf{R}'} - \beta \langle S_z(\mathbf{R}) S_z(\mathbf{R}') \mathcal{H}_0 \rangle_0}{1 - \beta \langle \mathcal{H}_0 \rangle_0}. \tag{33.44}$$


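At infinite T (i.e., in the absence of interactions) we have

$$\langle \mathbf{S}(\mathbf{R}) \cdot \mathbf{S}(\mathbf{R}') \rangle_0 = 0, \quad \mathbf{R} \neq \mathbf{R}', \qquad \langle \mathcal{H}_0 \rangle_0 = 0, \tag{33.45}$$

so the denominator in (33.44) remains unity. The correction to the leading term in
the numerator, however, is

$$\frac{\beta}{2} \sum_{\mathbf{R}_1, \mathbf{R}_2} J(\mathbf{R}_1 - \mathbf{R}_2) \langle S_z(\mathbf{R}) S_z(\mathbf{R}')\, \mathbf{S}(\mathbf{R}_1) \cdot \mathbf{S}(\mathbf{R}_2) \rangle_0. \tag{33.46}$$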

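Because the spins at different sites are independent in the $T \to \infty$ limit, (33.46) fails
to vanish only when $\mathbf{R}_1 = \mathbf{R}$, $\mathbf{R}_2 = \mathbf{R}'$, or vice versa. It therefore reduces to

$$\beta J(\mathbf{R} - \mathbf{R}') \sum_{\mu = x, y, z} \langle S_z(\mathbf{R}) S_\mu(\mathbf{R}) \rangle_0 \langle S_z(\mathbf{R}') S_\mu(\mathbf{R}') \rangle_0. \tag{33.47}$$

Since different components of a given spin are uncorrelated, this further simplifies to

$$\beta J(\mathbf{R} - \mathbf{R}') \langle S_z^2(\mathbf{R}) \rangle_0 \langle S_z^2(\mathbf{R}') \rangle_0 = \beta J(\mathbf{R} - \mathbf{R}') \left[\frac{S(S + 1)}{3}\right]^2. \tag{33.48}$$

Collecting these results, we find that the high-temperature expansion (33.44) yields

$$\Gamma(\mathbf{R}, \mathbf{R}') = \frac{S(S + 1)}{3}\left[\delta_{\mathbf{R},\mathbf{R}'} + \beta\, \frac{S(S + 1)}{3}\, J(\mathbf{R} - \mathbf{R}') + O(\beta J)^2\right]. \tag{33.49}$$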

Thus  at  high  temperatures  the  correlation  function  for  two  distinct  spins  is  simply 
proportional  to  the  exchange  interaction  itself.  This  is  reasonable,  since  one  expects 
a  positive  (i.e.,  ferromagnetic)  exchange  coupling  between  two  spins  to  favor  their 
parallel  alignment  (and  hence  lead  to  a  positive  value  of  their  inner  product),  while 
a  negative  (i.e.,  antiferromagnetic)  coupling  should  favor  antiparallel  alignment.  The 
result  neglects,  however,  the  possibility  that  two  distinct  spins  may  be  more  strongly 
correlated  by  their  common  coupling  to  other  spins  than  by  their  direct  coupling. 
Terms  that  can  be  so  interpreted  do  appear  when  one  carries  the  high-temperature 
expansion to still higher orders in $J/k_B T$.
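The off-diagonal term of (33.49) can be checked on the smallest possible system. For two spin-1/2 moments the Hamiltonian (33.37) reduces to $\mathcal{H}_0 = -J\,\mathbf{S}(1)\cdot\mathbf{S}(2)$ (each pair is counted twice in the double sum), whose eigenstates are a triplet and a singlet, so the correlation function is known exactly. The sketch below compares the exact value with the leading high-temperature term $\beta J[S(S+1)/3]^2$; the two-spin system and the function names are assumptions introduced purely for illustration.

```python
import math


def dimer_zz_correlation(beta_J):
    """Exact <S_z(1) S_z(2)> for two spin-1/2's with H0 = -J S(1).S(2)."""
    # Triplet (3 states): S1.S2 = +1/4, energy -J/4.
    # Singlet (1 state):  S1.S2 = -3/4, energy +3J/4.
    w_triplet = 3 * math.exp(beta_J / 4)
    w_singlet = math.exp(-3 * beta_J / 4)
    s1_dot_s2 = (0.25 * w_triplet - 0.75 * w_singlet) / (w_triplet + w_singlet)
    return s1_dot_s2 / 3  # isotropy: <Sz Sz> is one third of <S1.S2>


def leading_order_zz_correlation(beta_J, S=0.5):
    """Off-diagonal term of Eq. (33.49): beta J [S(S+1)/3]^2."""
    return beta_J * (S * (S + 1) / 3) ** 2


if __name__ == "__main__":
    for beta_J in (0.001, 0.01, 0.1, 1.0, -0.01):
        print(beta_J, dimer_zz_correlation(beta_J), leading_order_zz_correlation(beta_J))
```

The two expressions agree when $|\beta J| \ll 1$ and share the sign of J, as the discussion above requires; the deviations at larger $\beta J$ are the higher-order terms of the expansion.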

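Inserting the correlation function (33.49) into the susceptibility (33.35), using Eq.
(33.38), we find the high-temperature susceptibility:

$$\chi(T) = \frac{N}{V}\,\frac{(g\mu_B)^2}{3 k_B T}\, S(S + 1)\left[1 + \frac{\theta}{T} + O\!\left(\frac{\theta}{T}\right)^2\right], \tag{33.50}$$

where

$$\theta = \frac{S(S + 1)}{3}\,\frac{J_0}{k_B}, \qquad J_0 = \sum_{\mathbf{R}} J(\mathbf{R}). \tag{33.51}$$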


The susceptibility (33.50) has the form of Curie's law (Eq. (31.47)) multiplied by the
correction factor $(1 + \theta/T)$, which is greater than or less than unity according to
whether the coupling is predominantly ferromagnetic or antiferromagnetic.22 Thus
even well above the critical temperature one can obtain a clue as to the nature of the
ordering that sets in below $T_c$ from the temperature dependence of the susceptibility.23
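In practice the sign of $\theta$ is read off from susceptibility data taken well above the ordering temperature. Since (33.50) implies $\chi T = C(1 + \theta/T)$ to first order, a straight-line fit of $\chi T$ against $1/T$ gives the Curie constant C as the intercept and $C\theta$ as the slope. The sketch below does this by ordinary least squares; the data are synthetic, and the function names and numerical values are hypothetical.

```python
def fit_curie_weiss_theta(temperatures, susceptibilities):
    """Least-squares fit of chi*T = C (1 + theta/T); returns (C, theta)."""
    xs = [1.0 / T for T in temperatures]
    ys = [chi * T for T, chi in zip(temperatures, susceptibilities)]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return intercept, slope / intercept


def classify_coupling(theta):
    """Interpret the sign of theta as in the discussion of Eq. (33.50)."""
    if theta > 0:
        return "predominantly ferromagnetic"
    return "predominantly antiferromagnetic"


if __name__ == "__main__":
    # Synthetic high-temperature data (arbitrary units), generated from (33.50)
    # truncated at first order with hypothetical C and theta.
    temps = [300.0 + 50.0 * i for i in range(15)]
    for true_theta in (40.0, -25.0):
        chis = [1.5 / T * (1 + true_theta / T) for T in temps]
        C, theta = fit_curie_weiss_theta(temps, chis)
        print(C, theta, classify_coupling(theta))
```

With real data the fit should be restricted to temperatures high enough that the neglected terms of order $(\theta/T)^2$ are small.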


ANALYSIS  OF  THE  CRITICAL  POINT 

Quantitative  theories  of  magnetic  ordering  have  proved  most  difficult  to  construct 
near  the  critical  temperature  Tc  at  which  the  ordering  disappears.  The  difficulty  is  not 
peculiar  to  the  problem  of  magnetism.  The  critical  points  of  liquid-vapor  transitions, 
superconducting transitions (Chapter 34), the superfluid transition in liquid He$^4$, and
order-disorder  transitions  in  alloys,  to  name  just  a  few,  present  quite  strong  analogies 
and  give  rise  to  quite  similar  theoretical  difficulties. 

One computational approach24 has been to calculate as many terms as possible
in the high-temperature expansion of (for example) the susceptibility, and extrapolate
down in T to the singularity, thereby obtaining the critical temperature as well as
the exponent $\gamma$ (Eq. (33.2)). Highly sophisticated extrapolation techniques have been
developed25 and the value of $\gamma$ thereby obtained is quite compatible with the observed
divergence. Unfortunately, a similar approach cannot easily be used for the spontaneous
magnetization of the Heisenberg model. If the series expansion for M(T) were
known about T = 0, one could extrapolate upward to the singularity, obtaining
both a check on the $T_c$ evaluated by extrapolating the susceptibility downward and
the critical exponent $\beta$ (Eq. (33.1)). Low-temperature
expansions of M(T), however, require calculating corrections to the spin wave approximation.
Although this is possible to a limited extent, it cannot be carried to anywhere near
the level of the systematic procedure available for calculating high-temperature
expansions.
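To give a feeling for how a finite series can locate a singularity, the sketch below uses the elementary ratio method, a simpler relative of the Pade techniques cited in the text and not the method used there. If a function behaves as $(1 - x/x_c)^{-\gamma}$, with x standing for an expansion variable such as $J/k_B T$, the ratios of successive coefficients obey $a_n/a_{n-1} = (1/x_c)[1 + (\gamma - 1)/n]$, so two consecutive ratios determine both $x_c$ and $\gamma$. The test series here is synthetic (a pure power law with hypothetical parameters), and the function names are illustrative.

```python
def power_law_series(x_c, gamma, n_terms):
    """Taylor coefficients a_0 .. a_{n_terms-1} of (1 - x/x_c)^(-gamma)."""
    coeffs = [1.0]
    for n in range(1, n_terms):
        coeffs.append(coeffs[-1] * (n - 1 + gamma) / (n * x_c))
    return coeffs


def ratio_estimates(coeffs):
    """Estimate (x_c, gamma) from the last two coefficient ratios."""
    n = len(coeffs) - 1
    r_n = coeffs[n] / coeffs[n - 1]
    r_prev = coeffs[n - 1] / coeffs[n - 2]
    # n r_n - (n-1) r_{n-1} = 1/x_c when the ratios are exactly linear in 1/n.
    x_c = 1.0 / (n * r_n - (n - 1) * r_prev)
    gamma = 1.0 + n * (r_n * x_c - 1.0)
    return x_c, gamma


if __name__ == "__main__":
    series = power_law_series(x_c=0.25, gamma=1.25, n_terms=12)
    print(ratio_estimates(series))
```

For a genuine high-temperature series the ratios are only asymptotically linear in 1/n, so the estimates drift with the number of terms retained; that drift is what the more sophisticated extrapolation schemes are designed to control.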

Another  approach  is  to  simplify  the  Hamiltonian  still  further.  The  price  that  one 
pays  is  a  model  that  bears  only  a  generic  resemblance  to  the  original  physical  problem, 
except  occasionally  in  peculiar  cases  that  happen  (generally  ex  post  facto)  to  resemble 
the  new  models.  What  one  gains  is  a  model  that  is  considerably  more  tractable 
analytically.  The  detailed  theoretical  analyses  of  such  models  are  of  value  both  for 
what they suggest about the more realistic Heisenberg model, and also as a preliminary
testing ground for various approximation techniques.

By  far  the  most  important  simplification  of  the  Heisenberg  model  is  the  Ising 


22 When this analysis is generalized to more complex crystal structures (which is straightforward),
the result (33.51) provides a way to distinguish simple ferromagnetic solids from simple ferrimagnetic
ones; if a spontaneous magnetization (below $T_c$) is due to a positive exchange interaction (ferromagnetism),
then the term in $1/T^2$ in the high-temperature susceptibility should be positive; if it is due to a negative
(antiferromagnetic) coupling between unlike spins, then the term in $1/T^2$ in the high-temperature
susceptibility should be negative.

23 This is the most important content of a phenomenological modification of Curie's law, known
as the Curie-Weiss law. See the discussion of mean field theory below.

24 This is reviewed by M. E. Fisher, Rep. Progr. Phys. 30 (pt. II), 615 (1967).

25 The most important is the method of Pade approximants. This is reviewed by G. A. Baker, Advances
in Theoretical Physics 1, K. A. Brueckner, ed., Academic Press, New York, 1965.

model, in which the terms in $S_+$ and $S_-$ are simply dropped from the Heisenberg
Hamiltonian (33.9), leaving

$$\mathcal{H}^{\text{Ising}} = -\frac{1}{2}\sum_{\mathbf{R},\mathbf{R}'} J(\mathbf{R} - \mathbf{R}')\, S_z(\mathbf{R}) S_z(\mathbf{R}') - g\mu_B H \sum_{\mathbf{R}} S_z(\mathbf{R}). \tag{33.52}$$

Since all the $S_z(\mathbf{R})$ commute, $\mathcal{H}^{\text{Ising}}$ is explicitly diagonal in the representation in which
each individual $S_z(\mathbf{R})$ is diagonal; i.e., all the eigenfunctions and eigenvalues of the
Hamiltonian are known. In spite of this, calculating the partition function is still a
task of formidable difficulty. However, the high-temperature expansion is more easily
evaluated and can be carried out to more terms than in the Heisenberg model, and
the profound difficulties in the low-temperature expansion disappear (together,
unfortunately, with the Bloch $T^{3/2}$ law).
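That knowing every eigenvalue is not the same as knowing the partition function can be seen in a toy case. The sketch below sums the Boltzmann weights of all $2^N$ configurations of a zero-field Ising ring by brute force and compares the result with the closed form $(2\cosh K)^N + (2\sinh K)^N$ given by the transfer-matrix solution of the one-dimensional model. It assumes the common convention of spin variables $s = \pm 1$ with energy $-J\sum_i s_i s_{i+1}$ and $K = J/k_B T$, which differs from (33.52) only by constant factors; the function names are illustrative.

```python
import itertools
import math


def ising_ring_partition_brute_force(n_sites, K):
    """Z as an explicit sum over all 2^N configurations of a periodic ring."""
    z = 0.0
    for spins in itertools.product((-1, 1), repeat=n_sites):
        bond_sum = sum(spins[i] * spins[(i + 1) % n_sites] for i in range(n_sites))
        z += math.exp(K * bond_sum)
    return z


def ising_ring_partition_closed_form(n_sites, K):
    """Zero-field transfer-matrix result: (2 cosh K)^N + (2 sinh K)^N."""
    return (2 * math.cosh(K)) ** n_sites + (2 * math.sinh(K)) ** n_sites


if __name__ == "__main__":
    for n in (4, 8, 12):
        print(n, ising_ring_partition_brute_force(n, 0.7),
              ising_ring_partition_closed_form(n, 0.7))
```

The brute-force sum doubles in cost with every added site, which is why closed-form results such as this one, or Onsager's in two dimensions, are so valuable.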

Near the critical point, however, one can still do little more than extrapolate the
high-temperature (and low-temperature) expansions, except in the case of the
two-dimensional Ising model with only nearest-neighbor interactions.26 In that one case,
for several simple lattices (e.g., square, triangular, honeycomb) the exact free energy
is known27 in zero magnetic field, together with the spontaneous magnetization. It is
sobering to note that the calculation of these results is among the most impressive
tours de force achieved by theoretical physicists, in spite of the extensive
oversimplifications that have had to be made to construct a model even this tractable.

According to the exact Onsager solution, the specific heat of the two-dimensional
Ising model has a logarithmic singularity in zero magnetic field, as the critical
temperature $T_c$ is approached from either above or below. The spontaneous magnetization
vanishes as $(T_c - T)^{1/8}$, and the susceptibility diverges as $(T - T_c)^{-7/4}$. Note
that these exponents are quite different from the observed values described on page
699, except, perhaps, for the specific heat singularity (a very small power law
divergence being hard to distinguish from a logarithmic singularity). This is a consequence
of the two-dimensional structure of the model. Series expansions in three dimensions
indicate singularities much closer to those observed.

Finally, we note another approach to the critical region arising from the
hypothesis28 that in the neighborhood of $T = T_c$ and H = 0 the magnetic equation of state
should have the form

$$\frac{H}{|T_c - T|^{\beta + \gamma}} = f_{\pm}\!\left(\frac{M}{|T_c - T|^{\beta}}\right), \qquad T \gtrless T_c, \tag{33.53}$$

which is known as a scaling equation of state. Given such a form, one can deduce
certain relations between the exponents describing the critical point singularities
(for example, $\alpha + 2\beta + \gamma = 2$; see Eqs. (33.1), (33.2), and (33.3)) that can only be


26 See, however, the remark on renormalization group methods in footnote 7. The model can also
be analyzed completely in one dimension, but for any finite range of interaction there is no magnetic
ordering at any temperature.

27 The solution was found by L. Onsager, Phys. Rev. 65, 117 (1944). The first published calculation
of the spontaneous magnetization (Onsager reported the result but never published his calculation) is by
C. N. Yang, Phys. Rev. 85, 808 (1952). A relatively accessible version of the Onsager calculation of the free
energy has been given by T. Schultz et al., Rev. Mod. Phys. 36, 856 (1964).

28 B. Widom, J. Chem. Phys. 43, 3898 (1965); L. P. Kadanoff, Physics 2, 263 (1966).

proved29 as inequalities, but appear to be satisfied in real systems as strict equalities.
The scaling concept has been applied to the static correlation function30 and even
to the time-dependent correlation function.31 It has set the direction of many
experiments on the critical point, which have, in turn, confirmed the original conjecture
(see, for example, Figure 33.10). It is only with the recent theoretical work of
K. G. Wilson, however, that a firm basis has been given to the scaling hypothesis.32
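Two small checks may make the scaling statements concrete. First, the relation $\alpha + 2\beta + \gamma = 2$ can be tested on exponent sets quoted in this chapter, taking $\alpha = 0$ as the conventional assignment both for a logarithmic specific heat singularity (two-dimensional Ising model) and for the mean field specific heat; that assignment is an assumption of the sketch. Second, data collapse of the form (33.53) can be demonstrated on an assumed Landau-type mean field equation of state, $H = atM + bM^3$ with $t = (T - T_c)/T_c$, which is not taken from the text and is used only because its scaling function is known in closed form.

```python
from fractions import Fraction

# (alpha, beta, gamma); alpha = 0 is the conventional value for a logarithmic
# or discontinuous specific heat (an assumption of this sketch).
EXPONENTS = {
    "mean field": (Fraction(0), Fraction(1, 2), Fraction(1)),
    "2D Ising": (Fraction(0), Fraction(1, 8), Fraction(7, 4)),
}


def satisfies_scaling_relation(alpha, beta, gamma):
    """Check alpha + 2 beta + gamma = 2 exactly, using rational arithmetic."""
    return alpha + 2 * beta + gamma == 2


def landau_field(M, t, a=1.0, b=1.0):
    """Assumed mean field equation of state near Tc: H = a t M + b M^3."""
    return a * t * M + b * M ** 3


def scaled_point(M, t, beta=0.5, gamma=1.0):
    """Return (M/|t|^beta, H/|t|^(beta+gamma)), the variables of Eq. (33.53)."""
    H = landau_field(M, t)
    return M / abs(t) ** beta, H / abs(t) ** (beta + gamma)


if __name__ == "__main__":
    for name, exps in EXPONENTS.items():
        print(name, satisfies_scaling_relation(*exps))
    # Points taken at different temperatures but equal scaled magnetization
    # land on the same scaled field: one branch above Tc, another below.
    m = 1.3
    for t in (0.01, 0.02, 0.05, -0.01, -0.02, -0.05):
        print(t, scaled_point(m * abs(t) ** 0.5, t))
```

For this assumed equation of state the two branches are $f_{\pm}(m) = \pm a m + b m^3$, so the collapse is exact; with measured data, as in Figure 33.10, the exponents are adjusted until the points fall on common curves.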


Figure 33.10

The magnetic equation of state of nickel near $T_c$ = 627.4 K. If the scaling hypothesis is
correct, there should be two temperature-independent exponents $\beta$ and $\gamma$ such that
$H/|T - T_c|^{\beta+\gamma}$ depends on M and T only in the combination $M/|T - T_c|^{\beta}$. (The functional
relations will not be the same, however, above and below $T_c$.) By plotting
$[M/|1 - (T/T_c)|^{\beta}]^2$ vs. $[H/|1 - (T/T_c)|^{\beta+\gamma}]/[M/|1 - (T/T_c)|^{\beta}]$, one can demonstrate
the extent to which the hypothesis is satisfied. For five different temperatures above
$T_c$ the points so plotted all lie on one universal curve; the same behavior is found
for five different temperatures below $T_c$. The exponents used are $\beta$ = 0.378 and $\gamma$ = 1.34.
(The scales are based on H in gauss and M in emu/gm.) (From J. S. Kouvel and J. B.
Comly, Phys. Rev. Lett. 20, 1237 (1968).)


29 R. B. Griffiths (J. Chem. Phys. 43, 1958 (1965)) gives a large number of thermodynamic inequalities
that can be proved about singular quantities near the critical point.

30 In its simplest form, scaling asserts (M. E. Fisher, J. Math. Phys. 5, 944 (1964)) that the correlation
function has the form


where $\xi(T)$, known as the correlation length, diverges at the critical temperature. It is clear, from the fact
that the susceptibility diverges at the critical point, that the correlation function should acquire a very
long spatial range at $T_c$ (see Eqs. (33.35) and (33.38)). The scaling hypothesis makes the additional
assumptions that the correlation function falls off as a simple power of R at $T_c$, and that it depends on temperature
only through the variable $R/\xi(T)$.

31 B. I. Halperin and P. C. Hohenberg, Phys. Rev. Lett. 19, 700 (1967).

32 See footnote 7, and also F. J. Wegner, Phys. Rev. B5, 4529 (1972).

MEAN FIELD THEORY

The  earliest  attempt  at  a  quantitative  analysis  of  the  ferromagnetic  transition  was  put 
forward  by  P.  Weiss  and  is  known  as  mean  (or  molecular)  field  theory.33  Mean  field 
theory  gives  a  grossly  inadequate  picture  of  the  critical  region,  fails  to  predict  spin 
waves  at  low  temperatures,  and  even  at  high  temperatures  reproduces  without  error 
only  the  leading  correction  to  Curie’s  law.  We  nevertheless  mention  it  here  because 
(a)  the  theory  has  been  so  widely  used  and  quoted  that  one  must  learn  to  recognize 
it and be aware of its inadequacies; (b) when one is confronted with a new situation
(e.g.,  a  particular  complicated  arrangement  of  spins  on  a  crystal  structure  with  several 
types  of  coupling),  mean  field  theory  probably  offers  the  simplest  rough  way  of  sorting 
out  the  types  of  structures  one  might  expect  to  arise;  and  (c)  mean  field  theory  is 
sometimes  taken  as  a  starting  point  for  more  sophisticated  calculations. 

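Suppose that in the Heisenberg Hamiltonian (33.4) we focus our attention on a
particular site $\mathbf{R}$ and isolate from $\mathcal{H}$ those terms containing $\mathbf{S}(\mathbf{R})$:

$$\Delta\mathcal{H} = -\mathbf{S}(\mathbf{R}) \cdot \left(\sum_{\mathbf{R}' \neq \mathbf{R}} J(\mathbf{R} - \mathbf{R}')\,\mathbf{S}(\mathbf{R}') + g\mu_B \mathbf{H}\right). \tag{33.54}$$

This has the form of the energy of a spin in an effective external field:

$$\mathbf{H}_{\text{eff}} = \mathbf{H} + \frac{1}{g\mu_B}\sum_{\mathbf{R}'} J(\mathbf{R} - \mathbf{R}')\,\mathbf{S}(\mathbf{R}'), \tag{33.55}$$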

but the "field" $\mathbf{H}_{\text{eff}}$ is an operator, depending in a complicated way on the detailed
configuration of all the other spins at sites different from $\mathbf{R}$. The mean field
approximation evades this complexity by replacing $\mathbf{H}_{\text{eff}}$ with its thermal equilibrium mean
value. In the case of a ferromagnet34 every spin has the same mean value, which can
be represented in terms of the total magnetization density as

$$\langle \mathbf{S}(\mathbf{R}) \rangle = \frac{V}{N}\,\frac{\mathbf{M}}{g\mu_B}. \tag{33.56}$$

If we replace each spin in Eq. (33.55) by its mean value (33.56) we arrive at the effective
field,

$$\mathbf{H}_{\text{eff}} = \mathbf{H} + \lambda \mathbf{M}, \tag{33.57}$$

where

$$\lambda = \frac{V}{N}\,\frac{J_0}{(g\mu_B)^2}, \qquad J_0 = \sum_{\mathbf{R}} J(\mathbf{R}). \tag{33.58}$$


The  mean  field  theory  of  a  ferromagnet  assumes  that  the  only  effect  of  interactions 
is  to  replace  the  field  each  spin  feels  by  Heff.  This  is  rarely  justified  in  cases  of  practical 


33 The theory is readily generalized to describe all varieties of magnetic ordering, is closely analogous
to the van der Waals theory of the liquid-vapor transition, and is a particular example of a very general
theory of phase transitions due to Landau.

34 For other cases see Problem 7. In general, one makes an initial ansatz for the equilibrium average
of each $\mathbf{S}(\mathbf{R})$, uses that to construct the average mean field, and then requires (self-consistency) that the
equilibrium average of each spin $\mathbf{S}(\mathbf{R})$, computed as if it were a free spin in the average mean field, agree
with the initial ansatz.

interest, since it requires either that individual spin directions do not deviate
drastically from their average values, or that the exchange interaction is of such long range
that many spins contribute to (33.55), with individual spin fluctuations about the
average canceling among themselves.

If we nevertheless make the mean field approximation, then the magnetization
density is given by the solution to

$$M = M_0\!\left(\frac{H_{\text{eff}}}{T}\right), \tag{33.59}$$

where $M_0$ is the magnetization density in the field H at temperature T, calculated in
the absence of magnetic interactions. We computed $M_0$ in Chapter 31, where we
found (Eq. (31.44)) that it depended on H and T only through their ratio, as Eq. (33.59)
takes explicitly into account. If there is a spontaneous magnetization M(T) at a
temperature T, then it will be given by a nonzero solution to (33.59) when the applied
field vanishes. Since $H_{\text{eff}} = \lambda M$ when H = 0, we must have

$$M(T) = M_0\!\left(\frac{\lambda M}{T}\right). \tag{33.60}$$

The possibility of solutions to Eq. (33.60) is most easily investigated graphically.
If we write it as the pair of equations

$$M(T) = M_0(x), \qquad M(T) = \frac{T}{\lambda}\,x, \tag{33.61}$$

then solutions will occur whenever the graph of $M_0(x)$ intersects the straight line
$(T/\lambda)x$ (see Figure 33.11). This will happen at a nonzero value of x if and only if the

Figure 33.11

Graphical solution to the mean field equations (33.61). When T has a value exceeding
$T_c$ (for example, $T = T_0$) there is no solution except M = 0. When T is less than
$T_c$ (for example, $T = T_1, \ldots, T_n$) there are solutions with nonzero M. The critical value
of T, $T_c$, is determined by the geometrical condition that the slope of $M_0(x)$ at the
origin be $T_c/\lambda$.

slope of the straight line, $T/\lambda$, is less than the slope of $M_0(x)$ at the origin, $M_0'(0)$. The
latter slope, however, can be expressed in terms of the zero-field susceptibility $\chi_0$
calculated in the absence of interactions, for

$$\chi_0 = \left(\frac{\partial M_0}{\partial H}\right)_{H=0} = \frac{M_0'(0)}{T}. \tag{33.62}$$

Comparing this with the explicit form (31.47) of Curie's law, we can read off the value
of $M_0'(0)$, and conclude that the critical temperature $T_c$ below which a nonzero
spontaneous magnetization can be found is given by

$$T_c = \frac{N}{V}\,\frac{(g\mu_B)^2}{3k_B}\, S(S + 1)\,\lambda = \frac{S(S + 1)}{3k_B}\, J_0. \tag{33.63}$$


In  Table  33.4  this  prediction  is  compared  with  the  exact  critical  temperatures  for 
several  two-  and  three-dimensional  Ising  models.35  The  actual  critical  temperatures 
are lower than the mean field prediction by as much as a factor of 2. However, agreement
does improve with increasing lattice dimensionality and coordination number,
as one might expect.

Table 33.4

RATIO OF THE EXACT CRITICAL TEMPERATURES TO THOSE PREDICTED
BY MEAN FIELD THEORY (MFT) FOR SEVERAL NEAREST NEIGHBOR ISING
MODELS(a)

LATTICE               DIMENSIONALITY   COORDINATION NUMBER   T_c / T_c^MFT
Honeycomb             2                3                     0.5062173
Square                2                4                     0.5672963
Triangular            2                6                     0.6068256
Diamond               3                4                     0.67601
Simple cubic          3                6                     0.75172
Body-centered cubic   3                8                     0.79385
Face-centered cubic   3                12                    0.8162

(a) The two-dimensional values of $T_c$ are known in closed form; the values in three
dimensions have been computed by extrapolation techniques to the accuracy quoted.

Source: M. E. Fisher, Repts. Prog. Phys. 30 (pt. II), 615 (1967).
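The two-dimensional entries of Table 33.4 can be reproduced from the known closed-form critical temperatures. The sketch below assumes the common convention of spin variables $s = \pm 1$ with energy $-J\sum s_i s_j$ over nearest-neighbor bonds, for which the exact results are $k_B T_c/J = 2/\ln(2 + \sqrt{3})$ (honeycomb), $2/\ln(1 + \sqrt{2})$ (square), and $4/\ln 3$ (triangular), while mean field theory gives $k_B T_c = zJ$ for coordination number z (Eq. (33.63) with $\tfrac{1}{3}S(S+1)$ replaced by $\langle s^2 \rangle = 1$). The dictionary and function names are illustrative.

```python
import math

# lattice -> (coordination number z, exact k_B T_c / J), assuming s = +/-1 and
# energy -J * (sum over nearest-neighbor bonds of s_i s_j).
EXACT_2D = {
    "honeycomb": (3, 2 / math.log(2 + math.sqrt(3))),
    "square": (4, 2 / math.log(1 + math.sqrt(2))),
    "triangular": (6, 4 / math.log(3)),
}


def ratio_to_mean_field(lattice):
    """Exact T_c divided by the mean field value k_B T_c = z J."""
    z, k_tc_over_j = EXACT_2D[lattice]
    return k_tc_over_j / z


if __name__ == "__main__":
    for lattice in EXACT_2D:
        print(lattice, ratio_to_mean_field(lattice))
```

The ratios increase with coordination number, in line with the trend noted in the text.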


Just below $T_c$, Eq. (33.60) gives a spontaneous magnetization that varies as
$(T_c - T)^{1/2}$, regardless of the dimensionality of the lattice (see Problem 6). This is in
striking contrast to the known results $M \sim (T_c - T)^{\beta}$, with $\beta = \tfrac{1}{8}$ for the
two-dimensional Ising model, and $\beta \approx \tfrac{1}{3}$ for most three-dimensional physical and model
systems. Note, though, that the agreement with mean field theory again improves as
the dimensionality increases.36

Near zero temperature mean field theory predicts that the spontaneous magnetization
deviates from its saturation value by a term of order $e^{-J_0 S/k_B T}$ (Problem 9).
This is in striking contrast to the $T^{3/2}$ dependence that a more accurate analysis of
the isotropic37 Heisenberg model predicts and experiments have confirmed.
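Both limiting behaviors can be seen by solving the self-consistency condition (33.60) numerically. The sketch below assumes spin 1/2, for which the free-spin magnetization is a hyperbolic tangent and (33.60) reduces to $m = \tanh(m/t)$, with $m = M(T)/M(0)$ and $t = T/T_c$; the reduction to these variables and the function name are assumptions of the illustration. The nonzero root is found by bisection.

```python
import math


def reduced_magnetization(t, tol=1e-14):
    """Nonzero root of m = tanh(m / t) for t = T/Tc < 1 (spin 1/2 assumed)."""
    if t >= 1.0:
        return 0.0  # above Tc the only solution is m = 0
    lo, hi = 1e-9, 1.0
    # f(m) = m - tanh(m/t) is negative just above m = 0 and positive at m = 1.
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if mid - math.tanh(mid / t) < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    # Near Tc: m^2 approaches 3 (1 - t), i.e. M varies as (Tc - T)^(1/2).
    for t in (0.9, 0.99, 0.999):
        m = reduced_magnetization(t)
        print(t, m, m * m / (3 * (1 - t)))
    # Near T = 0: 1 - m approaches 2 exp(-2/t), an exponentially small deficit.
    for t in (0.3, 0.2, 0.1):
        m = reduced_magnetization(t)
        print(t, 1 - m, 2 * math.exp(-2 / t))
```

For spin 1/2, $2/t = 2T_c/T = J_0 S/k_B T$, so the low-temperature deficit printed here is the $e^{-J_0 S/k_B T}$ term quoted above, to be contrasted with the much larger $T^{3/2}$ deficit of spin wave theory.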


35 When applying (33.63) to the Ising model, $\tfrac{1}{3}S(S + 1)$ must be replaced by the term from which
it arose: the mean value of $S_z^2$ for a randomly oriented spin.

36 It is believed that in more than four dimensions the mean field critical indices are correct.

37 The spontaneous magnetization of the anisotropic Heisenberg model does deviate only
exponentially from saturation. However, $J_0/k_B T$ is replaced by $\Delta J/k_B T$, where $\Delta J$ is a measure of the anisotropy
in the exchange coupling, which is very much smaller than $J_0$ when the anisotropy is weak. See Problem 5.

The susceptibility in the mean field approximation is given by differentiating
(33.59):

$$\chi = \frac{\partial M}{\partial H} = \frac{\partial M_0}{\partial H_{\text{eff}}}\,\frac{\partial H_{\text{eff}}}{\partial H} = \chi_0 (1 + \lambda\chi). \tag{33.64}$$

Hence

$$\chi = \frac{\chi_0}{1 - \lambda\chi_0}, \tag{33.65}$$

where $\chi_0$ is evaluated in the field $H_{\text{eff}}$. Above $T_c$ in the limit of zero applied field, $H_{\text{eff}}$
vanishes and the susceptibility $\chi_0$ assumes the Curie's law form (31.47). Equation
(33.65) then gives a zero-field susceptibility

$$\chi = \frac{\chi_0}{1 - (T_c/T)}. \tag{33.66}$$


This result is identical in form to Curie's law for an ideal paramagnet (Eq. (31.47)),
except that the T in the denominator has been replaced by $T - T_c$, a modification
known as the Curie-Weiss law. The term "law" is an unfortunate one, since near
$T_c$ the measured and calculated susceptibilities of three-dimensional ferromagnets
diverge as an inverse power of $T - T_c$ somewhere between $\tfrac{5}{4}$ and $\tfrac{4}{3}$, rather than as
the simple pole that (33.66) predicts.38 However, the dominant (order $1/T^2$) correction
to the high-temperature Curie's law susceptibility given by (33.66) does agree
with the exact result (33.50), and this is the only real content of the Curie-Weiss law:
the high-temperature correction to the susceptibility of a ferromagnet makes it
larger than the value predicted by Curie's law.39 Corrections beyond the dominant
one at high temperatures disagree with the prediction of (33.66), and therefore once
one leaves the high-temperature regime the Curie-Weiss law is little more than a
particularly simple and not very reliable way of extrapolating the high-temperature
susceptibility series down to lower T.


CONSEQUENCES  OF  DIPOLAR  INTERACTIONS  IN 
FERROMAGNETS:  DOMAINS 

Although  the  critical  temperature  of  iron  is  over  1000  K,  a  piece  of  iron  picked  from 
the  shelf  normally  appears  to  be  “unmagnetized.”  The  same  piece  of  iron,  however, 
is  attracted  by  magnetic  fields  far  more  strongly  than  a  paramagnetic  substance,  and 
can  be  “magnetized”  by  stroking  it  with  a  “permanent  magnet.” 

To explain these phenomena it is necessary to consider the hitherto neglected
magnetic dipolar interactions between the spins. We stressed in Chapter 32 that this
interaction is very weak, the dipolar coupling between nearest neighbors being typically
a thousand times smaller than the exchange coupling. However, the exchange


38 In the two-dimensional Ising model the susceptibility diverges as $(T - T_c)^{-7/4}$, an even wider
departure from the Curie-Weiss prediction. Note once again, however, that the mean field prediction
improves with increasing dimensionality.

39 In antiferromagnets mean field theory leads above $T_c$ to a susceptibility of the form (33.66), but
with a pole at negative $T$ (see Problem 7). Here, again, this result is not reliable except for its prediction
about the sign of the high-temperature correction to Curie’s law.

interaction is quite short-ranged (falling off exponentially with spin separation in a
ferromagnetic insulator) whereas the dipolar interaction is not (falling off only as the
inverse cube of the separation). As a result, the magnetic configuration of a macroscopic
sample can be quite complex, for the dipolar energies become significant when
enormous numbers of spins are involved, and can then considerably alter the spin
configuration favored by the short-range exchange interactions.

In particular, a uniformly magnetized configuration such as we have used to
characterize the ferromagnetic state is exceedingly uneconomical in dipolar energy.
The dipolar energy can be substantially reduced (Figure 33.12) by dividing the specimen
into uniformly magnetized domains of macroscopic size, whose magnetization
vectors point in widely different directions. Such a subdivision is paid for in exchange
energy, for the spins near the boundary of a domain will experience unfavorable
exchange interactions with the nearby spins in the neighboring misaligned domain.
Because, however, the exchange interaction is short-ranged, it is only the spins near
the domain boundaries that will have their exchange energies raised. In contrast, the
gain in magnetic dipolar energy is a bulk effect: because of the long range of the
interaction, the dipolar energy of every spin drops when domains are formed. Therefore,
provided that the domains are not too small, domain formation will be favored
in spite of the vastly greater strength of the exchange interaction. Every spin can
lower its (small) dipolar energy, but only a few (those near domain boundaries) have
their (large) exchange energy raised.

Figure 33.12

A  ferromagnetically  ordered  solid  can  reduce  its  magnetic  dipolar 
energy  by  breaking  up  into  a  complex  structure  of  domains.  Thus 
the  single-domain  structure  (a)  has  a  much  higher  dipolar  energy 
than  the  structure  (b)  consisting  of  two  domains.  (To  see  this 
think  of  the  two  halves  of  (b)  as  being  two  bar  magnets.  To  form 
the  single  domain  (a),  one  of  the  magnets  in  (b)  must  be  reversed, 
thereby  changing  a  configuration  in  which  opposite  poles  are 
near  one  another  to  one  in  which  like  poles  are  near  one  another.) 

The  two-domain  structure  (b)  can  lower  its  dipolar  energy  still 
further  by  producing  the  additional  domains  shown  in  (c). 

The  ease  with  which  a  ferromagnet  below  Tc  retains  or  loses  (by  breaking  up  into 
domains)  its  spontaneous  magnetization,  as  well  as  the  process  by  which  application 
of  a  field  forces  the  spontaneous  magnetization  to  reappear,  is  intimately  tied  to  the 
physics  of  how  domains  alter  their  size  and  orientation.  The  structure  of  the  boundary 
between  two  domains  (known  as  the  domain  wall,  or  Bloch  wall)  plays  an  important 
role in these processes. An abrupt boundary (Figure 33.13a) between two domains is
unnecessarily costly in exchange energy. One can lower the surface energy of a domain
wall by spreading out the reversal of spin direction over many spins.40 For if the spin
reversal is spread over $n$ spins, then as one passes through the wall each spin will be
seen to differ in orientation from its neighbor by an angle $\pi/n$ (Figure 33.13b). In a
crude classical picture the exchange energy of successive pairs will therefore be not
the minimum value $-JS^2$, but rather $-JS^2 \cos(\pi/n) \approx -JS^2[1 - \tfrac{1}{2}(\pi/n)^2]$. Since
it takes $n$ steps to reverse the spin, the cost of achieving a 180° spin reversal down a
line of $n$ spins will be

$$\Delta E = n\left[-JS^2 \cos\frac{\pi}{n} - (-JS^2)\right] \approx \frac{\pi^2}{2n} JS^2, \tag{33.67}$$

which is lower than the cost $2JS^2$ of an abrupt (one-step) reversal by the factor $\pi^2/4n$.
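The estimate (33.67) is easy to check numerically. The sketch below evaluates the exact classical cost of a reversal spread over $n$ equal steps and compares it with the large-$n$ form; it assumes classical spins and sets $J = S = 1$ for convenience (the function names are ours, introduced only for illustration). Putting $n = 1$ in the exact expression gives the cost of the abrupt reversal.

```python
import math


def wall_cost(n, J=1.0, S=1.0):
    """Exchange cost of a 180-degree reversal spread over n steps (classical spins).

    Each neighboring pair is rotated by pi/n and costs
    -J S^2 cos(pi/n) - (-J S^2) relative to perfect alignment.
    """
    return n * (-J * S**2 * math.cos(math.pi / n) + J * S**2)


def wall_cost_approx(n, J=1.0, S=1.0):
    """Large-n form of (33.67): (pi^2 / 2n) J S^2."""
    return math.pi**2 / (2 * n) * J * S**2


if __name__ == "__main__":
    abrupt = wall_cost(1)  # one-step reversal
    for n in (1, 10, 100, 1000):
        exact = wall_cost(n)
        print(n, exact, wall_cost_approx(n), exact / abrupt)
```

The printed ratio falls off as $1/n$, which is the sense in which exchange alone would always favor a thicker wall.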

Figure 33.13

Detailed  view  of  a  portion  of  domain  wall  showing  (a)  an  abrupt 
boundary  and  (b)  a  gradual  boundary.  The  latter  type  is  less  costly 
in  exchange  energy. 

If this were the only consideration, the domain wall would broaden out to a
thickness limited only by dipolar interactions. However, the above analysis assumed
that the exchange coupling between neighboring spins was perfectly isotropic, depending
only on the angle between them. Although interactions in the Heisenberg
Hamiltonian (33.4) have this isotropy, this is only because spin-orbit coupling was
neglected in its derivation. In a real solid the spins will be coupled to the electronic
charge density via spin-orbit coupling, and their energy will therefore depend to some
extent on their absolute orientation with respect to the crystal axes, as well as on their
relative orientation with respect to one another. Although this dependence of the spin
energy on absolute orientation (known as anisotropy energy) may be quite weak, it
will, on the average, contribute a fixed energy per spin to the energy of a line of
deviant spins, and will therefore eventually outweigh the increasingly small reductions
in exchange energy made possible by successive extensions of the domain wall


40 We consider the case where the wall is not so very thick that the dipolar energy of the boundary
itself is consequential.

thickness. Thus in practice the thickness of a domain wall is determined by a balance
between exchange and anisotropy energies.41
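The balance can be illustrated with a rough estimate. Suppose, as an assumption that the text does not make quantitative, that anisotropy adds an average energy $K$ per spin in the wall (the symbol $K$ is introduced here only for illustration). Adding this to the exchange cost (33.67) for a wall $n$ spins thick and minimizing with respect to $n$ gives:

```latex
E(n) \approx \frac{\pi^2}{2n}\, JS^2 + nK ,
\qquad
\frac{dE}{dn} = 0 \;\Rightarrow\;
n^{*} \approx \pi \sqrt{\frac{JS^2}{2K}} ,
\qquad
E(n^{*}) \approx \pi \sqrt{2K\,JS^2} .
```

Since the anisotropy energy is weak compared with the exchange energy, $n^{*}$ comes out large in this estimate: the wall extends over many spins, as in the gradual boundary of Figure 33.13b.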

The  “magnetization”  of  a  piece  of  “unmagnetized”  iron  by  application  of  a  field 
(well below Tc) is a process in which domains are rearranged and reoriented. When a
weak  field  is  applied,  domains  oriented  along  the  field  can  grow  at  the  expense  of 
adversely  oriented  domains  by  smooth  motion  of  the  domain  walls  (Figure  33.14).42 
The  magnetization  process  in  weak  fields  is  reversible:  As  the  aligning  field  returns 
to  zero,  the  domains  revert  to  their  original  forms  (with  zero  bulk  magnetization  for 
the  entire  specimen).  If,  however,  the  aligning  field  is  not  weak,  favorably  aligned 
domains  may  also  extend  themselves  by  irreversible  processes.  For  example,  the 


Figure 33.14

The magnetization process. (a) An unmagnetized specimen. (b) The specimen in a
weak field that favors spin up. The domain of up spins has grown at the expense of the
domain of down spins by motion of the domain wall to the right. In (c) the applied
field is stronger, and domain rotation is starting to take place. The magnetization
curve (conventionally plotted as $B = H + 4\pi M$ vs. $H$) from zero magnetization
(configuration (a) in zero field) up to saturation is shown in the inset. If one subsequently
reduces the field, the magnetization does not return to zero with the field,
and a hysteresis curve (d) results. At the field $-H_c$, $B$ vanishes. Sometimes this is
taken as an alternative definition of the coercive force.

41 Anisotropy energy is also responsible for the observed phenomenon of “easy” and “hard” axes
of magnetization.

42 The gradual reversal of spins in the wall is important for the smoothness of the wall’s motion.
To move an abrupt wall by a series of 180° flips of individual spins would require moving each spin through
a large (exchange) energy barrier.


reversible low-field motion of domain walls may be hindered by crystalline imperfections
through which the wall will pass only if the gain in external field energy is
sufficiently large. When the aligning field is removed, these defects may prevent the
domain walls from returning to their original unmagnetized configuration. It then
becomes necessary to apply a rather strong field in the opposite direction to restore
the unmagnetized configuration. This phenomenon is known as hysteresis, and the
field necessary to restore zero magnetization (usually from saturation) is known as
the coercive force. Evidently the value of the coercive force depends on the state of
preparation of the specimen.

At very large fields it may become energetically favorable for entire domains to
rotate as a whole, in spite of the cost in anisotropy energy. Once magnetized in this
way, it may be quite difficult for a substance to reform into domains unless some
remnant of domain structure is left to provide nucleation centers for the less catastrophic
sort of domain growth through motion of the walls.

CONSEQUENCES OF DIPOLAR INTERACTIONS:
DEMAGNETIZATION  FACTORS 

Finally we note that the dipolar magnetic interactions can result in strong internal
fields acting on each spin site, causing the local field $H$ a spin actually experiences to
differ substantially from the applied external field. The analogous electrical phenomenon
in insulators was discussed in some detail in Chapter 27. Here we merely note,
in addition, that the effect in ferromagnetic materials can be quite large: The internal
local field in a ferromagnet can be thousands of gauss in zero external field. As in the
case of dielectrics, the value of the internal field depends in a complicated way on the
shape of the specimen. One often introduces a “demagnetization factor” to convert
the applied field to the true local field.


PROBLEMS 


1.  Bounds  for  Products  of  Spin  Operators 

(a)  From  the  fact  that  the  eigenstates  of  a  Hermitian  matrix  form  a  complete  orthonormal 
set,  deduce  that  the  largest  (smallest)  diagonal  matrix  element  a  Hermitian  operator  can  have  is 
equal  to  its  largest  (smallest)  eigenvalue. 

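(b) Prove that the largest diagonal matrix element $\mathbf{S}(\mathbf{R}) \cdot \mathbf{S}(\mathbf{R}')$ can have when $\mathbf{R} \neq \mathbf{R}'$ is $S^2$.
(Hint: Write the operator in terms of the square of $\mathbf{S}(\mathbf{R}) + \mathbf{S}(\mathbf{R}')$.)

(c) Prove that the smallest diagonal matrix element $\mathbf{S}(\mathbf{R}) \cdot \mathbf{S}(\mathbf{R}')$ can have is $-S(S + 1)$.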

2.  Bounds  for  the  Ground-State  Energy  of  an  Antiferromagnet 

Derive the lower bound in (33.17) for the ground-state energy of a Heisenberg antiferromagnet
from one of the results in Problem 1. Derive the upper bound in (33.17) from a variational argument,
using as a trial ground state the one described on page 704.

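3. Exact Ground-State Energy of a Simple “Antiferromagnet”

Show that the ground-state energy of the four-spin antiferromagnetic nearest-neighbor Heisenberg
linear chain,

$$\mathcal{H} = J(\mathbf{S}_1 \cdot \mathbf{S}_2 + \mathbf{S}_2 \cdot \mathbf{S}_3 + \mathbf{S}_3 \cdot \mathbf{S}_4 + \mathbf{S}_4 \cdot \mathbf{S}_1), \tag{33.68}$$

is

$$E_0 = -4JS^2\left[1 + \frac{1}{2S}\right]. \tag{33.69}$$

Hint: Write the Hamiltonian in the form

$$\mathcal{H} = \tfrac{1}{2}J\left[(\mathbf{S}_1 + \mathbf{S}_2 + \mathbf{S}_3 + \mathbf{S}_4)^2 - (\mathbf{S}_1 + \mathbf{S}_3)^2 - (\mathbf{S}_2 + \mathbf{S}_4)^2\right]. \tag{33.70}$$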
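A quick arithmetic check of (33.69) is possible once the Hamiltonian is written as in the hint. The sketch below is only a consistency check, not a derivation: it assumes that the three squared operators in the hint can be given simultaneous eigenvalues $S_{\text{tot}}(S_{\text{tot}}+1)$, $S_{13}(S_{13}+1)$, and $S_{24}(S_{24}+1)$ subject to the usual rules for adding angular momenta, and it searches the allowed quantum numbers for the lowest energy. The function names are ours.

```python
from fractions import Fraction


def allowed_totals(s_a, s_b):
    """Total spins allowed when adding spins s_a and s_b: |s_a - s_b|, ..., s_a + s_b."""
    total = abs(s_a - s_b)
    while total <= s_a + s_b:
        yield total
        total += 1


def ground_state_energy(S, J=1):
    """Minimize (J/2)[S_tot(S_tot+1) - S_13(S_13+1) - S_24(S_24+1)] over allowed values."""
    S = Fraction(S)
    best = None
    for s13 in allowed_totals(S, S):
        for s24 in allowed_totals(S, S):
            for s_tot in allowed_totals(s13, s24):
                energy = Fraction(J, 2) * (
                    s_tot * (s_tot + 1) - s13 * (s13 + 1) - s24 * (s24 + 1)
                )
                if best is None or energy < best:
                    best = energy
    return best


def closed_form(S, J=1):
    """Equation (33.69): E0 = -4 J S^2 [1 + 1/(2S)]."""
    S = Fraction(S)
    return -4 * J * S**2 * (1 + 1 / (2 * S))


if __name__ == "__main__":
    for spin in (Fraction(1, 2), Fraction(1), Fraction(3, 2)):
        print(spin, ground_state_energy(spin), closed_form(spin))
```

Exact rational arithmetic is used so that half-integer spins are compared without rounding.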

4. Properties of Spin Wave States

(a) Confirm the normalization in Eqs. (33.19) and (33.20).

(b) Derive Eq. (33.27).

(c) Show that $\langle \mathbf{k} | \mathbf{S}_\perp(\mathbf{R}) | \mathbf{k} \rangle = 0$, i.e., that the phase of the spin wave is unspecified in the
state $|\mathbf{k}\rangle$.

5.  Anisotropic  Heisenberg  Model 

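Consider the anisotropic Heisenberg spin Hamiltonian

$$\mathcal{H} = -\frac{1}{2} \sum_{\mathbf{R}\mathbf{R}'} \left[ J_z(\mathbf{R} - \mathbf{R}')\, S_z(\mathbf{R}) S_z(\mathbf{R}') + J(\mathbf{R} - \mathbf{R}')\, \mathbf{S}_\perp(\mathbf{R}) \cdot \mathbf{S}_\perp(\mathbf{R}') \right] \tag{33.71}$$

with $J_z(\mathbf{R} - \mathbf{R}') > J(\mathbf{R} - \mathbf{R}') > 0$.

(a) Show that the ground state (33.5) and one-spin-wave states (33.23) remain eigenstates of
$\mathcal{H}$, but that the spin wave excitation energies are raised by

$$S \sum_{\mathbf{R}} \left[ J_z(\mathbf{R}) - J(\mathbf{R}) \right]. \tag{33.72}$$

(b) Show that the low-temperature spontaneous magnetization now deviates from saturation
only exponentially in $-1/T$.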

(c)  Show  that  the  argument  on  page  708,  that  there  can  be  no  spontaneous  magnetization 
in  two  dimensions,  no  longer  works. 

6.  Mean  Field  Theory  Near  the  Critical  Point 

For small $x$, the Brillouin function $B_J(x)$ has the form $Ax - Bx^3$, where $A$ and $B$ are positive.

(a) Deduce that as $T$ approaches $T_c$ from below, the spontaneous magnetization of a ferromagnet
vanishes as $(T_c - T)^{1/2}$ according to mean field theory.

(b) Deduce that at $T_c$, the magnetization density $M(H, T_c)$ vanishes as $H^{1/3}$ in mean field
theory. (Observations and calculations indicate an exponent closer to 1/5 for three-dimensional
systems. The exponent for the two-dimensional Ising model is 1/15.)
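The exponent in part (a) can be checked numerically. The sketch below assumes the spin-1/2 form of the mean field equation in reduced units, $m = \tanh(m\,T_c/T)$, where $m$ is the magnetization divided by its saturation value; this particular form, the bisection solver, and the function names are our assumptions for illustration and are not part of the problem statement.

```python
import math


def spontaneous_magnetization(t, tol=1e-14):
    """Nonzero solution m of m = tanh(m / t), with t = T/Tc, found by bisection.

    Spin-1/2 mean field equation in reduced units (an assumption of this sketch).
    """
    if t >= 1.0:
        return 0.0  # only the trivial solution exists at or above Tc
    lo, hi = 1e-12, 1.0
    # f(m) = tanh(m/t) - m is positive below the root and negative above it
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if math.tanh(mid / t) - mid > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def estimated_exponent(t1=0.999, t2=0.9999):
    """Slope of log m against log(1 - t) between two reduced temperatures near Tc."""
    m1 = spontaneous_magnetization(t1)
    m2 = spontaneous_magnetization(t2)
    return math.log(m1 / m2) / math.log((1.0 - t1) / (1.0 - t2))


if __name__ == "__main__":
    print("estimated exponent:", estimated_exponent())
```

The estimate approaches 1/2 as the two temperatures are taken closer to $T_c$, in line with the expansion $Ax - Bx^3$ quoted above.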

7.  Mean  Field  Theory  of  Ferrimagnetism  and  Antiferromagnetism 

Consider a magnetic structure made up of two types of spins that occupy two interpenetrating
sublattices. Let spins within sublattice 1 be coupled by exchange constants $J_1$, within sublattice 2
by $J_2$, and between sublattices 1 and 2 by $J_3$.

(a) Generalize the mean field theory of a simple ferromagnet to this structure, showing that
Eq. (33.59) for the spontaneous magnetization generalizes to two coupled equations for the two
sublattice magnetizations of the form:

$$M_1 = M_0[(H + \lambda_1 M_1 + \lambda_3 M_2)/T],$$

$$M_2 = M_0[(H + \lambda_2 M_2 + \lambda_3 M_1)/T]. \tag{33.73}$$

(b) Deduce from this that above $T_c$ the zero-field susceptibility is the ratio of a polynomial
linear in $T$ to one quadratic in $T$.

(c) Verify that the susceptibility reduces back to the Curie-Weiss form when the ions in the
two sublattices are identical and ferromagnetically coupled ($\lambda_1 = \lambda_2 > 0$, $\lambda_3 > 0$).

(d) Verify that when the ions in the two sublattices are identical ($\lambda_1 = \lambda_2 > 0$) and antiferromagnetically
coupled ($\lambda_3 < 0$) with $|\lambda_3| > |\lambda_1|$, the temperature in the Curie-Weiss “law”
becomes negative.
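Parts (b) through (d) can be explored numerically before attempting the algebra. The sketch below assumes that for small argument the function $M_0$ is linear, $M_0(x) \approx Cx$, with the same constant $C$ on both sublattices (an assumption of ours, in units where Boltzmann's constant is absorbed into $T$), and solves the resulting pair of linear equations for $(M_1 + M_2)/H$. The function names are hypothetical.

```python
def susceptibility(T, lam1, lam2, lam3, C=1.0):
    """Zero-field susceptibility (M1 + M2)/H from the linearized form of (33.73).

    Assumes M0(x) ~ C*x for small x, with the same constant C on both sublattices.
    """
    det = (T - C * lam1) * (T - C * lam2) - (C * lam3) ** 2
    m1_over_h = C * (T - C * lam2 + C * lam3) / det
    m2_over_h = C * (T - C * lam1 + C * lam3) / det
    return m1_over_h + m2_over_h


def curie_weiss(T, theta, C=1.0):
    """Curie-Weiss form for two identical sublattices: 2C / (T - theta)."""
    return 2.0 * C / (T - theta)


if __name__ == "__main__":
    T = 5.0
    # identical sublattices, ferromagnetic coupling: theta = C*(lam1 + lam3) > 0
    print(susceptibility(T, 1.0, 1.0, 0.5), curie_weiss(T, 1.5))
    # identical sublattices, antiferromagnetic coupling: theta < 0
    print(susceptibility(T, 1.0, 1.0, -2.0), curie_weiss(T, -1.0))
```

Under these assumptions the numerator is linear and the denominator quadratic in $T$, and for identical sublattices the expression collapses to a single pole at $C(\lambda_1 + \lambda_3)$, which is negative when $\lambda_3 < 0$ and $|\lambda_3| > |\lambda_1|$.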

8. High-Temperature Susceptibility of Ferrimagnets and Antiferromagnets

Generalize the high-temperature susceptibility expansion to the case of the structure described
in Problem 7, and compare the exact leading ($O(1/T^2)$) correction to Curie’s law to the mean
field result.

9.  Low-Temperature  Spontaneous  Magnetization  in  Mean  Field  Theory 

Show that when $T$ is far below $T_c$, the mean field theory of a ferromagnet predicts a spontaneous
magnetization that differs from its saturation value exponentially in $-1/T$.

In Chapter 32 we found that the independent electron approximation cannot adequately
describe most magnetically ordered solids. In many metals without any
magnetic ordering a still more spectacular failure of the independent electron approximation
sets in abruptly at very low temperatures, where another kind of electronically
ordered state is established, known as the superconducting state. Superconductivity
is not peculiar to a few metals. More than 20 metallic elements can become superconductors
(Table 34.1). Even certain semiconductors can be made superconducting
under suitable conditions,1 and the list of alloys whose superconducting properties
have been measured stretches into the thousands.2


Table 34.1

SUPERCONDUCTING ELEMENTS^a

[Periodic table of the elements, each cell shaded according to the legend below. The shading is not reproduced here.]

Legend:

Superconducting
Superconducting under high pressure or in thin films
Metallic but not yet found to be superconducting
Nonmetallic elements
Elements with magnetic order

a Elements that are superconducting only under special conditions are indicated separately. Note the
incompatibility of superconducting and magnetic order. After G. Gladstone et al., in Parks, op. cit., note 6.

The characteristic properties of metals in the superconducting state appear highly
anomalous when regarded from the point of view of the independent electron approximation.
The most striking features of a superconductor are:


1  Such  as  application  of  high  pressure,  or  preparation  of  the  specimen  in  very  thin  films.  A  striking 
example  of  the  unexpected  ways  in  which  superconductivity  can  be  enhanced  is  provided  by  bismuth: 
Amorphous  bismuth  is  a  superconductor  at  higher  temperatures  than  crystalline  bismuth,  which  makes 
no  sense  at  all  in  the  independent  electron  approximation. 

2  See  B.  W.  Roberts,  Progr.  Cryog.  4,  161  (1964). 

1. A superconductor can behave as if it had no measurable DC electrical resistivity.
Currents have been established in superconductors which, in the absence of any
driving field, have nevertheless shown no discernible decay for as long as people
have had the patience to watch.3

2. A superconductor can behave as a perfect diamagnet. A sample in thermal
equilibrium in an applied magnetic field, provided the field is not too strong,
carries electrical surface currents. These currents give rise to an additional magnetic
field that precisely cancels the applied magnetic field in the interior of the
superconductor.

3. A superconductor usually behaves as if there were a gap in energy of width $2\Delta$
centered about the Fermi energy, in the set of allowed one-electron levels.4 Thus
an electron of energy $\mathcal{E}$ can be accommodated by (or extracted from) a superconductor5
only if $\mathcal{E} - \mathcal{E}_F$ (or $\mathcal{E}_F - \mathcal{E}$) exceeds $\Delta$. The energy gap $\Delta$ increases in
size as the temperature drops, leveling off to a maximum value $\Delta(0)$ at very low
temperatures.

The theory of superconductivity is quite extensive and highly specialized. Like the
theories we have described elsewhere in this book, it is based on the nonrelativistic
quantum mechanics of electrons and ions, but beyond that its similarity to the other
models and theories we have examined diminishes rapidly. The microscopic theory
of superconductivity cannot be described in the language of the independent electron
approximation. Even comparatively elementary microscopic calculations for superconductors
rely on formal techniques (field theoretic methods) which, while conceptually
no more sophisticated than the ordinary methods of quantum mechanics,
require considerable experience and practice before they can be used with confidence
and understanding.

Consequently,  to  a  greater  degree  than  in  other  chapters  we  shall  limit  our  survey 
of  the  theory  of  superconductivity  to  qualitative  descriptions  of  some  of  the  major 
concepts,  together  with  statements  of  a  few  of  the  simpler  predictions.  The  reader 
who  wishes  to  acquire  even  an  elementary  working  knowledge  of  the  subject  must 
consult  one  of  the  many  available  books.6 


3  The  record  appears  to  be  years;  S.  C.  Collins,  quoted  in  E.  A.  Lynton,  Superconductivity ,  Wiley, 
New  York,  1969. 

4 Under a variety of special conditions superconductivity can also occur without an energy gap.
Gapless superconductivity can be produced, for example, by introducing a suitable concentration of
magnetic impurities. A review is given by K. Maki in Superconductivity, R. D. Parks, ed., Dekker, New
York, 1969. In the context of superconductivity, the term “energy gap” always refers to the quantity $\Delta$.

5 This is most directly observed in electron tunneling experiments, which are described below along
with other manifestations of the energy gap.

6 Two fundamental references on the phenomenological theory are F. London, Superfluids, vol. 1,
Wiley, New York, 1954, and Dover, New York, 1954, and D. Shoenberg, Superconductivity, Cambridge,
1962. A very brief survey is given by E. A. Lynton, Superconductivity, Methuen, London, 1969. The microscopic
theory is expounded in J. R. Schrieffer, Superconductivity, W. A. Benjamin, New York, 1964, and
in the final chapter of A. A. Abrikosov, L. P. Gorkov, and I. E. Dzyaloshinski, Methods of Quantum
Field Theory in Statistical Physics, Prentice-Hall, Englewood Cliffs, N.J., 1963. A detailed survey of the
theoretical aspects of the subject has been given by G. Rickayzen, Theory of Superconductivity, Interscience,
New York, 1965, and in somewhat less detail by P. de Gennes, Superconductivity of Metals and Alloys,
W. A. Benjamin, Menlo Park, Calif., 1966. A survey of all aspects of the subject, theoretical and experimental,
by many of the leading experts in the field is Superconductivity, R. D. Parks, ed., Dekker, New
York, 1969.

This chapter is organized as follows:

1.  A  survey  of  the  basic  empirical  facts  about  superconductivity. 

2.  A  description  of  the  phenomenological  London  equation  and  its  relation  to 
perfect  diamagnetism. 

3.  A  qualitative  description  of  the  microscopic  theory  of  Bardeen,  Cooper,  and 
Schrieffer. 

4.  A  summary  of  some  of  the  fundamental  equilibrium  predictions  of  the  microscopic 
theory  and  how  they  compare  with  experiment. 

5.  A  qualitative  discussion  of  the  relationship  between  the  microscopic  theory,  the 
concept  of  an  “order  parameter,”  and  the  transport  properties  of  superconductors. 

6.  A  description  of  the  remarkable  tunneling  phenomena  between  superconductors 
predicted  by  B.  D.  Josephson. 

CRITICAL  TEMPERATURE 

The transition to the superconducting state is a sharp one in bulk specimens. Above
a critical temperature7 $T_c$ the properties of the metal are completely normal; below
$T_c$ superconducting properties are displayed, the most dramatic of which is the absence
of any measurable DC electrical resistance. Measured critical temperatures range
from a few millidegrees Kelvin8 up to a little over 20 K. The corresponding thermal
energy $k_B T_c$ varies from about $10^{-7}$ eV up to a few thousandths of an electron volt.
This is quite minute compared with the energies we have become accustomed to
regarding as significant in solids.9 Transition temperatures of the superconducting
elements are listed in Table 34.2.
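The quoted energy range follows directly from the temperature range. A minimal sketch, using the standard value of Boltzmann's constant in eV/K (the constant name and the helper function are ours):

```python
K_B_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K


def thermal_energy_ev(temperature_kelvin):
    """Return k_B * T in electron volts."""
    return K_B_EV_PER_K * temperature_kelvin


if __name__ == "__main__":
    for label, T in (("1 mK", 1e-3), ("20 K", 20.0)):
        print(label, thermal_energy_ev(T), "eV")
```

A millikelvin corresponds to roughly a tenth of a microelectron volt and 20 K to about two thousandths of an electron volt, both far below the electron volt scale of typical electronic energies in solids.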

PERSISTENT  CURRENTS 

Figure 34.1 displays the resistivity of a superconducting metal vs. temperature as the
critical temperature $T_c$ is crossed. Above $T_c$ the resistivity has the form characteristic
of a normal metal, $\rho(T) = \rho_0 + BT^5$, the constant term arising from impurity10 and
defect scattering, and the term in $T^5$ arising from phonon scattering. Below $T_c$ these
mechanisms lose the power to degrade an electric current and the resistivity drops
abruptly to zero. Currents can flow in a superconductor with no discernible dissipation
of energy.11 There are, however, some limitations:

1.  Superconductivity  is  destroyed  by  application  of  a  sufficiently  large  magnetic 
field  (see  below). 


7  The  critical  temperature  is  that  at  which  the  transition  occurs  in  the  absence  of  an  applied  magnetic 
field.  When  a  magnetic  field  is  present  (see  below)  the  transition  occurs  at  a  lower  temperature,  and  the 
nature  of  the  transition  changes  from  second  order  to  first  order;  i.e.,  there  is  a  latent  heat  in  nonzero  field. 

8  The  lowest  temperatures  at  which  superconductivity  has  been  sought,  to  date. 

9 Thus $\mathcal{E}_F \sim 10$ eV, $\hbar\omega_D \sim 0.1$ eV.

10 We assume there are no magnetic impurities present; see page 687.

11 When Ampère first proposed that magnetism could be understood in terms of electric currents
flowing in individual molecules, it was objected that no currents were known to flow without dissipation.
Ampère persisted in his view and was vindicated by the quantum theory, which permits stationary molecular
states in which a net current flows (see Chapter 31). A solid in the superconducting state is behaving
like one enormous molecule. The presence of an electric current without dissipation in a superconductor
is a dramatic macroscopic manifestation of quantum mechanics.

Table 34.2

VALUES OF Tc AND Hc FOR THE SUPERCONDUCTING ELEMENTS^a

ELEMENT          Tc (K)     Hc (gauss)^b
Al               1.196        99
Cd               0.56         30
Ga               1.091        51
Hf               0.09          -
Hg  α (rhomb)    4.15        411
Hg  β            3.95        339
In               3.40        293
Ir               0.14         19
La  α (hcp)      4.9         798
La  β (fcc)      6.06       1096
Mo               0.92         98
Nb               9.26       1980
Os               0.655        65
Pa               1.4           -
Pb               7.19        803
Re               1.698       198
Ru               0.49         66
Sn               3.72        305
Ta               4.48        830
Tc               7.77       1410
Th               1.368       162
Ti               0.39        100
Tl               2.39        171
U   α            0.68          -
U   γ            1.80          -
V                5.30       1020
W                0.012         1
Zn               0.875        53
Zr               0.65         47

a For type II superconductors, the zero-temperature critical field quoted is obtained from an equal-area
construction: The low-field ($H < H_{c1}$) magnetization is extrapolated linearly to a field $H_c$ chosen to
give an enclosed area equal to the area under the actual magnetization curve.
b At T = 0 (K).

Sources: B. W. Roberts, Progr. Cryog. 4, 161 (1964);
G. Gladstone, M. A. Jensen, and J. R. Schrieffer,
Superconductivity, R. D. Parks, ed., Dekker, New York,
1969; Handbook of Chemistry and Physics, 55th ed.,
Chemical Rubber Publishing Co., Cleveland, 1974-1975.

Figure 34.1

(a) Low-temperature resistivity of a normal metal ($\rho(T) = \rho_0 + BT^5$) containing nonmagnetic
impurities. (b) Low-temperature resistivity of a superconductor (in zero magnetic field)
containing nonmagnetic impurities. At $T_c$, $\rho$ drops abruptly to zero.


2. If the current exceeds a “critical current,” the superconducting state will be
destroyed (Silsbee effect). The size of the critical current (which can be as large
as 100 amp in a 1-mm wire) depends on the nature and geometry of the specimen,
and is related to whether the magnetic field produced by the current exceeds the
critical field at the surface of the superconductor.12

3. A superconductor well below its transition temperature will also respond without
dissipation to an AC electric field provided that the frequency is not too large.
The change from dissipationless to normal response occurs at a frequency $\omega$ of
order $\Delta/\hbar$, where $\Delta$ is the energy gap.
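The connection between the critical current and the critical field in item 2 can be illustrated with the field at the surface of a long straight wire. The sketch below works in SI units and converts to gauss at the end; it assumes that the "1-mm wire" has a diameter of 1 mm (radius 0.5 mm) and that the current is distributed so that the field at the surface is that of a line current. Both assumptions, and the function names, are ours.

```python
import math

MU_0 = 4.0e-7 * math.pi  # vacuum permeability in T*m/A (SI)
GAUSS_PER_TESLA = 1.0e4


def surface_field_gauss(current_amp, wire_radius_m):
    """Field at the surface of a long straight wire, B = mu_0 I / (2 pi r), in gauss."""
    b_tesla = MU_0 * current_amp / (2.0 * math.pi * wire_radius_m)
    return b_tesla * GAUSS_PER_TESLA


def critical_current_amp(critical_field_gauss, wire_radius_m):
    """Current at which the surface field reaches the critical field (Silsbee criterion)."""
    b_tesla = critical_field_gauss / GAUSS_PER_TESLA
    return 2.0 * math.pi * wire_radius_m * b_tesla / MU_0


if __name__ == "__main__":
    radius = 0.5e-3  # assumed radius of a wire 1 mm in diameter
    print(surface_field_gauss(100.0, radius), "gauss at 100 A")
    print(critical_current_amp(803.0, radius), "A for a critical field of 803 gauss")
```

With these assumptions a current of 100 amp produces a surface field of a few hundred gauss, the same order as the critical fields of many of the elemental superconductors.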


THERMOELECTRIC  PROPERTIES 

In  the  independent  electron  approximation  good  electrical  conductors  are  also  good 
conductors  of  heat,  since  the  conduction  electrons  transport  entropy  as  well  as 
electric  charge.13  Superconductors,  contrary  to  this,  are  poor  thermal  conductors 
(Figure 34.2).14 They also exhibit no Peltier effect; i.e., an electric current at uniform
temperature  in  a  superconductor  is  not  accompanied  by  a  thermal  current,  as  it 
would  be  in  a  normal  metal.  The  absence  of  a  Peltier  effect  indicates  that  those 
electrons  that  participate  in  the  persistent  current  carry  no  entropy.  The  poor  thermal 


12 See Problem 3.

13 See page 253.

14 This property is exploited to make thermal switches.

Figure 34.2

The thermal conductivity of lead. Below $T_c$ the lower curve gives the thermal conductivity in
the superconducting state, and the upper curve, in the normal state. The normal sample is produced
below $T_c$ by application of a magnetic field, which is assumed otherwise to have no
appreciable effect on the thermal conductivity. (Reproduced by permission of the National
Research Council of Canada from J. H. P. Watson and G. M. Graham, Can. J. Phys. 41,
1738 (1963).)


conductivity  indicates  that  even  when  a  superconductor  is  not  carrying  an  electric 
current,  only  a  fraction  of  its  conduction  electrons  are  capable  of  transporting 
entropy.15 


MAGNETIC  PROPERTIES:  PERFECT  DIAMAGNETISM 

A  magnetic  field  (provided  that  it  is  not  too  strong)  cannot  penetrate  into  the  interior 
of  a  superconductor.  This  is  most  dramatically  illustrated  by  the  Meissner-Ochsenfeld 
effect:  If  a  normal  metal  in  a  magnetic  field16  is  cooled  below  its  superconducting 
transition  temperature,  the  magnetic  flux  is  abruptly  expelled.  Thus  the  transition, 
when  it  occurs  in  a  magnetic  field,  is  accompanied  by  the  appearance  of  whatever 
surface  currents  are  required  to  cancel  the  magnetic  field  in  the  interior  of  the 
specimen. 

Note that this is not implied by perfect conductivity (i.e., σ = ∞) alone, even
though  perfect  conductivity  does  imply  a  somewhat  related  property:  If  a  perfect 
conductor,  initially  in  zero  magnetic  field,  is  moved  into  a  region  of  nonzero  field  (or 
if  a  field  is  turned  on),  then  Faraday’s  law  of  induction  gives  rise  to  eddy  currents  that 
cancel  the  magnetic  field  in  the  interior.  If,  however,  a  magnetic  field  were  established 
in  a  perfect  conductor,  its  expulsion  would  be  equally  resisted.  Eddy  currents  would 
be  induced  to  maintain  the  field  if  the  sample  were  moved  into  a  field-free  region  (or 
if the applied field were turned off). Thus perfect conductivity implies a time-independent
magnetic field in the interior, but is noncommittal as to the value that field
must  have.  In  a  superconductor,  the  field  is  not  only  independent  of  time,  but  also 
zero. 


15  Presumably  the  efficacy  of  the  phonons  in  conducting  heat  remains  undiminished,  but  this  is 
generally  a  less  important  contribution  to  the  thermal  conductivity  than  that  of  the  conduction  electrons. 

16  A  normal  metal  is  only  weakly  paramagnetic  or  diamagnetic  (no  magnetically  ordered  metals  are 
superconductors)  and  an  applied  magnetic  field  can  penetrate  it. 
We shall examine the relation between perfect conductivity and the Meissner effect
somewhat  more  quantitatively  in  our  discussion  of  the  London  equation  below. 


MAGNETIC  PROPERTIES:  THE  CRITICAL  FIELD 

Consider  a  superconductor  at  a  temperature  T  below  its  critical  temperature  Tc.  As 
a  magnetic  field  H  is  turned  on,  a  certain  amount  of  energy  is  expended  to  establish 
the  magnetic  field  of  the  screening  currents  that  cancels  the  field  in  the  interior  of  the 
superconductor.  If  the  applied  field  is  large  enough  it  will  become  energetically 
advantageous  for  the  specimen  to  revert  back  to  the  normal  state,  allowing  the  field 
to penetrate. For although the normal state has a higher free energy than the superconducting
state below Tc in zero field, at high enough fields this increase in free
energy  will  be  more  than  offset  by  the  lowering  of  magnetic  field  energy  that  occurs 
when  the  screening  currents  disappear  and  the  field  is  allowed  to  enter  the  specimen. 
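
The energy balance just described can be written compactly. The following is a sketch under stated assumptions rather than a result quoted from this section: Gaussian units, the long thin cylinder geometry, and the symbols f_n(T) and f_s(T), introduced here for the zero-field free energy densities of the normal and superconducting states. Excluding a field H from the interior costs magnetic energy H²/8π per unit volume, so the superconducting state is favored only while that cost is less than the free energy difference, and the threshold field, written H_c(T) here, is fixed by equality:

```latex
% f_n, f_s: zero-field free energy densities (symbols assumed for this sketch)
\frac{H^2}{8\pi} < f_n(T) - f_s(T)
\quad \text{(superconducting state favored)},
\qquad
\frac{H_c(T)^2}{8\pi} = f_n(T) - f_s(T).
```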

The  manner  in  which  penetration  occurs  with  increasing  field  strength  depends 
in general on the geometry of the specimen. However, for the simplest geometry (long,
thin, cylindrically shaped samples with their axes parallel to the applied magnetic
field) there are two clearly distinguishable kinds of behavior:

Type I. Below a critical field Hc(T) that increases as T falls below Tc, there is no
penetration of flux; when the applied field exceeds Hc(T) the entire specimen reverts
to the normal state and the field penetrates perfectly.17 The resulting phase diagram
in  the  H-T  plane  is  pictured  in  Figure  34.3.18  One  often  describes  this  type  of  field 
penetration  by  plotting  the  macroscopic  diamagnetic  magnetization  density  M  vs. 
the  applied  field  H  (Figure  34.4a). 

Figure 34.3

The phase boundary between the superconducting and normal states of a type I
superconductor in the H-T plane. The boundary is given by the curve Hc(T).


Type II. Below a lower critical field Hc1(T) there is no penetration of flux; when the
applied field exceeds an upper critical field Hc2(T) > Hc1(T), the entire specimen
reverts to the normal state and the field penetrates perfectly. When the applied field
strength is between Hc1(T) and Hc2(T), there is partial penetration of flux, and the


17 Except for the small diamagnetic and paramagnetic effects characteristic of normal metals.

18 Some quantitative thermodynamic consequences of this behavior are explored in Problem 1.
Figure 34.4

(a) Magnetization curve of a type I superconductor. Below Hc no field penetrates: B = 0
(or M = −H/4π). (See footnote 30 for the distinction between B and H in a superconductor.)
(b) Magnetization curve of a type II superconductor. Below Hc1 behavior is as in the
type I case. Between Hc1 and Hc2, M falls smoothly to zero, and B rises smoothly to H.


sample develops a rather complicated microscopic structure of both normal and
superconducting regions, known as the mixed state.19 The magnetization curve corresponding
to type II behavior is shown in Figure 34.4b.

It  was  proposed  by  A.  A.  Abrikosov,  and  subsequently  confirmed  by  experiment 
(Figure  34.5),  that  in  the  mixed  state  the  field  partially  penetrates  the  sample  in  the 
form  of  thin  filaments  of  flux.  Within  each  filament  the  field  is  high,  and  the  material 
is  not  superconducting.  Outside  of  the  core  of  the  filaments,  the  material  remains 


Figure 34.5

Triangular array of vortex lines emerging through the surface of a Pb0.98In0.02
superconducting foil in a field of 80 gauss normal to the surface. (Courtesy of J. Silcox
and G. Dolan.) The vortices are revealed by the coagulation of fine ferromagnetic
particles. Neighboring vortices are about half a micron apart.


19 Not to be confused with the intermediate state, a configuration a type I superconductor may assume
when  its  shape  is  more  complex  than  a  cylinder  parallel  to  the  field,  in  which  macroscopic  superconducting 
and  normal  regions  are  interleaved  in  such  a  way  as  to  lower  the  magnetic  field  energy  by  more  than  the 
cost  in  free  energy  of  the  normal  regions. 
superconducting, and the field decays in a manner determined by the London equation
(see below). Circulating around each filament is a vortex of screening current.20

Typical critical fields in type I superconductors are about 10² gauss well below
the transition temperature. However, in so-called “hard” type II superconductors the
upper critical field can be as high as 10⁵ gauss, which makes type II materials of
considerable  practical  importance  in  the  design  of  high-field  magnets. 

Low  temperature  critical  fields  for  the  elemental  superconductors  are  given  in 
Table  34.2. 
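
As a rough consistency check on the vortex picture, the spacing of a triangular vortex lattice can be estimated from the statement (see the footnote on vortices) that each vortex encloses one flux quantum hc/2e. The sketch below is an estimate under stated assumptions, not a calculation taken from the text: it uses Gaussian units, assumes the mean flux density in the foil of Figure 34.5 equals the applied 80 gauss, and the function names are illustrative.

```python
import math

# Physical constants in Gaussian (CGS) units.
H_PLANCK = 6.62607015e-27   # erg s
C_LIGHT = 2.99792458e10     # cm / s
E_CHARGE = 4.80320471e-10   # esu


def flux_quantum():
    """Flux enclosed by one vortex, hc/2e, in gauss cm^2."""
    return H_PLANCK * C_LIGHT / (2.0 * E_CHARGE)


def triangular_vortex_spacing(mean_field_gauss):
    """Nearest-neighbor spacing (cm) of a triangular vortex lattice.

    Assumes one flux quantum per vortex, so the area per vortex is
    flux_quantum / B, and a triangular lattice, whose area per lattice
    point is (sqrt(3) / 2) * a**2.
    """
    area_per_vortex = flux_quantum() / mean_field_gauss
    return math.sqrt(2.0 * area_per_vortex / math.sqrt(3.0))


spacing_micron = triangular_vortex_spacing(80.0) * 1.0e4  # cm -> micron
print(f"flux quantum: {flux_quantum():.3e} gauss cm^2")
print(f"vortex spacing at 80 gauss: {spacing_micron:.2f} micron")
```

Under these assumptions the spacing comes out near half a micron, in line with the caption of Figure 34.5.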


SPECIFIC  HEAT 


At low temperatures the specific heat of a normal metal has the form AT + BT³,
where  the  linear  term  is  due  to  electronic  excitations  and  the  cubic  term  is  due  to 
lattice  vibrations.  Below  the  superconducting  critical  temperature  this  behavior  is 
substantially  altered.  As  the  temperature  drops  below  Tc  (in  zero  magnetic  field)  the 
specific  heat  jumps  to  a  higher  value  and  then  slowly  decreases,  eventually  falling 
well  below  the  value  one  would  expect  for  a  normal  metal  (Figure  34.6).  By  applying 


Figure 34.6

Low-temperature specific heat of normal and superconducting aluminum. The normal phase
is produced below Tc by application of a weak (300-gauss) magnetic field, which destroys
the superconducting ordering but has otherwise negligible effect on the specific heat.
The Debye temperature is quite high in aluminum, so the specific heat is dominated by
the electronic contribution throughout this temperature range (as can be seen from the
fact that the normal-state curve is quite close to being linear). The discontinuity at
Tc agrees well with the theoretical prediction (34.22) [cs − cn]/cn = 1.43. Well below
Tc, cs drops far below cn, suggesting the existence of an energy gap. (N. E. Phillips,
Phys. Rev. 114, 676 (1959).)


20  The  term  “vortex”  is  often  used  to  refer  to  the  filaments  themselves,  as  well  as  to  the  structure  of 
the  current  in  the  vicinity  of  each  filament.  It  can  be  shown  that  the  magnetic  flux  enclosed  by  each  vortex 
is just equal to the magnetic flux quantum, hc/2e (see footnote 60).

a  magnetic  field  to  drive  the  metal  into  the  normal  state,  one  can  compare  the  specific 
heats  of  the  superconducting  and  normal  states  below  the  critical  temperature.21 
Such an analysis reveals that in the superconducting state the linear electronic contribution
to the specific heat is replaced by a term that vanishes much more rapidly
at very low temperatures, having a dominant low-temperature behavior of the form
exp(−Δ/kBT). This is the characteristic thermal behavior of a system whose excited
levels are separated from the ground state by an energy 2Δ.22 Both theory (see
Eq. (34.19)) and experiment (see Table 34.3) indicate that the energy gap Δ is of
order kBTc.
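
To see how much faster an activated term vanishes than a linear one, the two forms can be tabulated side by side. This is only an illustrative sketch: prefactors are dropped, temperatures are measured in units of Tc, and the ratio Δ/kBTc = 1.76 is an assumed value of order one rather than a number quoted in this section.

```python
import math


def gap_suppression_factor(t_over_tc, gap_over_kb_tc=1.76):
    """Dominant low-temperature factor exp(-gap / (kB T)).

    gap_over_kb_tc is gap / (kB Tc); the default 1.76 is an assumed
    value of order one, used here only for illustration.
    """
    return math.exp(-gap_over_kb_tc / t_over_tc)


for t in (0.5, 0.2, 0.1):
    linear = t  # normal-state electronic term A*T, in units of A*Tc
    factor = gap_suppression_factor(t)
    print(f"T/Tc = {t:.1f}: linear term ~ {linear:.2f}, "
          f"exponential factor ~ {factor:.1e}")
```

The linear term falls by a factor of five between T/Tc = 0.5 and 0.1, while the exponential factor falls by many orders of magnitude over the same range.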

OTHER  MANIFESTATIONS  OF  THE  ENERGY  GAP 
Normal  Tunneling 

The  conduction  electrons  in  a  superconductor  and  a  normal  metal  can  be  brought 
into  thermal  equilibrium  with  one  another  by  placing  the  metals  into  such  close 
contact  that  they  are  separated  only  by  a  thin  insulating  layer,23  which  the  electrons 
can  cross  by  quantum-mechanical  tunneling.  In  thermal  equilibrium  enough  electrons 
have  passed  from  one  metal  to  the  other  to  make  the  chemical  potentials  of  electrons 
in  both  metals  equal.24  When  both  metals  are  normal,  application  of  a  potential 
difference  then  raises  the  chemical  potential  of  one  metal  with  respect  to  the  other, 
and  further  electrons  tunnel  through  the  insulating  layer.  Such  “tunneling  currents” 
at  normal  metal  junctions  have  been  observed  to  obey  Ohm’s  law.  However,  when 
one  of  the  metals  is  a  superconductor  well  below  its  critical  temperature,  then  no 
current is observed to flow until the potential V reaches a threshold value, eV = Δ
(see Figure 34.7). The size of Δ is in good agreement with the value inferred from
low-temperature  specific  heat  measurements,  confirming  the  picture  of  a  gap  in  the 
density  of  one-electron  levels  in  the  superconductor.  As  the  temperature  is  raised 
toward  Tc,  the  threshold  voltage  declines,25  indicating  that  the  energy  gap  itself  is 
declining  with  increasing  temperature. 

Frequency  Dependent  Electromagnetic  Behavior 

The  response  of  a  metal  to  electromagnetic  radiation  (for  example  the  transmission 
through  thin  films  or  the  reflection  from  bulk  samples)  is  determined  by  the  frequency 
dependent  conductivity.  This  in  turn  depends  on  the  available  mechanisms  for  energy 
absorption  by  the  conduction  electrons  at  the  given  frequency.  Because  the  electronic 
excitation spectrum in the superconducting state is characterized by an energy gap Δ,
one would expect the AC conductivity to differ substantially from its normal state
form at frequencies small compared with Δ/ħ, and to be essentially the same in the
superconducting and normal states at frequencies large compared with Δ/ħ. Except


21  The  normal  specific  heat  is  not  appreciably  affected  by  the  presence  of  a  magnetic  field. 

22  See  point  3,  page  727. 

23  For  example,  the  thin  layer  of  oxide  on  the  surfaces  of  the  two  specimens. 

24  See  page  360. 

25  The  threshold  also  becomes  blurred,  due  to  the  presence  of  thermally  excited  electrons,  which 
require  less  energy  to  tunnel. 
Figure 34.7

(a) Current-voltage relation for electron tunneling through a thin insulating barrier
between two normal metals. For small currents and voltages the relation is linear.

(b) Current-voltage relation for electron tunneling through a thin insulating barrier
between a superconductor and a normal metal. The relation is strongly temperature-dependent.
At T = 0 there is a sharp threshold, which is blurred at higher temperatures
due to the thermal excitation of electrons across the energy gap within the
superconductor.


quite near the critical temperature (see p. 744), Δ/ħ is typically in the range between
microwave and infrared frequencies. In the superconducting state an AC behavior
is observed which is indistinguishable from that in the normal state at optical frequencies.
Deviations from normal state behavior first appear in the infrared, and only
at  microwave  frequencies  does  AC  behavior  fully  displaying  the  lack  of  electronic 
absorption  characteristic  of  an  energy  gap  become  completely  developed. 
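
The frequency scale set by the gap can be checked with a one-line estimate. The sketch below takes Δ to be exactly kBTc, which the text gives only as an order of magnitude, and converts the angular frequency Δ/ħ to an ordinary frequency Δ/h; the function name and the sample values of Tc are illustrative choices.

```python
K_BOLTZMANN = 1.380649e-16  # erg / K
H_PLANCK = 6.62607015e-27   # erg s
C_LIGHT = 2.99792458e10     # cm / s


def gap_frequency_hz(tc_kelvin, gap_over_kb_tc=1.0):
    """Ordinary frequency f = gap / h, corresponding to omega = gap / hbar.

    gap_over_kb_tc = 1.0 is an order-of-magnitude assumption (gap ~ kB Tc).
    """
    gap_erg = gap_over_kb_tc * K_BOLTZMANN * tc_kelvin
    return gap_erg / H_PLANCK


for tc in (1.0, 10.0):
    f = gap_frequency_hz(tc)
    print(f"Tc = {tc:4.1f} K: f ~ {f / 1e9:6.1f} GHz, "
          f"free-space wavelength ~ {C_LIGHT / f:.2f} cm")
```

For critical temperatures of a few kelvin this gives tens to hundreds of gigahertz, that is, wavelengths from roughly a centimeter down to about a millimeter, consistent with the microwave-to-infrared range stated above.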


Acoustic  Attenuation 

When  a  sound  wave  propagates  through  a  metal  the  microscopic  electric  fields  due 
to  the  displacement  of  the  ions  can  impart  energy  to  electrons  near  the  Fermi  level, 
thereby  removing  energy  from  the  wave.26  Well  below  Tc  the  rate  of  attenuation  is 
markedly  lower  in  a  superconductor  than  a  normal  metal,  as  one  would  expect  for 
sound waves, where ħω < 2Δ.


26  See  pages  275-277. 
THE LONDON EQUATION

F.  London  and  H.  London  first  examined  in  a  quantitative  way  the  fundamental  fact 
that  a  metal  in  the  superconducting  state  permits  no  magnetic  field  in  its  interior.27 
Their analysis starts with the two-fluid model of Gorter and Casimir.28 The only crucial
assumption of this model that we shall use is that in a superconductor at temperature
T < Tc, only a fraction ns(T)/n of the total number of conduction electrons are
capable of participating in a supercurrent. The quantity ns(T) is known as the density
of superconducting electrons. It approaches the full electronic density n as T falls
well below Tc, but it drops to zero as T rises to Tc. The remaining fraction of electrons
are assumed to constitute a “normal fluid” of density n − ns that cannot carry an
electric  current  without  normal  dissipation.  The  normal  current  and  the  supercurrent 
are  assumed  to  flow  in  parallel;  since  the  latter  flows  with  no  resistance  whatever,  it 
will  carry  the  entire  current  induced  by  any  small  transitory  electric  field,  and  the 
normal  electrons  will  remain  quite  inert.  Normal  electrons  are  therefore  ignored  in 
the  discussion  that  follows. 

Suppose  that  an  electric  field  momentarily  arises  within  a  superconductor.  The 
superconducting  electrons  will  be  freely  accelerated  without  dissipation  so  that  their 
mean velocity vs will satisfy29

$$ m \frac{d\mathbf{v}_s}{dt} = -e\mathbf{E}. \tag{34.1} $$


Since the current density carried by these electrons is j = −e vs ns, Eq. (34.1) can be
written as

$$ \frac{d\mathbf{j}}{dt} = \frac{n_s e^2}{m}\,\mathbf{E}. \tag{34.2} $$


Note that the Fourier transform of (34.2) gives the ordinary AC conductivity for an
electron gas of density ns in the Drude model, Eq. (1.29), when the relaxation time τ
becomes infinitely large:

$$ \mathbf{j}(\omega) = \sigma(\omega)\,\mathbf{E}(\omega), \qquad \sigma(\omega) = i\,\frac{n_s e^2}{m\omega}. \tag{34.3} $$

Substituting (34.2) into Faraday’s law of induction,

$$ \nabla \times \mathbf{E} = -\frac{1}{c}\frac{\partial \mathbf{B}}{\partial t}, \tag{34.4} $$


27 F. London and H. London, Proc. Roy. Soc. (London), A149, 71 (1935), Physica 2, 341 (1935);
F. London, Superfluids, vol. 1, Wiley, New York, 1954, and Dover, New York, 1954.

28 The two-fluid model is also used to describe superfluid helium-4, and is described in both of the
volumes by F. London, Superfluids, vols. 1 and 2, ibid.

29 We ignore band structure effects throughout this chapter and describe the electrons with free
electron dynamics.
gives the following relation between current density and magnetic field:

$$ \frac{\partial}{\partial t}\left(\nabla \times \mathbf{j} + \frac{n_s e^2}{mc}\,\mathbf{B}\right) = 0. \tag{34.5} $$

This relation, together with the Maxwell equation30

$$ \nabla \times \mathbf{B} = \frac{4\pi}{c}\,\mathbf{j}, \tag{34.6} $$

determines  the  magnetic  fields  and  current  densities  that  can  exist  within  a  perfect 
conductor. 
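
The step from the acceleration equation to (34.5) is short enough to write out. Taking the curl of the relation dj/dt = (n_s e²/m)E that follows from (34.1), and then eliminating ∇ × E with Faraday’s law, gives (in the notation of this section, with no new assumptions beyond interchanging the curl and the time derivative):

```latex
\frac{\partial}{\partial t}\,\nabla\times\mathbf{j}
  = \frac{n_s e^2}{m}\,\nabla\times\mathbf{E}
  = -\frac{n_s e^2}{mc}\,\frac{\partial\mathbf{B}}{\partial t}
\quad\Longrightarrow\quad
\frac{\partial}{\partial t}\left(\nabla\times\mathbf{j}
  + \frac{n_s e^2}{mc}\,\mathbf{B}\right) = 0 .
```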

Note  in  particular  that  any  static  field  B  determines  a  static  current  density  j 
through  Eq.  (34.6).  Since  any  time-independent  B  and  j  are  trivially  solutions  to 
(34.5),  the  two  equations  are  consistent  with  an  arbitrary  static  magnetic  field.  This 
is  incompatible  with  the  observed  behavior  of  superconductors,  which  permit  no 
fields  in  their  interior.  F.  London  and  H.  London  discovered  that  this  characteristic 
behavior  of  superconductors  could  be  obtained  by  restricting  the  full  set  of  solutions 
of  (34.5)  to  those  that  obey31 

$$ \nabla \times \mathbf{j} = -\frac{n_s e^2}{mc}\,\mathbf{B}, \tag{34.7} $$

which is known as the London equation. Equation (34.5), which characterizes any medium
that conducts electricity without dissipation, requires that ∇ × j + (ns e²/mc)B
be independent of time; the more restrictive London equation, which specifically
characterizes superconductors and distinguishes them from mere “perfect conductors,”
requires in addition that the time-independent value be zero.


30  We  assume  that  the  rate  of  time  variation  is  so  slow  that  the  displacement  current  can  be  neglected. 
We  also  take  the  field  in  (34.6)  to  be  B  rather  than  H;  this  is  because  j  represents  the  mean  microscopic 
current  flowing  in  the  superconductor.  The  field  H  would  appear  only  if  we  represented  j  by  an  effective 
magnetization density satisfying ∇ × M = j/c, and defined H in the usual way as H = B − 4πM. In
that case Eq. (34.6) would be replaced by the equation ∇ × H = 0. Given the definitions of H and M,
this would be a completely equivalent formulation.

31  This  is  a  local  relation;  i.e.,  the  current  at  the  point  r  is  related  to  the  field  at  the  same  point. 
A.  B.  Pippard  pointed  out  that,  more  generally,  the  current  at  r  should  be  determined  by  the  field  within 
a  neighborhood  of  the  point  r  according  to  a  relation  of  the  form 

$$ \nabla \times \mathbf{j}(\mathbf{r}) = -\int d\mathbf{r}'\, K(\mathbf{r} - \mathbf{r}')\,\mathbf{B}(\mathbf{r}'), $$

where the kernel K(r) is appreciable only for r less than a length ξ0. The distance ξ0 is one of several
fundamental lengths characterizing a superconductor, all of which, unfortunately, are indiscriminately
referred to as “the coherence length.” In pure materials well below the critical temperature all such
coherence lengths are the same, but near Tc or in materials with short impurity mean free paths, the “coherence
length” may vary from one context to another. We shall avoid this tangle of coherence lengths by restricting
our comments on its significance to the low-temperature pure case, where all coherence lengths agree.
It turns out that in such cases the criterion for whether a superconductor is type I or type II is that the
coherence length be large (type I) or small (type II) compared with the London penetration depth Λ
(Eq. (34.9)).
The reason for replacing (34.5) by the more restrictive London equation is that the
latter leads directly to the Meissner effect.32 Equations (34.6) and (34.7) imply that

$$ \nabla^2 \mathbf{B} = \frac{4\pi n_s e^2}{mc^2}\,\mathbf{B}, \qquad \nabla^2 \mathbf{j} = \frac{4\pi n_s e^2}{mc^2}\,\mathbf{j}. \tag{34.8} $$

These equations, in turn, predict that currents and magnetic fields in superconductors
can exist only within a layer of thickness Λ of the surface, where Λ, known as the
London penetration depth, is given by33

$$ \Lambda = \left(\frac{mc^2}{4\pi n_s e^2}\right)^{1/2}. \tag{34.9} $$


Thus  the  London  equation  implies  the  Meissner  effect,  along  with  a  specific 
picture  of  the  surface  currents  that  screen  out  the  applied  field.  These  currents  occur 
within a surface layer of thickness 10²–10³ Å (well below Tc; the thickness can be
considerably greater near the critical temperature, where ns approaches zero). Within
this same surface layer the field drops continuously to zero. These predictions are
confirmed by the fact that the field penetration is not complete in superconducting
films as thin as or thinner than the penetration depth Λ.
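
The quoted thickness of the screening layer can be checked numerically. The sketch below assumes Gaussian units, takes the penetration depth to be the length (mc²/4πn_s e²)^(1/2) that appears in the equations ∇²B = (4πn_s e²/mc²)B, and uses two illustrative superconducting electron densities chosen as typical metallic orders of magnitude (they are assumptions, not values from the text). The second function evaluates the exponentially decaying field profile in a half-space superconductor.

```python
import math

# Gaussian (CGS) constants.
M_ELECTRON = 9.1093837e-28  # g
C_LIGHT = 2.99792458e10     # cm / s
E_CHARGE = 4.80320471e-10   # esu


def london_depth_cm(n_s_per_cm3):
    """Penetration depth sqrt(m c^2 / (4 pi n_s e^2)) in cm."""
    return math.sqrt(
        M_ELECTRON * C_LIGHT**2 / (4.0 * math.pi * n_s_per_cm3 * E_CHARGE**2)
    )


def field_inside(x_cm, b_surface, depth_cm):
    """Field at depth x in a half-space superconductor: B(0) * exp(-x / depth)."""
    return b_surface * math.exp(-x_cm / depth_cm)


for n_s in (1.0e22, 1.0e23):  # assumed, illustrative densities in cm^-3
    depth = london_depth_cm(n_s)
    print(f"n_s = {n_s:.0e} cm^-3: depth ~ {depth * 1e8:.0f} angstrom, "
          f"B(3 depths)/B(0) = {field_inside(3 * depth, 1.0, depth):.3f}")
```

Both densities give depths of a few hundred angstroms, inside the range quoted above, and because the depth scales as n_s^(-1/2) it grows without bound as n_s approaches zero near Tc.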


MICROSCOPIC  THEORY:  QUALITATIVE  FEATURES 

The microscopic theory of superconductivity was put forth by Bardeen, Cooper, and
Schrieffer in 1957.34 In a broad survey such as this we cannot develop the formalism
necessary for an adequate description of their theory, and can only describe in a
qualitative way the underlying physical principles and the major theoretical
predictions.

The  theory  of  superconductivity  requires,  to  begin  with,  a  net  attractive  interaction 
between  electrons  in  the  neighborhood  of  the  Fermi  surface.  Although  the  direct 
electrostatic  interaction  is  repulsive,  it  is  possible  for  the  ionic  motion  to  “overscreen” 
the  Coulomb  interaction,  leading  to  a  net  attraction.35  We  described  this  possibility 


32  We  shall  see  below  that  the  London  equation  is  also  suggested  by  certain  features  of  the  microscopic 
electronic  ordering. 

33  Consider,  for  example,  the  case  of  a  semiinfinite  superconductor  occupying  the  half  space  x  >  0. 
Then  Eq.  (34.8)  implies  that  the  physical  solutions  decay  exponentially: 

$$ B(x) = B(0)\,e^{-x/\Lambda}. $$

Other  geometries  are  examined  in  Problem  2. 

34 J. Bardeen, L. N. Cooper, and J. R. Schrieffer, Phys. Rev. 108, 1175 (1957). The theory is generally
referred  to  as  the  BCS  theory. 

35  Direct  evidence  that  the  ionic  motion  plays  a  role  in  establishing  superconductivity  is  provided 
by the isotope effect: The critical temperature of different isotopes of a given metallic element varies from
one  isotope  to  another,  frequently  (but  not  always)  as  the  inverse  square  root  of  the  ionic  mass.  The  fact 
that  there  is  any  dependence  on  ionic  mass  demonstrates  that  the  ions  cannot  play  a  merely  static  role 
in  the  transition,  but  must  be  dynamically  involved. 
in Chapter 26, where we found, in a simplified model, that allowing the ions to
move in response to motions of the electrons led to a net interaction between electrons
with wave vectors k and k′ of the form36

$$ v^{\text{eff}}_{\mathbf{k},\mathbf{k}'} = \frac{4\pi e^2}{q^2 + k_0^2}\,\frac{\omega^2}{\omega^2 - \omega_q^2}, \tag{34.10} $$


where ħω is the difference in electronic energies, k0 is the Thomas-Fermi wave vector
(17.50), q is the difference in electron wave vectors, and ωq is the frequency of a phonon
of wave vector q.

Thus  screening  by  the  ionic  motion  can  yield  a  net  attractive  interaction  between 
electrons  with  energies  sufficiently  close  together  (roughly,  separated  by  less  than 
ħωD, a measure of the typical phonon energy). This attraction37 underlies the theory
of  superconductivity. 
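
The sign change that makes the interaction attractive can be seen by evaluating the expression directly. The sketch below codes the form (4πe²/(q² + k0²)) · ω²/(ω² − ωq²) in arbitrary units (e² = 1 and all wave vectors and frequencies dimensionless); the function name and the sample arguments are illustrative assumptions.

```python
import math


def effective_interaction(q, omega, omega_q, k0, e_squared=1.0):
    """Screened interaction (4 pi e^2 / (q^2 + k0^2)) * omega^2 / (omega^2 - omega_q^2).

    Units are arbitrary (e_squared defaults to 1); only the sign and the
    relative size of the result are meaningful in this sketch.
    """
    if omega == omega_q:
        raise ValueError("expression diverges at omega == omega_q")
    screened_coulomb = 4.0 * math.pi * e_squared / (q**2 + k0**2)
    return screened_coulomb * omega**2 / (omega**2 - omega_q**2)


for omega in (0.5, 2.0):  # energy difference below and above the phonon frequency
    v = effective_interaction(q=1.0, omega=omega, omega_q=1.0, k0=1.0)
    kind = "attractive" if v < 0 else "repulsive"
    print(f"omega / omega_q = {omega:.1f}: v_eff = {v:+.3f} ({kind})")
```

For an energy difference below the phonon frequency the result is negative (attractive); above it the interaction reverts to a repulsive, screened Coulomb form.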

Given that electrons whose energies differ by O(ħωD) can experience a net attraction,
the possibility arises that such electrons might form bound pairs.38 This would
appear  to  be  doubtful,  since  in  three  dimensions  two  particles  must  interact  with  a 
certain  minimum  strength  to  form  a  bound  state,  a  condition  that  the  rather  limited 
effective  attraction  would  be  unlikely  to  meet.  However,  Cooper39  argued  that  this 
apparently  implausible  possibility  was  made  quite  likely  by  the  influence  of  the 
remaining  N  —  2  electrons  on  the  interacting  pair,  through  the  Pauli  exclusion 
principle. 

Cooper  considered  the  problem  of  two  electrons  with  an  attractive  interaction  that 
would  be  far  too  weak  to  bind  them  if  they  were  in  isolation.  He  demonstrated, 
however,  that  in  the  presence  of  a  Fermi  sphere  of  additional  electrons40  the  exclusion 
principle  radically  altered  the  two-electron  problem  so  that  a  bound  state  existed  no 
matter  how  weak  the  attraction.  Aside  from  indicating  that  the  net  attraction  need 
not have a minimum strength to bind a pair, Cooper’s calculation also indicated how
the  superconducting  transition  temperature  could  be  so  low  compared  with  all  other 
characteristic  temperatures  of  the  solid.  This  followed  from  the  form  of  his  solution, 
which  gave  a  binding  energy  that  was  very  small  compared  with  the  potential  energy 
of  attraction  when  the  attraction  was  weak. 

Cooper’s  argument  applies  to  a  single  pair  of  electrons  in  the  presence  of  a  normal 
Fermi  distribution  of  additional  electrons.  The  theory  of  Bardeen,  Cooper,  and 


36 See pages 518-519. That such an attraction was possible and might be the source of
superconductivity was first emphasized by H. Fröhlich.

37  Any  other  mechanism  leading  to  a  net  attractive  interaction  between  electrons  near  the  Fermi 
surface  would  also  lead  to  a  superconducting  state  at  low  enough  temperature.  However,  no  cases  of 
superconductivity  due  to  other  mechanisms  have  been  convincingly  established  in  metals. 

38  More  generally,  one  might  inquire  into  the  possibility  of  n  electrons  binding  together,  but  the  weak 
interaction  and  the  Pauli  exclusion  principle  make  the  case  n  =  2  the  most  promising. 

39 L. N. Cooper, Phys. Rev. 104, 1189 (1956).

40  The  degenerate  Fermi  distribution  of  additional  electrons  was  taken  to  play  no  role  other  than 
prohibiting the two electrons from occupying any levels with wave vectors less than kF. Thus the Cooper
calculation was basically a two-electron calculation except that analysis was restricted to states built out
of one-electron levels from which all plane waves with wave vectors less than kF had been excluded. See
Problem 4.
Schrieffer took an essential further step, constructing a ground state in which all
electrons  form  bound  pairs.  This  is  a  considerable  extension  of  the  Cooper  model, 
for  each  electron  now  plays  two  roles:  It  provides  the  necessary  restriction  on  allowed 
wave  vectors  (via  the  exclusion  principle)  that  makes  possible  the  binding  of  other 
pairs  in  spite  of  the  weakness  of  the  attraction;  at  the  same  time,  the  electron  itself  is 
participating  in  one  of  the  bound  pairs. 

The  BCS  approximation  to  the  electronic  ground  state  wave  function  can  be 
described  as  follows:  Group  the  N  conduction  electrons  into  N/2  pairs41  and  let  each 
pair be described by a bound-state wave function φ(rs, r′s′), where r is the electronic
position and s is the spin quantum number. Then consider the N-electron wave
function that is just the product of N/2 identical such two-electron wave functions:

$$ \Psi(\mathbf{r}_1 s_1, \ldots, \mathbf{r}_N s_N) = \phi(\mathbf{r}_1 s_1, \mathbf{r}_2 s_2) \cdots \phi(\mathbf{r}_{N-1} s_{N-1}, \mathbf{r}_N s_N). \tag{34.11} $$

This  describes  a  state  in  which  all  electrons  are  bound,  in  pairs,  into  identical  two- 
electron  states.  However,  it  lacks  the  symmetry  required  by  the  Pauli  principle.  To 
construct  a  state  that  changes  sign  whenever  the  space  and  spin  coordinates  of  any 
two electrons are interchanged, we must antisymmetrize the state (34.11). This leads
to the BCS ground state:42

$$ \Psi_{\text{BCS}} = G\,\Psi. \tag{34.12} $$

It may seem surprising that the state (34.12) satisfies the Pauli principle even though
all the pair wave functions φ appearing in it are identical. Indeed, if we had constructed
a product state analogous to (34.11) out of N identical one-electron levels, subsequent
antisymmetrization would cause it to vanish. The fundamental requirement of antisymmetry
implies that no one-electron level can be doubly occupied when the states
are antisymmetrized products of one-electron levels. However, the requirement of
antisymmetry  does  not  imply  a  corresponding  restriction  on  the  occupancy  of  two- 
electron  levels  in  states  that  are  antisymmetrized  products  of  two-electron  levels.43 
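
The contrast between the two kinds of product state can be checked by brute force on a toy system. The sketch below is an illustration under simplifying assumptions, not the BCS construction itself: four electrons, each described by a single discrete label x in {0, 1, 2, 3} standing for a combined space-and-spin coordinate, a one-electron function f and an antisymmetric pair function phi with arbitrarily chosen integer values, and an antisymmetrizer that sums over all 4! permutations weighted by parity. All names are hypothetical.

```python
import itertools


def parity(perm):
    """+1 for an even permutation, -1 for an odd one (by counting inversions)."""
    inversions = sum(
        1
        for i in range(len(perm))
        for j in range(i + 1, len(perm))
        if perm[i] > perm[j]
    )
    return -1 if inversions % 2 else 1


def antisymmetrize(psi, n_particles, n_levels):
    """Sum psi over all permutations of its arguments, weighted by parity.

    psi maps a tuple of n_particles discrete labels to a number; the result
    is a dict from coordinate tuples to the antisymmetrized amplitude.
    """
    perms = [(p, parity(p)) for p in itertools.permutations(range(n_particles))]
    result = {}
    for coords in itertools.product(range(n_levels), repeat=n_particles):
        result[coords] = sum(
            sign * psi(tuple(coords[i] for i in p)) for p, sign in perms
        )
    return result


N_LEVELS = 4

# Product of four identical one-electron levels f(x); values are arbitrary.
f = [1.0, 2.0, 3.0, 4.0]


def one_electron_product(x):
    return f[x[0]] * f[x[1]] * f[x[2]] * f[x[3]]


# Product of two identical pair functions phi(x, y) = -phi(y, x); values arbitrary.
upper = {(0, 1): 1.0, (0, 2): 2.0, (0, 3): 3.0, (1, 2): 4.0, (1, 3): 5.0, (2, 3): 6.0}
phi = [[0.0] * N_LEVELS for _ in range(N_LEVELS)]
for (a, b), value in upper.items():
    phi[a][b] = value
    phi[b][a] = -value


def pair_product(x):
    return phi[x[0]][x[1]] * phi[x[2]][x[3]]


from_levels = antisymmetrize(one_electron_product, 4, N_LEVELS)
from_pairs = antisymmetrize(pair_product, 4, N_LEVELS)

print(max(abs(v) for v in from_levels.values()))  # vanishes identically
print(from_pairs[(0, 1, 2, 3)])                   # nonzero amplitude
```

The antisymmetrized product of identical one-electron levels is zero everywhere, while the antisymmetrized product of identical pair functions survives and changes sign under interchange of any two coordinates.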

It can be demonstrated that if the state (34.12) is taken as a trial state in a variational
estimate of the ground-state energy, then the optimum choice of φ must lead to a
lower energy than the best choice of Slater determinants (i.e., the best independent
electron trial function) for any attractive interaction, no matter how weak.

In the BCS theory the pair wave functions φ are taken to be singlet states;44 i.e.,


41 The odd electron (if N is odd) is of no significance in the limit of a large system.

42 The antisymmetrizer G simply adds to the function it acts upon each of the N! − 1 other functions
obtained by all possible permutations of the arguments, weighted with +1 or −1 according to whether
the permutation is constructed out of an even or odd number of pair interchanges.

43 This is why it is possible for a pair of fermions to behave statistically like a boson. Indeed, if the
binding energy of each pair were so strong that the size of the pair were small compared with the
interparticle spacing rs, then the ground state would consist of N/2 bosons, all condensed into the same
two-electron level. As we shall see, however, the size of a Cooper pair is large compared with rs, and it can be
highly misleading to view the Cooper pairs as independent bosons.

44 If the pair states were triplets (spin 1) this would imply characteristic magnetic properties that
are not observed. Triplet pairing has, however, been observed in liquid helium-3, a degenerate Fermi
liquid that bears many resemblances to the electron gas in metals. See, for example, Nobel Symposium 24,
Collective Properties of Physical Systems, B. Lundqvist and S. Lundqvist, eds., Academic Press, New York,
1973, pages 84-120.
the two electrons in the pair have opposite spin and the orbital part of the wave
function, φ(r, r'), is symmetric. If the pair state is chosen to be translationally invariant
(ignoring possible complications due to the periodic potential of the lattice) so that
φ(r, r') has the form χ(r − r'), then one can write:


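$$\chi(\mathbf{r} - \mathbf{r}') = \sum_{\mathbf{k}} \chi_{\mathbf{k}}\, e^{i\mathbf{k}\cdot\mathbf{r}}\, e^{-i\mathbf{k}\cdot\mathbf{r}'}. \tag{34.13}$$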

Thus χ can be viewed as a superposition of products of one-electron levels in each
term  of  which  electrons  with  equal  and  opposite  wave  vectors  are  paired.45 

One result of the variational calculation of ΨBCS is that the spatial range ξ0 of the
pair wave function46 is very large compared with the spacing between electrons rs.
A crude estimate of ξ0 can be constructed as follows: The pair wave function φ(r) is
presumably a superposition of one-electron levels with energies within O(Δ) of ℰF,
since outside of that energy range tunneling experiments indicate that the one-electron
level density is little altered from the form it has in a normal metal. The spread in
momenta of the one-electron levels making up the pair state is therefore fixed by the
condition

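$$\Delta \approx \delta\mathcal{E} = \delta\!\left(\frac{p^2}{2m}\right) \approx \frac{p_F}{m}\,\delta p = v_F\,\delta p. \tag{34.14}$$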

The spatial range of φ(r) is thus of order

$$\xi_0 \sim \frac{\hbar}{\delta p} \sim \frac{\hbar v_F}{\Delta} \sim \frac{1}{k_F}\,\frac{\mathcal{E}_F}{\Delta}. \tag{34.15}$$

Since ℰF is typically 10³ to 10⁴ times Δ, and kF is of order 10⁸ cm⁻¹, ξ0 is typically 10³ Å.

Thus  within  the  region  occupied  by  any  given  pair  will  be  found  the  centers  of 
many  (millions,  or  more)  pairs.  This  is  a  very  crucial  feature  of  the  superconducting 
state:  The  pairs  cannot  be  thought  of  as  independent  particles,  but  are  spatially 
interlocked  in  a  very  intricate  manner,  which  is  essential  to  the  stability  of  the  state. 
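
The order-of-magnitude estimate (34.15) is easy to check numerically. The following is a minimal sketch, assuming the representative values quoted above (kF of order 10⁸ cm⁻¹ and ℰF/Δ between 10³ and 10⁴); the function name and default argument are illustrative and not part of the text.

```python
import math

ANGSTROM_PER_CM = 1.0e8  # 1 cm = 10^8 angstrom


def coherence_length_angstrom(fermi_to_gap_ratio, k_fermi_per_cm=1.0e8):
    """Crude estimate xi_0 ~ (1/k_F) * (E_F / Delta) from Eq. (34.15).

    fermi_to_gap_ratio : E_F / Delta (dimensionless)
    k_fermi_per_cm     : Fermi wave vector in cm^-1 (assumed typical value)
    """
    xi_cm = fermi_to_gap_ratio / k_fermi_per_cm
    return xi_cm * ANGSTROM_PER_CM


if __name__ == "__main__":
    for ratio in (1.0e3, 1.0e4):
        xi = coherence_length_angstrom(ratio)
        print(f"E_F/Delta = {ratio:.0e}  ->  xi_0 ~ {xi:.0e} angstrom")
```

With these inputs the estimate lands at 10³ to 10⁴ Å, which is large compared with an interelectron spacing of a few angstroms.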

The  above  description  summarizes  the  essential  features  of  the  electronic  ground 
state  in  a  superconductor.  To  describe  the  excited  states,  or  the  thermal  or  transport 
properties  of  a  superconductor,  one  must  resort  to  more  sophisticated  formalisms. 
We  shall  not  go  into  these  here,  except  to  emphasize  that  the  underlying  physical 
picture  remains  that  of  a  system  of  paired  electrons.  In  nonequilibrium  processes  the 
pair  state  can  be  more  complex.  At  nonzero  temperatures  a  fraction  of  the  pairs  are 
thermally  dissociated,  and  the  density  of  superconducting  electrons  ns  is  determined 
by the fraction that remain paired. Furthermore, because of the intricate self-consistent
nature of the pairing, the thermal dissociation of some of the pairs at nonzero


45  This  aspect  of  the  ground  state  is  often  emphasized,  and  the  assertion  is  made  that  electrons  with 
opposite  spins  and  wave  vectors  are  bound  in  pairs.  This  is  no  more  (or  less)  accurate  than  the  assertion 
that  any  translationally  invariant  bound  state  of  two  identical  particles  pairs  them  with  equal  and  opposite 
momenta;  i.e.,  the  assertion  correctly  focuses  attention  on  the  fact  that  the  total  momentum  of  the  bound 
pair  is  zero,  but  it  misleadingly  distracts  attention  from  the  fact  that  the  state  is  a  superposition  of  such 
pairs,  and  therefore  localized  in  the  relative  position  coordinate  (unlike  a  single  product  of  plane  waves). 

46 In pure superconductors well below Tc this turns out to be the same as the coherence length,
described in footnote 31. It is therefore denoted by the same symbol.
temperatures results in a temperature dependence in the characteristic properties
(for example, the range of the pair function) of those pairs that remain bound. As T
rises through Tc all pairs become dissociated, and the ground state reverts continuously
back to the normal ground state of the independent electron approximation.


QUANTITATIVE  PREDICTIONS  OF  THE  ELEMENTARY 
MICROSCOPIC  THEORY 

In  its  simplest  form  the  BCS  theory  makes  two  gross  oversimplifications  in  the  basic 
Hamiltonian  that  describes  the  conduction  electrons: 

1.  The  conduction  electrons  are  treated  in  the  free  electron  approximation;  band 
structure  effects  are  ignored. 

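2.  The rather complicated net attractive interaction47 (34.10) between electrons near
the Fermi energy is further simplified to an effective interaction V. The matrix
element of V between a two-electron state with electronic wave vectors k1 and
k2, and another with wave vectors k3 and k4, is taken in a volume Ω to be

$$\langle \mathbf{k}_1 \mathbf{k}_2 | V | \mathbf{k}_3 \mathbf{k}_4 \rangle =
\begin{cases}
-V_0/\Omega, & \text{when } \mathbf{k}_1 + \mathbf{k}_2 = \mathbf{k}_3 + \mathbf{k}_4 \text{ and } |\mathcal{E}(\mathbf{k}_i) - \mathcal{E}_F| < \hbar\omega,\ i = 1, \ldots, 4, \\
0, & \text{otherwise.}
\end{cases} \tag{34.16}$$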

The restriction on wave vectors is required for any translationally invariant potential;
the significant aspect of the interaction (34.16) is the attraction experienced whenever
all four free electron energies are within an amount ħω (usually taken to be of
order ħωD) of the Fermi energy.

Equation  (34.16)  is  a  gross  oversimplification  of  the  actual  net  interaction,  and  any 
results  depending  on  its  detailed  features  are  to  be  viewed  with  suspicion.  Fortunately, 
the  theory  predicts  a  number  of  relations  from  which  the  two  phenomenological 
parameters V0 and ħω are absent. These relations are rather well obeyed by a large
class of superconductors, with certain notable exceptions (such as lead and mercury).
Even these exceptions, known as “strong coupling superconductors,” have been convincingly
brought into the more general framework of the BCS theory, provided
that  the  simplifications  inherent  in  the  approximate  interaction  (34.16)  are  abandoned, 
along  with  certain  other  overly  simple  representations  of  the  effects  of  phonons.48 

From  the  model  Hamiltonian  (34.16),  the  BCS  theory  deduces  the  following  major 
equilibrium  predictions: 


47 One must not forget that even Eq. (34.10) is a comparatively crude representation of the detailed
dynamic interaction induced among the electrons by the phonons. In so-called strong coupling
superconductors (see below) even Eq. (34.10) is inadequate.

48  In  the  theory  of  strong-coupling  superconductors  one  treats  the  full  electron-phonon  system, 
without  at  the  start  trying  to  eliminate  the  phonons  in  favor  of  an  effective  interaction  of  the  form  (34.16), 
or even (34.10). As a result, the net interaction between electrons is more complicated and no longer
instantaneous but retarded. Furthermore, the lifetimes due to electron-phonon scattering of the electronic
levels within ħωD from the Fermi level may be so short that the picture of well-defined one-electron levels out of
which  pairs  are  formed  also  requires  modification. 
Critical Temperature

In  zero  magnetic  field,  superconducting  ordering  sets  in  at  a  critical  temperature 
given  by 

$$k_B T_c = 1.13\,\hbar\omega\, e^{-1/N_0 V_0}, \tag{34.17}$$

where N0 is the density of electronic levels for a single spin population in the normal
metal49 and ω and V0 are the parameters of the model Hamiltonian (34.16). Because of
the exponential dependence, the effective coupling V0 cannot be determined precisely
enough to permit very accurate computations of the critical temperature from (34.17).
However, this same exponential dependence accounts for the very low critical temperatures
(typically one to three orders of magnitude below the Debye temperature),
for although ħω is of order kBΘD, the strong dependence on N0V0 can lead to the
observed range of critical temperatures with N0V0 in the range from 0.1 to 0.5, i.e.,
with V0n in the range50 0.1ℰF to 0.5ℰF. Note also that no matter how weak the
coupling V0, the theory predicts a transition, though the transition temperature
(34.17) may be unobservably low.
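
The sensitivity of (34.17) to the coupling can be seen by tabulating Tc/ΘD for a few values of N0V0. This is only a sketch: it assumes ħω is set exactly equal to kBΘD (the text says only that the two are of the same order), and the function name is illustrative.

```python
import math


def tc_over_debye(coupling, prefactor=1.13):
    """Tc / Theta_D from Eq. (34.17), assuming hbar*omega = k_B*Theta_D.

    coupling : the dimensionless product N0*V0 (must be positive)
    """
    if coupling <= 0:
        raise ValueError("N0*V0 must be positive for an attractive interaction")
    return prefactor * math.exp(-1.0 / coupling)


if __name__ == "__main__":
    for coupling in (0.1, 0.2, 0.3, 0.5):
        ratio = tc_over_debye(coupling)
        print(f"N0V0 = {coupling:.1f}  ->  Tc/Theta_D = {ratio:.2e}")
```

A change in N0V0 by a factor of five moves the predicted Tc by several orders of magnitude, which is why small uncertainties in V0 spoil any accurate computation of Tc.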


Energy  Gap 

A  formula  similar  to  (34.17)  is  predicted  for  the  zero-temperature  energy  gap: 

$$\Delta(0) = 2\hbar\omega\, e^{-1/N_0 V_0}. \tag{34.18}$$


The  ratio  of  (34.18)  to  (34.17)  gives  a  fundamental  formula  independent  of  the 
phenomenological  parameters: 


$$\frac{\Delta(0)}{k_B T_c} = 1.76. \tag{34.19}$$


This  result  appears  to  hold  for  a  large  number  of  superconductors  to  within  about 
10  percent  (Table  34.3).  Those  for  which  it  fails  (for  example,  lead  and  mercury, 
where  the  discrepancy  is  closer  to  30  percent)  tend  systematically  to  deviate  from 
other predictions of the simple theory as well, and can be brought closer into line with
theoretical  predictions  by  using  the  more  elaborate  analysis  of  the  strong-coupling 
theory. 
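
As a rough check of the 10 percent statement, the measured ratios 2Δ(0)/kBTc listed in Table 34.3 can be compared with the BCS value 3.53. The sketch below uses only the tabulated numbers; the closed form π/e^γ for Δ(0)/kBTc is the standard weak-coupling result and is not stated in the text (the rounded prefactors in (34.17) and (34.18) give 2/1.13 instead). All names are illustrative.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant

# Standard weak-coupling value of Delta(0)/(k_B*Tc); an assumption beyond the text.
BCS_DELTA_OVER_KTC = math.pi / math.exp(EULER_GAMMA)

BCS_RATIO = 3.53  # 2*Delta(0)/(k_B*Tc), as quoted under Table 34.3

# Measured 2*Delta(0)/(k_B*Tc) from Table 34.3.
MEASURED = {
    "Al": 3.4, "Cd": 3.2, "Hg": 4.6, "In": 3.6, "Nb": 3.8, "Pb": 4.3,
    "Sn": 3.5, "Ta": 3.6, "Tl": 3.6, "V": 3.4, "Zn": 3.2,
}


def percent_deviation(measured, predicted=BCS_RATIO):
    """Signed deviation of a measured ratio from the prediction, in percent."""
    return 100.0 * (measured - predicted) / predicted


def outliers(data, tolerance_percent=10.0):
    """Elements whose ratio departs from the BCS value by more than the tolerance."""
    return sorted(
        element for element, value in data.items()
        if abs(percent_deviation(value)) > tolerance_percent
    )


if __name__ == "__main__":
    print(f"Delta(0)/(k_B Tc), weak coupling: {BCS_DELTA_OVER_KTC:.3f}")
    for element, value in MEASURED.items():
        print(f"{element:>2}: {value:.1f}  ({percent_deviation(value):+.1f}%)")
    print("Outside 10 percent:", outliers(MEASURED))
```

Only the strong-coupling superconductors lead and mercury fall outside the 10 percent band.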

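The elementary theory also predicts that near the critical temperature (in zero
field) the energy gap vanishes according to the universal law51

$$\frac{\Delta(T)}{\Delta(0)} = 1.74\left(1 - \frac{T}{T_c}\right)^{1/2}, \qquad T \approx T_c. \tag{34.20}$$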


49 The quantity N0 is simply g(ℰF)/2. This notation for the density of levels is widely used in the
literature  of  superconductivity. 

50 The quantity N0 is of order n/ℰF. See Eq. (2.65).

51  Equation  (34.20)  is  a  characteristic  result  of  mean  field  theory  (cf.  the  prediction  of  mean  field  theory 
that the spontaneous magnetization vanishes as (Tc − T)^(1/2), Chapter 33, Problem 6). Mean field theory is
known to be wrong in ferromagnets sufficiently near the critical temperature. Presumably it fails
sufficiently near Tc in a superconductor as well, but arguments have been advanced that the region inside of
which mean field theory fails is exceedingly small (typically (Tc − T)/Tc ≈ 10⁻⁸). Superconductors provide
a  rare  example  of  a  phase  transition  that  is  well  described  by  a  mean  field  theory  quite  near  the  critical 
point. 
Table 34.3

MEASURED VALUESᵃ OF 2Δ(0)/kBTc

ELEMENT     2Δ(0)/kBTc
Al          3.4
Cd          3.2
Hg (α)      4.6
In          3.6
Nb          3.8
Pb          4.3
Sn          3.5
Ta          3.6
Tl          3.6
V           3.4
Zn          3.2

ᵃ Δ(0) is taken from tunneling experiments. Note that the BCS value for this ratio is 3.53.
Most of the values listed have an uncertainty of ±0.1.

Source: R. Mersevey and B. B. Schwartz, Superconductivity, R. D. Parks, ed., Dekker, New York, 1969.

Critical  Field 

The elementary BCS prediction for Hc(T) is often expressed in terms of the deviation
from the empirical law:52

$$\frac{H_c(T)}{H_c(0)} \approx 1 - \left(\frac{T}{T_c}\right)^2. \tag{34.21}$$

The quantity [Hc(T)/Hc(0)] − [1 − (T/Tc)²] is shown for several superconductors
in  Figure  34.8,  along  with  the  BCS  prediction.  The  departure  is  small  in  all  cases, 
but  note  that  the  strong-coupling  superconductors  lead  and  mercury  are  more  out 
of  line  than  the  others. 

Specific  Heat 

At  the  critical  temperature  (in  zero  magnetic  field)  the  elementary  BCS  theory  predicts 
a  discontinuity  in  the  specific  heat  that  can  also  be  put  in  a  form  independent  of  the 
parameters  in  the  model  Hamiltonian  (34.16):53 

$$\left.\frac{c_s - c_n}{c_n}\right|_{T_c} = 1.43. \tag{34.22}$$


52 The BCS prediction can again be cast in a parameter independent form. At low temperatures it is
Hc(T)/Hc(0) ≈ 1 − 1.06(T/Tc)², while near Tc it is Hc(T)/Hc(0) = 1.74[1 − (T/Tc)].

53 A specific heat discontinuity at Tc is also a characteristic mean field theory result. Presumably,
very close to Tc the specific heat may diverge.
Figure 34.8

The deviation from the crude empirical relation
Hc(T)/Hc(0) = 1 − [T/Tc]², as measured in
several metals and as predicted by the simple
BCS theory. Deviations from the simple BCS
prediction are more pronounced in the "strong-coupling"
superconductors, lead and mercury.
(J. C. Swihart et al., Phys. Rev. Lett. 14, 106
(1965).)


The  agreement  of  this  prediction  with  experiment  is  again  good  to  about  10  percent 
except  for  the  strong-coupling  superconductors  (Table  34.4). 

The  low-temperature  electronic  specific  heat  can  also  be  cast  in  a  parameter 
independent  form. 


(34.23) 


where γ is the coefficient of the linear term in the specific heat of the metal in the
normal state (Eq. (2.80)). Note the exponential drop, on a scale determined by the
energy gap Δ(0).


MICROSCOPIC  THEORY  AND  THE  MEISSNER  EFFECT 

In  the  presence  of  a  magnetic  field,  diamagnetic  currents  will  flow  in  the  equilibrium 
state  of  a  metal,  whether  it  is  normal  or  superconducting,  although  the  currents  are 
vastly  greater  in  a  superconductor.  In  a  free  electron  model  the  current  will  be 
determined  to  first  order  in  the  field  by  an  equation  of  the  form54 

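$$\nabla \times \mathbf{j}(\mathbf{r}) = -\int d\mathbf{r}'\, K(\mathbf{r} - \mathbf{r}')\,\mathbf{B}(\mathbf{r}'). \tag{34.24}$$

If it happens that the kernel K(r) satisfies

$$\int d\mathbf{r}\, K(\mathbf{r}) = K_0 \neq 0, \tag{34.25}$$

then in the limit of magnetic fields that vary slowly over the range of K(r), Eq. (34.24)
reduces to

$$\nabla \times \mathbf{j}(\mathbf{r}) = -K_0\,\mathbf{B}(\mathbf{r}), \tag{34.26}$$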


54  This  is  the  same  kernel  K  mentioned  in  footnote  31. 
Table 34.4

MEASURED VALUES OF THE RATIOᵃ [(cs − cn)/cn] AT Tc

ELEMENT     [(cs − cn)/cn] at Tc
Al          1.4
Cd          1.4
Ga          1.4
Hg          2.4
In          1.7
La (HCP)    1.5
Nb          1.9
Pb          2.7
Sn          1.6
Ta          1.6
Tl          1.5
V           1.5
Zn          1.3

ᵃ The simple BCS prediction is [(cs − cn)/cn] at Tc = 1.43.

Source: R. Mersevey and B. B. Schwartz, Superconductivity, R. D. Parks, ed., Dekker, New York, 1969.


which is nothing but the London equation (34.7), with ns given by

$$n_s = \frac{mc}{e^2}\,K_0. \tag{34.27}$$

Since the London equation implies the Meissner effect, it follows that in normal
metals the constant K0 must vanish. To demonstrate that BCS theory implies the
Meissner effect, one calculates the kernel K(r) by perturbation theory in the applied
field, and verifies explicitly that K0 ≠ 0.

The actual demonstration that K0 ≠ 0 is a fairly complex application of BCS
theory.  However,  a  more  intuitive  explanation  for  the  London  equation  was  offered 
at  the  time  the  equation  was  first  put  forth,  by  the  Londons.  This  explanation  can 
be  made  somewhat  more  compelling  through  a  phenomenological  theory  of  V.  L. 
Ginzburg  and  L.  D.  Landau,55  which,  though  proposed  seven  years  prior  to  the  BCS 
theory,  can  be  quite  naturally  described  in  terms  of  some  of  the  fundamental  notions 
of  the  microscopic  theory. 


THE  GINZBURG-LANDAU  THEORY 

Ginzburg and Landau asserted that the superconducting state could be characterized
by a complex “order parameter” ψ(r), which vanishes above Tc and whose


55 V. L. Ginzburg and L. D. Landau, Zh. Eksp. Teor. Fiz. 20, 1064 (1950).
magnitude measures the degree of superconducting order at position r below Tc.56
From  the  perspective  of  the  BCS  theory  the  order  parameter  can  be  viewed  as  a 
one-particle  wave  function  describing  the  position  of  the  center  of  mass  of  a  Cooper 
pair.  Since  all  Cooper  pairs  are  in  the  same  two-electron  state,  a  single  function 
suffices.  Because  the  order  parameter  does  not  refer  to  the  relative  coordinate  of  the 
two electrons in the pair, the description of a superconductor in terms of ψ(r) is
valid  only  for  phenomena  that  vary  slowly  on  the  scale57  of  the  dimensions  of  the 
pair. 

In  the  ground  state  of  the  superconductor  each  pair  is  in  a  translationally  invariant 
state  that  does  not  depend  on  the  center  of  mass  coordinate;  i.e.,  the  order  parameter 
is  a  constant.  The  order  parameter  develops  interesting  structure  when  currents  flow, 
or  when  an  applied  field  appears.  A  fundamental  assumption  of  the  Ginzburg-Landau 
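theory is that the current flowing in a superconductor characterized by order parameter
ψ(r) in the presence of a magnetic field given by the vector potential A(r) is
given by the ordinary quantum-mechanical formula for the current due to a particle
of charge −2e and mass 2m (i.e., the Cooper pair itself) described by a wave function
ψ(r), namely

$$\mathbf{j} = -\frac{e}{2m}\left[\psi^{*}\left(\frac{\hbar}{i}\nabla + \frac{2e}{c}\mathbf{A}\right)\psi + \left(\left(\frac{\hbar}{i}\nabla + \frac{2e}{c}\mathbf{A}\right)\psi\right)^{*}\psi\right]. \tag{34.28}$$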


The  London  equation  (34.7)  follows  from  (34.28),  provided  one  also  assumes  that 
the significant spatial variation of the order parameter ψ = |ψ|e^(iφ) is through the
phase φ, and not the magnitude |ψ|. Since the magnitude of the order parameter
measures the degree of superconducting ordering, this assumption restricts consideration
to disturbances in which the density of Cooper pairs is not appreciably altered
from  its  uniform  thermal  equilibrium  value.  This  should  be  the  case  in  phenomena 
in  which  the  pairs  can  flow,  but  not  accumulate  or  be  destroyed.58 

Given  this  assumption,  the  relation  (34.28)  for  the  current  simplifies  to 


$$\mathbf{j} = -\left[\frac{2e^2}{mc}\,\mathbf{A} + \frac{e\hbar}{m}\,\nabla\phi\right]|\psi|^2. \tag{34.29}$$


56  It  is  sometimes  helpful  to  bear  in  mind  an  analogy  to  a  Heisenberg  ferromagnet,  where  the  order 
parameter may be viewed as the mean value of the local spin s(r). Above Tc, s(r) vanishes; below, it gives
the local value of the spontaneous magnetization. In the ground state s(r) is independent of r (and,
correspondingly, in a uniform superconductor that carries no current, ψ(r) is constant). However, one can
consider more complicated configurations of the ferromagnet, in which, for example, the magnetization
is constrained by applied fields to point in different directions at two ends of a bar. A position-dependent
s(r) is also useful in investigating aspects of domain structure. In a similar way, a position-dependent ψ(r)
is  used  to  investigate  current-carrying  configurations  of  a  superconductor. 

57 Well below Tc this is just the length ξ0, described on page 742.

58  More  generally,  when  the  degree  of  superconducting  order  does  have  significant  spatial  variation, 
one must use a second Ginzburg-Landau equation in conjunction with (34.28) to determine both ψ and
the  current.  The  second  equation  relates  the  spatial  rate  of  change  of  the  order  parameter  to  the  vector 
potential,  and  bears  a  (to  some  extent  deceptive)  resemblance  to  a  one-particle  Schrodinger  equation.  The 
use  of  the  full  set  of  Ginzburg-Landau  equations  is  essential,  for  example,  in  the  description  of  vortices  in 
type  II  superconductors,  for  in  the  core  of  the  vortex  the  magnitude  of  the  order  parameter  drops  rapidly 
to  zero,  yielding  a  region  in  which  the  magnetic  flux  is  appreciable. 
Since the curl of any gradient vanishes, and since |ψ|² is essentially constant, we
immediately deduce the London equation (34.7) provided that we identify the superfluid
density ns with 2|ψ|², which is reasonable in view of the interpretation of ψ as
a wave function characterizing particles of charge 2e.
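
Written out, the step from the current (34.29) to the London equation is short. The sketch below assumes that (34.7), which is not reproduced in this section, has the Gaussian form ∇ × j = −(ns e²/mc) B, consistent with (34.26) and (34.27); it uses B = ∇ × A and treats |ψ|² as constant.

```latex
\begin{align*}
\nabla \times \mathbf{j}
  &= -\frac{2e^2}{mc}\,|\psi|^2\,\nabla \times \mathbf{A}
     \;-\; \frac{e\hbar}{m}\,|\psi|^2\,\nabla \times (\nabla\phi) \\
  &= -\frac{2e^2}{mc}\,|\psi|^2\,\mathbf{B}
   \;=\; -\frac{n_s e^2}{mc}\,\mathbf{B},
  \qquad n_s = 2|\psi|^2 .
\end{align*}
```

The second term drops out because the curl of a gradient vanishes, so only the vector potential contributes.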


FLUX  QUANTIZATION 


Equation (34.29) has an even more striking implication than the London equation.
Consider a superconductor in the shape of a ring (Figure 34.9). If we integrate (34.29)
over a path deep inside the superconducting material enclosing the hole in the ring,
then, since appreciable currents can flow only near the surface of the superconductor,
we find that

$$0 = \oint \mathbf{j}\cdot d\boldsymbol{\ell} = -|\psi|^2 \oint \left(\frac{2e^2}{mc}\,\mathbf{A} + \frac{e\hbar}{m}\,\nabla\phi\right)\cdot d\boldsymbol{\ell}. \tag{34.30}$$

Figure 34.9

A ring of superconducting material, showing a path encircling the
aperture and lying well within the interior of the superconductor.


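Stokes' theorem gives

$$\oint \mathbf{A}\cdot d\boldsymbol{\ell} = \int \nabla \times \mathbf{A}\cdot d\mathbf{S} = \int \mathbf{B}\cdot d\mathbf{S} = \Phi, \tag{34.31}$$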


where Φ is the flux enclosed by the ring.59 Furthermore, since the order parameter
is single-valued, its phase must change by 2π times an integer n when the ring is
encircled:

$$\oint \nabla\phi \cdot d\boldsymbol{\ell} = \Delta\phi = 2\pi n. \tag{34.32}$$


Combining  these  results,  we  conclude  that  the  magnetic  flux  enclosed  by  the  ring 
must  be  quantized: 

$$|\Phi| = \frac{nhc}{2e} = n\Phi_0. \tag{34.33}$$


The quantity Φ0 = hc/2e = 2.0679 × 10⁻⁷ gauss-cm² is known as the fluxoid, or
flux  quantum.  Flux  quantization  has  been  observed  and  is  one  of  the  most  compelling 
pieces  of  evidence  for  the  validity  of  the  description  of  a  superconductor  by  means 
of  a  complex  order  parameter.60 
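
The size of the flux quantum can be reproduced from the fundamental constants. The sketch below works in CGS-Gaussian units; the numerical constants are assumed present-day values (so the last digit differs slightly from the figure quoted above), and the helper for the mean field through a hole is a hypothetical illustration.

```python
import math

# CGS-Gaussian constants (assumed present-day values, not taken from the text)
PLANCK_H = 6.62607015e-27         # erg s
SPEED_OF_LIGHT = 2.99792458e10    # cm / s
ELECTRON_CHARGE = 4.80320471e-10  # esu


def flux_quantum_gauss_cm2():
    """Flux quantum Phi_0 = h c / 2e in gauss cm^2."""
    return PLANCK_H * SPEED_OF_LIGHT / (2.0 * ELECTRON_CHARGE)


def mean_field_gauss(n_quanta, hole_area_cm2):
    """Mean field through a hole of the given area that encloses n flux quanta."""
    if hole_area_cm2 <= 0:
        raise ValueError("area must be positive")
    return n_quanta * flux_quantum_gauss_cm2() / hole_area_cm2


if __name__ == "__main__":
    print(f"Phi_0 = {flux_quantum_gauss_cm2():.4e} gauss cm^2")
    # One quantum through a 1 mm^2 hole (0.01 cm^2)
    print(f"B for n = 1, area = 1 mm^2: {mean_field_gauss(1, 1.0e-2):.2e} gauss")
```

One quantum spread over a square millimeter corresponds to a field of order 10⁻⁵ gauss, which indicates how small a ring must be for the steps in (34.33) to be resolved.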


59  Since  the  magnetic  field  cannot  penetrate  the  superconducting  material,  the  enclosed  flux  will  not 
depend  on  the  choice  of  path,  as  long  as  the  path  is  deep  inside  the  material. 

60 B. S. Deaver and W. M. Fairbank, Phys. Rev. Lett. 7, 43 (1961); R. Doll and M. Näbauer, Phys.
Rev. Lett. 7, 51 (1961). The Ginzburg-Landau theory also predicts, and experiments have confirmed, that
each  vortex  in  a  type  II  superconductor  contains  a  single  quantum  of  magnetic  flux. 
MICROSCOPIC THEORY AND PERSISTENT CURRENTS

The  property  for  which  superconductors  are  named  is  unfortunately  one  of  the  most 
difficult  to  extract  from  the  microscopic  theory.  In  a  sense  perfect  conductivity  is 
implied  by  the  Meissner  effect,  for  macroscopic  currents  must  flow  without  dissipation 
in  order  to  screen  macroscopic  magnetic  fields  in  equilibrium.  Indeed,  the  direct 
microscopic  derivation  of  persistent  currents  is  not  dissimilar  to  that  of  the  Meissner 
effect.  One  calculates  to  linear  order  the  current  induced  by  an  electric  field,  and 
demonstrates  that  there  is  a  piece  in  the  AC  conductivity  that  has  the  form  (34.3) 
appropriate  to  an  electron  gas  without  dissipation.  For  this  it  suffices  to  prove  that61 

$$\lim_{\omega \to 0} \omega\, \operatorname{Im}\, \sigma(\omega) \neq 0. \tag{34.34}$$

The  value  of  the  nonzero  constant  determines,  by  comparison  with  (34.3),  the  value 
of  the  density  of  superconducting  electrons  ns. 

Demonstrating  that  (34.34)  holds  is  more  complicated  than  demonstrating  the 
condition  (34.25)  for  the  Meissner  effect,  because  it  is  essential  to  include  the  effects 
of  scattering:  If  there  were  no  scattering  any  metal  would  obey  (34.34),  but  even  in 
the  absence  of  scattering  a  calculation  of  diamagnetism  in  a  normal  metal  reveals 
no  Meissner  effect.  However,  the  calculation  has  been  done62  and  the  value  of  ns 
deduced  from  the  low-frequency  conductivity  is  found  to  agree  with  that  deduced 
from  the  calculation  of  the  Meissner  effect.  The  calculation  is  unfortunately  quite 
formal,  and  does  not  provide  any  intuitive  explanation  for  the  remarkable  fact  that 
none  of  the  familiar  scattering  mechanisms  are  effective  in  degrading  a  current,  once 
it  has  been  established  in  a  superconducting  metal.  Such  an  intuitive  explanation  is 
at  least  suggested  by  the  following  line  of  reasoning:63 

Suppose  that  we  use  an  electric  field  to  establish  a  current  in  a  metal,  and  then 
turn  off  the  field  and  ask  how  the  current  can  decay.  In  a  normal  metal  the  current 
can  be  degraded  one  electron  at  a  time;  i.e.,  scattering  processes  can  reduce  the  total 
momentum  of  the  electronic  system  by  a  series  of  collisions  of  individual  electrons 
with  impurities,  phonons,  imperfections,  etc.,  each  of  which,  on  the  average,  drives 
the  momentum  distribution  back  to  its  equilibrium  form,  in  which  the  total  current 
vanishes.  When  a  current  is  established  in  a  superconductor,  all  the  Cooper  pairs 
move  together:  The  single  two-electron  state  describing  each  of  the  pairs  is  one  with 


61 Equation (34.34) is not dissimilar in structure to the condition (34.25) for a Meissner effect. The
integral over all space of the kernel K is equal to the k = 0 limit of its spatial Fourier transform. In both
cases one must establish the failure of a certain electromagnetic response function to vanish, in an
appropriate long-wavelength or low-frequency limit.

62 See, for example, A. A. Abrikosov, L. P. Gorkov, and I. E. Dzyaloshinski, Methods of Quantum
Field Theory in Statistical Physics, Prentice-Hall, Englewood Cliffs, N.J., 1963, pp. 334-341.

63 A variety of “intuitive” arguments have been offered on this point, many of them quite spurious.
There is, for example, an argument (based on an old argument of Landau’s to explain superfluidity in ⁴He)
purporting to deduce persistent currents from the existence of a gap in the one-electron excitation spectrum.
But this merely explains why currents cannot be degraded by one-electron excitations, leaving open the
possibility of degrading the current pair by pair. The argument we indicate here can be found under many
guises, associated with the notions of “rigidity of the wave function,” “off-diagonal long-range order,” or
“long-range phase coherence.”
a nonzero center of mass momentum.64 One might expect that such a current could
be  degraded  by  single-pair  collisions,  analogous  to  the  one-electron  collisions  in  a 
normal  metal,  in  which  individual  pairs  have  their  center  of  mass  momenta  reduced 
back  to  zero  by  collisions.  Such  a  suggestion,  however,  fails  to  take  into  account  the 
delicate  interdependence  of  the  pairs.65  Essential  to  the  stability  of  the  paired  state 
is  the  fact  that  all  the  other  pairs  exist  and  are  described  by  identical  pair  wave 
functions.  Thus  one  cannot  change  the  pair  wave  functions  individually  without 
destroying  the  paired  state  altogether,  at  an  enormous  cost  in  free  energy. 

The  supercurrent-degrading  transition  that  is  least  costly  in  free  energy  depends, 
in  general,  on  the  geometry  of  the  specimen,  but  it  usually  requires  destruction  of 
the  pairing  in  some  macroscopic  portion  of  the  sample.  Such  processes  are  possible, 
but the cost in free energy will in general be so large that the lifetime of the
supercurrent is infinite on any practical time scale.66


SUPERCURRENT  TUNNELING;  THE  JOSEPHSON  EFFECTS 

We  have  described  (page  735)  the  tunneling  of  single  electrons  from  a  superconducting 
metal  through  a  thin  insulating  barrier  into  a  normal  metal,  and  indicated  how 
measurements  of  the  tunneling  current  give  information  about  the  density  of  one- 
electron  levels  in  the  superconductor.  Measurements  can  also  be  made  of  the  tunneling 
currents  when  both  metals  are  superconductors,  and  the  results  are  fitted  well  by 
assuming  that  both  metals  have  one-electron-level  densities  of  the  form  predicted  by 
the  BCS  theory  (Figure  34.10).  In  1962  Josephson67  predicted  that  in  addition  to  this 
“normal  tunneling”  of  single  electrons  there  should  be  another  component  to  the 
tunneling  current  carried  by  paired  electrons:  Provided  that  the  barrier  is  not 
too  thick,  electron  pairs  can  traverse  the  junction  from  one  superconductor  to  the 
other  without  dissociating. 

One  immediate  consequence  of  this  observation  is  that  a  supercurrent  of  pairs 
should  flow  across  the  junction  in  the  absence  of  any  applied  electric  field  (the  DC 
Josephson  effect).  Because  the  two  superconductors  are  only  weakly  coupled  (i.e., 
because  the  paired  electrons  must  cross  a  gap  of  nonsuperconducting  material),  the 
typical  tunneling  current  across  the  junction  will  be  far  smaller  than  typical  critical 
currents  for  single  specimens. 


64  That  a  supercurrent  is  very  well  described  by  regarding  all  paired  electrons  as  occupying  a  single 
quantum  state  is  confirmed  by  the  absence  of  thermoelectric  effects  in  a  superconductor  (see  page  730). 
If  a  supercurrent  resembled  the  disorderly  flow  of  electrons  that  constitutes  a  current  in  a  normal  metal, 
then  there  would  be  an  accompanying  thermal  current  (Peltier  effect). 

65  Recall  (page  742)  that  within  the  radius  of  a  given  pair  will  be  found  the  centers  of  millions  of 
other  pairs. 

66  Thus,  in  principle,  supercurrent-carrying  states  are  only  metastable.  For  suitable  geometries  (i.e., 
specimens  that  are  very  small  in  one  spatial  dimension  or  more)  the  fluctuations  necessary  to  destroy  a 
supercurrent  need  not  be  overwhelmingly  improbable,  and  one  can  observe  the  decay  of  the  “persistent 
current.”  A  very  appealing  microscopic  picture  of  such  processes  has  been  given  by  V.  Ambegaokar  and 
J. S. Langer, Phys. Rev. 164, 498 (1967).

67  B.  D.  Josephson,  Phys.  Lett.  1,  251  (1962).  See  also  the  articles  by  Josephson  and  Mercereau  in 
Superconductivity,  R.  D.  Parks,  ed.,  Dekker,  New  York,  1969. 
Figure 34.10

Normal tunneling current between two superconductors
(tin and lead). The solid curve is the
prediction of BCS theory. (S. Shapiro et al.,
IBM J. Res. Develop. 6, 34 (1962).)


Josephson  predicted  a  variety  of  further  effects  by  assuming  that  the  supercon¬ 
ducting  ordering  on  both  sides  of  the  junction  could  be  described  by  a  single  order 
parameter  He  showed  that  the  tunneling  current  would  be  determined  by  the 
change  in  phase  of  the  order  parameter  across  the  junction.  Furthermore,  by  using 
gauge  invariance  to  relate  the  phase  of  the  order  parameter  to  the  value  of  an  applied 
vector  potential,  he  was  able  to  show  that  the  tunneling  current  would  depend  in 
a  sensitive  way  upon  any  magnetic  field  present  in  the  junction.  Specifically,  the 
tunneling  current  in  the  presence  of  a  magnetic  field  should  have  the  form 


\[
I = I_0\,\frac{\sin(\pi\Phi/\Phi_0)}{\pi\Phi/\Phi_0},
\tag{34.35}
\]

where $\Phi$ is the total magnetic flux in the junction, $\Phi_0$ is the flux quantum $hc/2e$, and
$I_0$ depends on the temperature and the structure of the junction, but not upon magnetic
field. Such effects were subsequently observed (Figure 34.11), thereby providing
impressive  further  confirmation  of  the  fundamental  validity  of  the  order  parameter 
description  of  the  superconducting  state,  as  well  as  vindicating  Josephson’s  highly 
imaginative  application  of  the  theory.68 
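
As a rough numerical illustration of Eq. (34.35), the following sketch evaluates the ratio of the tunneling current to its zero-field value as a function of the flux through the junction measured in flux quanta. The function name and the sample flux values are illustrative choices, not taken from the text.

```python
import math


def josephson_current_ratio(flux_ratio):
    """Return I/I_0 = sin(pi*x)/(pi*x), with x = Phi/Phi_0, as in Eq. (34.35)."""
    if flux_ratio == 0.0:
        return 1.0  # limiting value of sin(y)/y as y -> 0
    y = math.pi * flux_ratio
    return math.sin(y) / y


if __name__ == "__main__":
    # The current vanishes whenever the junction holds a whole number of flux quanta.
    for x in (0.0, 0.5, 1.0, 1.5, 2.0):
        print(f"Phi/Phi_0 = {x:3.1f}   I/I_0 = {josephson_current_ratio(x):+.4f}")
```

The zeros at integer values of $\Phi/\Phi_0$ are what make the pattern a sensitive probe of small magnetic fields.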

Similar  considerations  led  Josephson  to  predict  further  that  if  a  DC  electric 
potential  were  applied  across  such  a  junction  the  induced  supercurrent  would  be 
oscillatory  (the  AC  Josephson  effect)  with  angular  frequency 

\[
\omega_J = \frac{2eV}{\hbar}.
\tag{34.36}
\]


This remarkable result, that a DC electric field should induce an alternating current,
was not only observed, but has been made the basis for highly accurate
techniques  for  measuring  voltages  as  well  as  the  precise  value  of  the  fundamental 
constant  e/h.69 
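
To get a feeling for the magnitudes in Eq. (34.36), the sketch below converts a DC voltage into the corresponding oscillation frequency $\nu = \omega_J/2\pi = 2eV/h$. It assumes SI units for convenience (the values of e and h are those quoted in Appendix A, with h converted from erg-sec to joule-sec); the function name is an illustrative choice.

```python
E_CHARGE = 1.60219e-19   # electronic charge, coulombs (Appendix A)
PLANCK_H = 6.6262e-34    # Planck's constant, joule-sec (Appendix A value, converted)


def josephson_frequency_hz(voltage_volts):
    """Frequency nu = omega_J / (2 pi) = 2 e V / h for a DC voltage V (Eq. (34.36))."""
    return 2.0 * E_CHARGE * voltage_volts / PLANCK_H


if __name__ == "__main__":
    # One microvolt corresponds to a frequency of several hundred megahertz.
    print(josephson_frequency_hz(1.0e-6))
```

Because a very small voltage maps onto an easily counted frequency, a frequency measurement can serve as a voltage measurement.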


68 The value of the flux quantum is exceedingly small, making the effect of practical importance as
a highly sensitive way to measure magnetic field strengths.

69 W. H. Parker et al., Phys. Rev. 177, 639 (1969).
Figure 34.11

Josephson tunneling current as a function of magnetic field in an Sn-SnO-Sn junction.
(R. C. Jaklevic, quoted in James E. Mercereau, Superconductivity, vol. 1, R. D. Parks, ed.,
Dekker, New York, 1969, p. 393.)

It  is  fitting  that  this  volume  should  conclude  with  this  sketchy  and  tantalizingly 
incomplete  survey  of  superconductivity.  The  rich  and  highly  original  theories,  both 
microscopic  and  phenomenological,  that  have  successfully  evolved  over  the  past  two 
decades  to  cope  with  superconducting  phenomena  are  indicative  of  the  fundamental 
health  and  future  promise  of  the  contemporary  theory  of  solids.  In  spite  of  the 
novelty  and,  at  times,  the  forbidding  complexity  of  the  concepts  upon  which  the 
theory  of  superconductivity  is  based,  one  must  not  forget  that  it  rests  on  a  broad 
foundation  extending  over  almost  all  the  important  areas  of  the  theory  of  solids  we 
have  examined  in  earlier  chapters.  In  no  other  subject  are  the  two  fundamental 
branches  of  solid  state  physics — the  dynamics  of  electrons  and  the  vibrations  of  the 
lattice  of  ions — so  intimately  fused,  with  such  spectacular  consequences. 


PROBLEMS 


1.  Thermodynamics  of  the  Superconducting  State 

The  equilibrium  state  of  a  superconductor  in  a  uniform  magnetic  field  is  determined  by  the 
temperature  T  and  the  magnitude  of  the  field  H.  (Assume  that  the  pressure  P  is  fixed,  and  that 
the  superconductor  is  a  long  cylinder  parallel  to  the  field  so  that  demagnetization  effects  are 
unimportant.) The thermodynamic identity is conveniently written in terms of the Gibbs free
energy G:

\[
dG = -S\,dT - \mathcal{M}\,dH
\tag{34.37}
\]

where S is the entropy and $\mathcal{M}$ the total magnetization ($\mathcal{M} = MV$, where M is the magnetization
density). The phase boundary between the superconducting and normal states in the H-T plane
is given by the critical field curve, $H_c(T)$ (Figure 34.3).

(a)  Deduce,  from  the  fact  that  G  is  continuous  across  the  phase  boundary,  that 


\[
\frac{dH_c(T)}{dT} = \frac{S_n - S_s}{\mathcal{M}_s - \mathcal{M}_n}
\tag{34.38}
\]

(where the subscripts s and n indicate values in the superconducting and normal phases,
respectively).

(b) Using the fact that the superconducting state displays perfect diamagnetism (B = 0)
while the normal state has negligible diamagnetism ($M \approx 0$), show from (34.38) that the entropy
discontinuity across the phase boundary is

\[
S_n - S_s = -\frac{V}{4\pi} H_c \frac{dH_c}{dT},
\tag{34.39}
\]

and thus the latent heat, when the transition occurs in a field, is

\[
Q = -\frac{TV}{4\pi} H_c \frac{dH_c}{dT}.
\tag{34.40}
\]


(c)  Show  that  when  the  transition  occurs  at  zero  field  (i.e.,  at  the  critical  point)  there  is  a 
specific  heat  discontinuity  given  by 


\[
(c_p)_n - (c_p)_s = -\frac{T}{4\pi}\left(\frac{dH_c}{dT}\right)^2.
\tag{34.41}
\]

2.  The  London  Equation  for  a  Superconducting  Slab 
Consider  an  infinite  superconducting  slab  bounded  by  two  parallel  planes  perpendicular  to  the 
y-axis at y = ±d. Let a uniform magnetic field of strength $H_0$ be applied along the z-axis.

(a)  Taking  as  a  boundary  condition  that  the  parallel  component  of  B  be  continuous  at  the 
surface,  deduce  from  the  London  equation  (34.7)  and  the  Maxwell  equation  (34.6)  that  within 
the  superconductor 

\[
\mathbf{B} = B(y)\,\hat{\mathbf{z}}, \qquad B(y) = H_0\,\frac{\cosh(y/\Lambda)}{\cosh(d/\Lambda)}.
\tag{34.42}
\]

(b)  Show  that  the  diamagnetic  current  density  flowing  in  equilibrium  is 

\[
\mathbf{j} = j(y)\,\hat{\mathbf{x}}, \qquad j(y) = \frac{c}{4\pi\Lambda}\,H_0\,\frac{\sinh(y/\Lambda)}{\cosh(d/\Lambda)}.
\]

(c) The magnetization density at a point within the slab is $M(y) = (B(y) - H_0)/4\pi$. Show
that the average magnetization density (averaged over the thickness of the slab) is

\[
\bar{M} = -\frac{H_0}{4\pi}\left[1 - \frac{\Lambda}{d}\tanh\frac{d}{\Lambda}\right],
\tag{34.43}
\]

and give the limiting form for the susceptibility when the slab is thick ($d \gg \Lambda$) and thin ($d \ll \Lambda$).
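
A quick numerical cross-check of part (c) can be made by averaging the field profile of part (a) directly. The sketch below is only an illustration under stated assumptions: the function names, the midpoint-rule integration, and the sample values of d and $\Lambda$ are not from the text, and the closed form is the one obtained by averaging $\cosh(y/\Lambda)/\cosh(d/\Lambda)$ over the slab.

```python
import math


def b_profile(y, d, lam, h0=1.0):
    """Field inside the slab: B(y) = H0 cosh(y/Lambda) / cosh(d/Lambda)."""
    return h0 * math.cosh(y / lam) / math.cosh(d / lam)


def mean_magnetization_numeric(d, lam, h0=1.0, steps=20000):
    """Midpoint-rule average of M(y) = (B(y) - H0) / (4 pi) over -d < y < d."""
    dy = 2.0 * d / steps
    total = 0.0
    for i in range(steps):
        y = -d + (i + 0.5) * dy
        total += (b_profile(y, d, lam, h0) - h0) / (4.0 * math.pi)
    return total * dy / (2.0 * d)


def mean_magnetization_closed(d, lam, h0=1.0):
    """Closed form: -(H0 / 4 pi) [1 - (Lambda/d) tanh(d/Lambda)]."""
    return -(h0 / (4.0 * math.pi)) * (1.0 - (lam / d) * math.tanh(d / lam))


if __name__ == "__main__":
    for d in (0.1, 1.0, 10.0):  # slab half-thickness in units of Lambda
        print(d, mean_magnetization_numeric(d, 1.0), mean_magnetization_closed(d, 1.0))
```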
3. Critical Current in a Cylindrical Wire

A current of I amperes flows in a cylindrical superconducting wire of radius r cm. Show that when
the field produced by the current immediately outside the wire is $H_c$ (in gauss), then

\[
I = 5 r H_c.
\tag{34.44}
\]

4.  The  Cooper  Problem 

Consider  a  pair  of  electrons  in  a  singlet  state,  described  by  the  symmetric  spatial  wave  function 


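\[
\phi(\mathbf{r} - \mathbf{r}') = \int \frac{d\mathbf{k}}{(2\pi)^3}\,\chi(\mathbf{k})\,e^{i\mathbf{k}\cdot(\mathbf{r} - \mathbf{r}')}.
\tag{34.45}
\]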


In the momentum representation the Schrödinger equation has the form

\[
\left(E - 2\,\frac{\hbar^2 k^2}{2m}\right)\chi(\mathbf{k}) = \int \frac{d\mathbf{k}'}{(2\pi)^3}\,V(\mathbf{k}, \mathbf{k}')\,\chi(\mathbf{k}').
\tag{34.46}
\]

We assume that the two electrons interact in the presence of a degenerate free electron gas, whose
existence is felt only via the exclusion principle: Electron levels with $k < k_F$ are forbidden to each
of the two electrons, which gives the constraint:

\[
\chi(\mathbf{k}) = 0, \qquad k < k_F.
\tag{34.47}
\]

We take the interaction of the pair to have the simple attractive form (cf. Eq. (34.16)):

\[
V(\mathbf{k}_1, \mathbf{k}_2) =
\begin{cases}
-V, & \mathcal{E}_F \le \dfrac{\hbar^2 k_i^2}{2m} \le \mathcal{E}_F + \hbar\omega, \quad i = 1, 2; \\[1ex]
0, & \text{otherwise},
\end{cases}
\tag{34.48}
\]

and look for a bound-state solution to the Schrödinger equation (34.46) consistent with the
constraint (34.47). Since we are considering only one-electron levels which in the absence of the
attraction have energies in excess of $2\mathcal{E}_F$, a bound state will be one with energy E less than $2\mathcal{E}_F$,
and the binding energy will be

\[
\Delta = 2\mathcal{E}_F - E.
\tag{34.49}
\]

(a)  Show  that  a  bound  state  of  energy  E  exists  provided  that 


\[
1 = V \int_{\mathcal{E}_F}^{\mathcal{E}_F + \hbar\omega} \frac{N(\mathcal{E})\,d\mathcal{E}}{2\mathcal{E} - E},
\tag{34.50}
\]

where $N(\mathcal{E})$ is the density of one-electron levels of a given spin.

(b) Show that Eq. (34.50) has a solution with $E < 2\mathcal{E}_F$ for arbitrarily weak V, provided that
$N(\mathcal{E}_F) \ne 0$. (Note the crucial role played by the exclusion principle: If the lower cutoff were not
$\mathcal{E}_F$, but 0, then since N(0) = 0, there would not be a solution for arbitrarily weak coupling.)

(c) Assuming that $N(\mathcal{E})$ differs negligibly from $N(\mathcal{E}_F)$ in the range $\mathcal{E}_F < \mathcal{E} < \mathcal{E}_F + \hbar\omega$, show
that the binding energy is given by

\[
\Delta = 2\hbar\omega\,\frac{e^{-2/N(\mathcal{E}_F)V}}{1 - e^{-2/N(\mathcal{E}_F)V}},
\tag{34.51}
\]

or, in the weak-coupling limit:

\[
\Delta = 2\hbar\omega\,e^{-2/N(\mathcal{E}_F)V}.
\tag{34.52}
\]
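
The bound-state condition of part (a) can also be explored numerically. The sketch below is a minimal check under stated assumptions: the density of levels is taken constant, energies are measured from the Fermi energy in units of $\hbar\omega$, `coupling` stands for the dimensionless product $N(\mathcal{E}_F)V$, and all function names are illustrative. The crude midpoint integration is only adequate for moderate coupling, where the binding energy is not exponentially small compared with the grid spacing.

```python
import math


def gap_equation_rhs(delta, coupling, hbar_omega=1.0, steps=10000):
    """Right side of the bound-state condition for constant level density.

    With x = E_level - E_F and delta = 2 E_F - E, the denominator 2 E_level - E
    becomes 2 x + delta; the integral runs over 0 < x < hbar_omega.
    """
    dx = hbar_omega / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * dx
        total += dx / (2.0 * x + delta)
    return coupling * total


def binding_energy_numeric(coupling, hbar_omega=1.0):
    """Solve gap_equation_rhs(delta) = 1 for delta by bisection on a log scale."""
    lo, hi = 1.0e-6 * hbar_omega, 1.0e3 * hbar_omega
    for _ in range(60):
        mid = math.sqrt(lo * hi)
        if gap_equation_rhs(mid, coupling, hbar_omega) > 1.0:
            lo = mid   # integral still too large, so delta must be larger
        else:
            hi = mid
    return math.sqrt(lo * hi)


def binding_energy_closed(coupling, hbar_omega=1.0):
    """Closed form: 2 hbar_omega e^(-2/NV) / (1 - e^(-2/NV))."""
    x = math.exp(-2.0 / coupling)
    return 2.0 * hbar_omega * x / (1.0 - x)


if __name__ == "__main__":
    for lam in (0.5, 1.0):
        print(lam, binding_energy_numeric(lam), binding_energy_closed(lam))
```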


Appendix  A 


Summary  of  Important  Numerical 
Relations  in  The  Free  Electron 
Theory  of  Metals 

We  collect  here  those  results  of  free  electron  theory  from  Chapters  1  and  2  that 
are  useful  in  making  rough  numerical  estimates  of  metallic  properties.  We  use  the 
following  values  of  the  fundamental  constants:1 


Electronic charge:        e = 1.60219 × 10⁻¹⁹ coulomb
                            = 4.80324 × 10⁻¹⁰ esu
Speed of light:           c = 2.997925 × 10¹⁰ cm/sec
Planck’s constant:        h = 6.6262 × 10⁻²⁷ erg-sec
                          h/2π = ħ = 1.05459 × 10⁻²⁷ erg-sec
Electronic mass:          m = 9.1095 × 10⁻²⁸ gm
Boltzmann’s constant:     k_B = 1.3807 × 10⁻¹⁶ erg/K
                              = 0.8617 × 10⁻⁴ eV/K
Bohr radius:              ħ²/me² = a₀ = 0.529177 Å
Rydberg:                  e²/2a₀ = 13.6058 eV
Electron volt:            1 eV = 1.60219 × 10⁻¹² erg
                               = 1.1604 × 10⁴ K


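IDEAL FERMI GAS:

\[
\begin{aligned}
k_F &= [3.63\ \text{Å}^{-1}] \times [r_s/a_0]^{-1} && (2.23) \\
v_F &= [4.20 \times 10^{8}\ \text{cm/sec}] \times [r_s/a_0]^{-1} && (2.24) \\
\mathcal{E}_F &= [50.1\ \text{eV}] \times [r_s/a_0]^{-2} && (2.26) \\
T_F &= [58.2 \times 10^{4}\ \text{K}] \times [r_s/a_0]^{-2} && (2.33)
\end{aligned}
\]

The quantity $r_s$ (Eq. (1.2)) is listed for selected metals in Table 1.1, and is numerically
given by

\[
\frac{r_s}{a_0} = 5.44\,[n_{22}]^{-1/3},
\]

where the electronic density is $n = n_{22} \times 10^{22}/\text{cm}^3$.

RELAXATION TIME AND MEAN FREE PATH:

\[
\begin{aligned}
\tau &= [2.2 \times 10^{-15}\ \text{sec}] \times [(r_s/a_0)^3/\rho_\mu] && (1.8) \\
\ell &= [92\ \text{Å}] \times [(r_s/a_0)^2/\rho_\mu] && (2.91)
\end{aligned}
\]

Here $\rho_\mu$ is the resistivity in microhm centimeters, listed for selected metals in Table 1.2.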

1  B.  N.  Taylor,  W.  H.  Parker,  and  D.  N.  Langenberg,  Rev.  Mod.  Phys.  41,  375  (1969).  We  quote  the 
values  to  far  greater  precision  than  it  is  meaningful  to  use  in  free  electron  calculations.  Other  constants 
are  given  on  the  back  i*  cover. 
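CYCLOTRON FREQUENCY:

\[
\begin{aligned}
\nu_c = \omega_c/2\pi &= 2.80\,H \times 10^{6}\ \text{Hz} && (1.22) \\
\hbar\omega_c &= 1.16\,H \times 10^{-8}\ \text{eV} \\
&= 1.34\,H \times 10^{-4}\ \text{K}
\end{aligned}
\]

Here $\omega_c = eH/mc$ (Eq. (1.18)) and H (above) is the magnetic field in gauss.


PLASMA FREQUENCY:

\[
\begin{aligned}
\nu_p = \omega_p/2\pi &= [11.4 \times 10^{15}\ \text{Hz}] \times (r_s/a_0)^{-3/2} && (1.40) \\
\hbar\omega_p &= [47.1\ \text{eV}] \times (r_s/a_0)^{-3/2}
\end{aligned}
\]

Here $\omega_p = [4\pi n e^2/m]^{1/2}$ (Eq. (1.38)).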
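
The numerical coefficients collected in this appendix can be recomputed from the fundamental constants listed at its start. The sketch below does so in Gaussian units; the function and dictionary key names are illustrative choices, and the only physics assumed is the set of free electron definitions $n = 3/4\pi r_s^3$, $k_F = (3\pi^2 n)^{1/3}$, $v_F = \hbar k_F/m$, $\mathcal{E}_F = \hbar^2 k_F^2/2m$, $\omega_p = (4\pi n e^2/m)^{1/2}$, and $\omega_c = eH/mc$.

```python
import math

# Constants as quoted in Appendix A (Gaussian units).
HBAR = 1.05459e-27      # erg-sec
M_E = 9.1095e-28        # gm
E_ESU = 4.80324e-10     # esu
C_LIGHT = 2.997925e10   # cm/sec
K_B = 1.3807e-16        # erg/K
A0_CM = 0.529177e-8     # Bohr radius in cm
ERG_PER_EV = 1.60219e-12


def free_electron_estimates(rs_over_a0):
    """Free electron quantities for a given r_s/a_0, computed from their definitions."""
    r_s = rs_over_a0 * A0_CM
    n = 3.0 / (4.0 * math.pi * r_s ** 3)                        # electrons per cm^3
    k_f = (3.0 * math.pi ** 2 * n) ** (1.0 / 3.0)               # cm^-1
    v_f = HBAR * k_f / M_E                                      # cm/sec
    e_f = (HBAR * k_f) ** 2 / (2.0 * M_E)                       # erg
    omega_p = math.sqrt(4.0 * math.pi * n * E_ESU ** 2 / M_E)   # rad/sec
    return {
        "n_per_cm3": n,
        "kF_per_angstrom": k_f * 1.0e-8,
        "vF_cm_per_sec": v_f,
        "EF_eV": e_f / ERG_PER_EV,
        "TF_K": e_f / K_B,
        "plasma_energy_eV": HBAR * omega_p / ERG_PER_EV,
    }


def cyclotron_frequency_hz(field_gauss):
    """nu_c = e H / (2 pi m c) for a field H in gauss."""
    return E_ESU * field_gauss / (M_E * C_LIGHT) / (2.0 * math.pi)


if __name__ == "__main__":
    # r_s/a_0 = 1 reproduces the bracketed prefactors of the appendix.
    for name, value in free_electron_estimates(1.0).items():
        print(f"{name:>18s} = {value:.4g}")
    print("nu_c at 1 gauss =", cyclotron_frequency_hz(1.0))
```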


Appendix  B 

The  Chemical  Potential 


One  believes1  that  in  the  limit  of  a  large  system  the  Helmholtz  free  energy  per  unit 
volume  approaches  a  smooth  function  of  density  and  temperature : 


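\[
\lim_{\substack{N, V \to \infty \\ N/V \to n}} \frac{1}{V}\,F(N, V, T) = f(n, T),
\tag{B.1}
\]

or, to an excellent approximation for large N and V,

\[
F(N, V, T) = V f(n, T).
\tag{B.2}
\]

Since (Eq. (2.45)) the chemical potential is defined by

\[
\mu = F(N + 1, V, T) - F(N, V, T),
\tag{B.3}
\]

we have, for a large system,

\[
\mu = V\left[f\left(\frac{N+1}{V}, T\right) - f\left(\frac{N}{V}, T\right)\right]
    = V\left[f\left(n + \frac{1}{V}, T\right) - f(n, T)\right]
    \xrightarrow[V \to \infty]{} \left(\frac{\partial f}{\partial n}\right)_T.
\tag{B.4}
\]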


The pressure is given by $P = -(\partial F/\partial V)_T$, which (B.2) and (B.4) reduce to
$P = -f + \mu n$. Since $F = U - TS$, where U is the internal energy and S is the entropy,
it follows that the chemical potential is just the Gibbs free energy per particle:

\[
\mu = \frac{G}{N}, \qquad G = U - TS + PV.
\tag{B.5}
\]


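Since $T = (\partial u/\partial s)_n$ (where the energy density $u = U/V$ and entropy density
$s = S/V$ are defined in the same way as the free energy density f was defined in (B.1)),
it follows from (B.4) that $\mu$ can also be written in the forms

\[
\mu = \left(\frac{\partial u}{\partial n}\right)_s,
\tag{B.6}
\]

\[
\mu = -T\left(\frac{\partial s}{\partial n}\right)_u.
\tag{B.7}
\]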
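
The passage from the finite difference (B.3) to the derivative (B.4) can be illustrated with a model free energy. The sketch below uses a classical ideal gas in units where $k_BT = 1$ and the thermal wavelength is 1, so that $F = N[\ln(N/V) - 1]$; this model and the function names are assumptions made for illustration and do not appear in the text.

```python
import math


def helmholtz_free_energy(n_particles, volume):
    """Model F(N, V, T) = N [ln(N/V) - 1] (classical ideal gas, k_B T = 1)."""
    return n_particles * (math.log(n_particles / volume) - 1.0)


def free_energy_density(density):
    """f(n, T) = n (ln n - 1), so that F = V f(N/V)."""
    return density * (math.log(density) - 1.0)


def mu_finite_difference(n_particles, volume):
    """Chemical potential as F(N + 1) - F(N)."""
    return (helmholtz_free_energy(n_particles + 1, volume)
            - helmholtz_free_energy(n_particles, volume))


def mu_derivative(density):
    """Chemical potential as (df/dn)_T = ln n for the model above."""
    return math.log(density)


if __name__ == "__main__":
    density = 0.5
    for n_particles in (10, 1000, 1000000):
        volume = n_particles / density
        print(n_particles, mu_finite_difference(n_particles, volume), mu_derivative(density))
    # The pressure -f + mu n reduces to n (that is, n k_B T) for this model.
    print(-free_energy_density(density) + mu_derivative(density) * density)
```

For this model the two expressions for the chemical potential differ by a term of order 1/N, which is the sense in which the limit of a large system is taken.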

1 In many cases it can be proved. See, for example, J. L. Lebowitz and E. H. Lieb, Phys. Rev. Lett.
22, 631 (1969).
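Appendix C


The Sommerfeld Expansion

The Sommerfeld expansion is applied to integrals of the form

\[
\int_{-\infty}^{\infty} d\mathcal{E}\,H(\mathcal{E})\,f(\mathcal{E}), \qquad
f(\mathcal{E}) = \frac{1}{e^{(\mathcal{E} - \mu)/k_BT} + 1},
\tag{C.1}
\]

where $H(\mathcal{E})$ vanishes as $\mathcal{E} \to -\infty$ and diverges no more rapidly than some power of
$\mathcal{E}$ as $\mathcal{E} \to +\infty$. If one defines

\[
K(\mathcal{E}) = \int_{-\infty}^{\mathcal{E}} H(\mathcal{E}')\,d\mathcal{E}',
\tag{C.2}
\]

so that

\[
H(\mathcal{E}) = \frac{dK(\mathcal{E})}{d\mathcal{E}},
\tag{C.3}
\]

then one can integrate by parts1 in (C.1) to get

\[
\int_{-\infty}^{\infty} H(\mathcal{E})\,f(\mathcal{E})\,d\mathcal{E}
= \int_{-\infty}^{\infty} K(\mathcal{E})\left(-\frac{\partial f}{\partial \mathcal{E}}\right) d\mathcal{E}.
\tag{C.4}
\]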

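Since f is indistinguishable from zero when $\mathcal{E}$ is more than a few $k_BT$ greater
than $\mu$, and indistinguishable from unity when $\mathcal{E}$ is more than a few $k_BT$ less than
$\mu$, its $\mathcal{E}$-derivative will be appreciable only within a few $k_BT$ of $\mu$. Provided that H
is nonsingular and not too rapidly varying in the neighborhood of $\mathcal{E} = \mu$, it is very
reasonable to evaluate (C.4) by expanding $K(\mathcal{E})$ in a Taylor series about $\mathcal{E} = \mu$, with
the expectation that only the first few terms will be of importance:

\[
K(\mathcal{E}) = K(\mu) + \sum_{n=1}^{\infty} \frac{(\mathcal{E} - \mu)^n}{n!}
\left[\frac{d^n K(\mathcal{E})}{d\mathcal{E}^n}\right]_{\mathcal{E} = \mu}.
\tag{C.5}
\]

When we substitute (C.5) in (C.4), the leading term gives just $K(\mu)$, since

\[
\int_{-\infty}^{\infty} \left(-\frac{\partial f}{\partial \mathcal{E}}\right) d\mathcal{E} = 1.
\]

Furthermore, since $\partial f/\partial \mathcal{E}$ is an even function of $\mathcal{E} - \mu$, only terms with even n in
(C.5) contribute to (C.4), and if we reexpress K in terms of the original function H
through (C.2), we find that:

\[
\int_{-\infty}^{\infty} d\mathcal{E}\,H(\mathcal{E})\,f(\mathcal{E})
= \int_{-\infty}^{\mu} H(\mathcal{E})\,d\mathcal{E}
+ \sum_{n=1}^{\infty} \int_{-\infty}^{\infty} \frac{(\mathcal{E} - \mu)^{2n}}{(2n)!}
\left(-\frac{\partial f}{\partial \mathcal{E}}\right) d\mathcal{E}\;
\left[\frac{d^{2n-1}}{d\mathcal{E}^{2n-1}} H(\mathcal{E})\right]_{\mathcal{E} = \mu}.
\tag{C.6}
\]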


1 The integrated term vanishes at $\infty$ because the Fermi function vanishes more rapidly than K
diverges, and at $-\infty$ because the Fermi function approaches unity while K approaches zero.
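Finally, making the substitution $(\mathcal{E} - \mu)/k_BT = x$, we find that

\[
\int_{-\infty}^{\infty} H(\mathcal{E})\,f(\mathcal{E})\,d\mathcal{E}
= \int_{-\infty}^{\mu} H(\mathcal{E})\,d\mathcal{E}
+ \sum_{n=1}^{\infty} a_n (k_BT)^{2n}
\left[\frac{d^{2n-1}}{d\mathcal{E}^{2n-1}} H(\mathcal{E})\right]_{\mathcal{E} = \mu},
\tag{C.7}
\]

where the $a_n$ are dimensionless numbers given by

\[
a_n = \int_{-\infty}^{\infty} \frac{x^{2n}}{(2n)!}\left(-\frac{d}{dx}\,\frac{1}{e^x + 1}\right) dx.
\tag{C.8}
\]

By elementary manipulations one can show that

\[
a_n = 2\left(1 - \frac{1}{2^{2n}} + \frac{1}{3^{2n}} - \frac{1}{4^{2n}} + \frac{1}{5^{2n}} - \cdots\right).
\tag{C.9}
\]

This is usually written in terms of the Riemann zeta function, $\zeta(n)$, as

\[
a_n = \left(2 - \frac{1}{2^{2(n-1)}}\right)\zeta(2n),
\tag{C.10}
\]

where

\[
\zeta(n) = 1 + \frac{1}{2^n} + \frac{1}{3^n} + \frac{1}{4^n} + \cdots.
\tag{C.11}
\]

For the first few n, $\zeta(2n)$ has the values2

\[
\zeta(2n) = 2^{2n-1}\,\frac{\pi^{2n}}{(2n)!}\,B_n,
\tag{C.12}
\]

where the $B_n$ are known as Bernoulli numbers, and

\[
B_1 = \frac{1}{6}, \quad B_2 = \frac{1}{30}, \quad B_3 = \frac{1}{42}, \quad B_4 = \frac{1}{30}, \quad B_5 = \frac{5}{66}.
\tag{C.13}
\]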

In most practical calculations in metals physics, one rarely needs to know more
than $\zeta(2) = \pi^2/6$, and never goes beyond $\zeta(4) = \pi^4/90$. Nevertheless, if one should
wish to carry the Sommerfeld expansion (2.70) beyond n = 5 (and hence past the
values of the $B_n$ listed in (C.13)), by the time 2n is as large as 12 the $a_n$ can be
evaluated to five-place accuracy by retaining only the first two terms in the alternating
series (C.9).
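
The coefficients $a_n$ are easy to tabulate. The sketch below evaluates them both from the closed form (C.10), using the Bernoulli numbers of (C.13) in (C.12), and from partial sums of the alternating series (C.9); the function names and the number of terms retained are illustrative choices.

```python
import math
from fractions import Fraction

# Bernoulli numbers B_1 ... B_5 in the convention of (C.12) and (C.13).
BERNOULLI = {
    1: Fraction(1, 6),
    2: Fraction(1, 30),
    3: Fraction(1, 42),
    4: Fraction(1, 30),
    5: Fraction(5, 66),
}


def zeta_even(n):
    """zeta(2n) = 2^(2n-1) pi^(2n) B_n / (2n)!  (valid here for n = 1..5)."""
    return 2 ** (2 * n - 1) * math.pi ** (2 * n) * float(BERNOULLI[n]) / math.factorial(2 * n)


def a_closed(n):
    """a_n = (2 - 1/2^(2(n-1))) zeta(2n)."""
    return (2.0 - 1.0 / 2 ** (2 * (n - 1))) * zeta_even(n)


def a_series(n, terms=2000):
    """Partial sum of a_n = 2 (1 - 1/2^(2n) + 1/3^(2n) - ...)."""
    total = 0.0
    for m in range(1, terms + 1):
        total += (-1) ** (m + 1) / m ** (2 * n)
    return 2.0 * total


if __name__ == "__main__":
    for n in range(1, 6):
        print(n, a_closed(n), a_series(n))
    # For 2n = 12 two terms of the series already agree with many terms to five places.
    print(a_series(6, terms=2), a_series(6))
```

The first two coefficients are $a_1 = \pi^2/6$ and $a_2 = 7\pi^4/360$, the familiar numbers of the low-temperature expansion.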


2 See, for example, E. Jahnke and F. Emde, Tables of Functions, 4th ed., Dover, New York, 1945,
p. 272.
Appendix D


Plane-Wave Expansions of Periodic
Functions in More Than One Dimension


We start from the general observation that the plane waves $e^{i\mathbf{k}\cdot\mathbf{r}}$ form a complete set
of functions in which any function (subject to suitable conditions of regularity) can be
expanded.1 If a function $f(\mathbf{r})$ has the periodicity of a Bravais lattice, i.e., if $f(\mathbf{r} + \mathbf{R}) = f(\mathbf{r})$
for all r and all R in the Bravais lattice, then only plane waves with the periodicity
of the Bravais lattice can occur in the expansion. Since the set of wave vectors for
plane waves with the periodicity of the lattice is just the reciprocal lattice, a function
periodic in the direct lattice will have a plane wave expansion of the form

\[
f(\mathbf{r}) = \sum_{\mathbf{K}} f_{\mathbf{K}}\,e^{i\mathbf{K}\cdot\mathbf{r}},
\tag{D.1}
\]

where the sum is over all reciprocal lattice vectors K.
The Fourier coefficients $f_{\mathbf{K}}$ are given by

\[
f_{\mathbf{K}} = \frac{1}{v}\int_C d\mathbf{r}\,e^{-i\mathbf{K}\cdot\mathbf{r}} f(\mathbf{r}),
\tag{D.2}
\]

where the integral is over any direct lattice primitive cell C, and v is the volume of
the primitive cell.2 This follows from multiplying (D.1) by $e^{-i\mathbf{K}\cdot\mathbf{r}}/v$ and integrating
over the primitive cell C, provided that3 we can show

\[
\int_C d\mathbf{r}\,e^{i\mathbf{K}\cdot\mathbf{r}} = 0
\tag{D.3}
\]

for any nonvanishing reciprocal lattice vector K.

To establish (D.3) we merely note that since $e^{i\mathbf{K}\cdot\mathbf{r}}$ has the periodicity of the lattice
(K being a reciprocal lattice vector) its integral over a primitive cell is independent of
the choice of cell.2 In particular, it will not be changed if we translate the primitive
cell C through a vector d (not necessarily a Bravais lattice vector). However, the
integral over the translated cell C' can be written as the integral of $e^{i\mathbf{K}\cdot(\mathbf{r}+\mathbf{d})}$ over the


1  No  attempt  at  mathematical  rigor  is  made  here.  The  mathematical  subtleties  in  three  dimensions 
are  no  more  difficult  than  they  are  in  one,  since  the  function  can  be  considered  one  variable  at  a  time. 
Our  concern  is  more  one  of  bookkeeping  and  notation:  how  most  compactly  to  express  the  basic  formulas 
of  three-dimensional  Fourier  series,  quite  aside  from  how  they  can  be  rigorously  derived. 

2  The  choice  of  cell  is  immaterial,  since  the  integrand  is  periodic.  That  the  integral  of  a  periodic 
function  over  a  primitive  cell  does  not  depend  on  the  choice  of  cells  is  most  easily  seen  by  recalling  that 
any  primitive  cell  can  be  cut  up  and  reassembled  into  any  other,  by  translating  its  pieces  through  Bravais 
lattice  vectors.  But  translating  a  periodic  function  through  a  Bravais  lattice  vector  leaves  it  unchanged. 

3 In our mathematically lax manner, we ignore possible difficulties in interchanging orders of
integration and summation.
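original cell. Thus

\[
\int_C d\mathbf{r}\,e^{i\mathbf{K}\cdot(\mathbf{r}+\mathbf{d})} = \int_C d\mathbf{r}\,e^{i\mathbf{K}\cdot\mathbf{r}},
\tag{D.4}
\]

or

\[
\left(e^{i\mathbf{K}\cdot\mathbf{d}} - 1\right)\int_C d\mathbf{r}\,e^{i\mathbf{K}\cdot\mathbf{r}} = 0.
\tag{D.5}
\]

Since $e^{i\mathbf{K}\cdot\mathbf{d}} - 1$ can vanish for arbitrary d only if K itself vanishes, this establishes
(D.3) for nonzero K.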

These formulas have several kinds of application. They can be applied directly to
functions with the real-space periodicity of a crystal Bravais lattice. They can also
be applied to functions that are periodic in k-space with the periodicity of the reciprocal
lattice. In that case, noting that the reciprocal of the reciprocal is the direct
lattice and that the volume of a reciprocal lattice primitive cell is $(2\pi)^3/v$, we can
transcribe (D.1) and (D.2) to read:

\[
\phi(\mathbf{k}) = \sum_{\mathbf{R}} e^{i\mathbf{R}\cdot\mathbf{k}}\,\phi_{\mathbf{R}}
\tag{D.6}
\]

for any $\phi(\mathbf{k})$ with the periodicity of the reciprocal lattice ($\phi(\mathbf{k} + \mathbf{K}) = \phi(\mathbf{k})$ for all k
and all reciprocal lattice vectors K), where the sum is over all direct lattice vectors
R and

\[
\phi_{\mathbf{R}} = v\int \frac{d\mathbf{k}}{(2\pi)^3}\,e^{-i\mathbf{R}\cdot\mathbf{k}}\,\phi(\mathbf{k}),
\tag{D.7}
\]

where v is the volume of the direct lattice primitive cell, and the integral is over any
reciprocal lattice primitive cell (such as the first Brillouin zone).

Another important application is to functions in real space whose only periodicity
is that imposed by the Born-von Karman boundary condition (Eq. (8.22)):

\[
f(\mathbf{r} + N_i\mathbf{a}_i) = f(\mathbf{r}), \qquad i = 1, 2, 3.
\tag{D.8}
\]

Such functions are periodic in a very large (and unphysical) Bravais lattice generated
by the three primitive vectors $N_i\mathbf{a}_i$, i = 1, 2, 3. The reciprocal to this lattice has
primitive vectors $\mathbf{b}_i/N_i$, where the $\mathbf{b}_i$ are related to the $\mathbf{a}_i$ by Eq. (5.3). A vector of this
reciprocal lattice has the form

\[
\mathbf{k} = \sum_{i=1}^{3} \frac{m_i}{N_i}\,\mathbf{b}_i, \qquad m_i \text{ integral}.
\tag{D.9}
\]

Since the volume of the primitive cell associated with the Born-von Karman periodicity
is the volume V of the entire crystal, (D.1) and (D.2) now become

\[
f(\mathbf{r}) = \sum_{\mathbf{k}} f_{\mathbf{k}}\,e^{i\mathbf{k}\cdot\mathbf{r}}
\tag{D.10}
\]

for any f satisfying the Born-von Karman boundary condition (D.8), where the
sum is over all k of the form (D.9) and

\[
f_{\mathbf{k}} = \frac{1}{V}\int d\mathbf{r}\,e^{-i\mathbf{k}\cdot\mathbf{r}} f(\mathbf{r}),
\tag{D.11}
\]

where the integral is over the entire crystal. Note also the analogue of (D.3):

\[
\int_V d\mathbf{r}\,e^{i\mathbf{k}\cdot\mathbf{r}} = 0
\tag{D.12}
\]

for any k of the form (D.9) different from 0.


Appendix  E 


The  Velocity  and  Effective  Mass  of 
Bloch  Electrons 


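One may evaluate the derivatives $\partial\mathcal{E}_n/\partial k_i$ and $\partial^2\mathcal{E}_n/\partial k_i\,\partial k_j$ by noting that they are
the coefficients of the linear and quadratic terms in q, in the expansion

\[
\mathcal{E}_n(\mathbf{k} + \mathbf{q}) = \mathcal{E}_n(\mathbf{k}) + \sum_i \frac{\partial \mathcal{E}_n}{\partial k_i}\,q_i
+ \frac{1}{2}\sum_{ij} \frac{\partial^2 \mathcal{E}_n}{\partial k_i\,\partial k_j}\,q_i q_j + O(q^3).
\tag{E.1}
\]

Since, however, $\mathcal{E}_n(\mathbf{k} + \mathbf{q})$ is the eigenvalue of $H_{\mathbf{k}+\mathbf{q}}$ (Eq. (8.48)), we can calculate the
required terms from the fact that

\[
H_{\mathbf{k}+\mathbf{q}} = H_{\mathbf{k}} + \frac{\hbar^2}{m}\,\mathbf{q}\cdot\left(\frac{1}{i}\nabla + \mathbf{k}\right) + \frac{\hbar^2}{2m}\,q^2,
\tag{E.2}
\]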


as  an  exercise  in  perturbation  theory. 

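Perturbation theory asserts that if $H = H_0 + V$ and the normalized eigenvectors
and eigenvalues of $H_0$ are

\[
H_0\psi_n = E_n^0\,\psi_n,
\tag{E.3}
\]

then to second order in V, the corresponding eigenvalues of H are

\[
E_n = E_n^0 + \int d\mathbf{r}\,\psi_n^{*} V \psi_n
+ \sum_{n' \ne n} \frac{\left|\int d\mathbf{r}\,\psi_n^{*} V \psi_{n'}\right|^2}{E_n^0 - E_{n'}^0} + \cdots.
\tag{E.4}
\]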


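To calculate to linear order in q we need only keep the term linear in q in (E.2)
and insert it into the first-order term in (E.4). In this way, we find that

\[
\sum_i \frac{\partial \mathcal{E}_n}{\partial k_i}\,q_i
= \sum_i \int d\mathbf{r}\,u_{n\mathbf{k}}^{*}\,\frac{\hbar^2}{m}\left(\frac{1}{i}\nabla + \mathbf{k}\right)_i q_i\,u_{n\mathbf{k}}
\tag{E.5}
\]

(where the integrations are either over a primitive cell or over the entire crystal,
depending on whether the normalization integral $\int d\mathbf{r}\,|u_{n\mathbf{k}}|^2$ has been taken equal to
unity over a primitive cell or over the entire crystal). Therefore

\[
\frac{\partial \mathcal{E}_n}{\partial \mathbf{k}} = \frac{\hbar^2}{m}\int d\mathbf{r}\,u_{n\mathbf{k}}^{*}\left(\frac{1}{i}\nabla + \mathbf{k}\right)u_{n\mathbf{k}}.
\tag{E.6}
\]

If we express this in terms of the Bloch functions $\psi_{n\mathbf{k}}$ via (8.3), it can be written as

\[
\frac{\partial \mathcal{E}_n}{\partial \mathbf{k}} = \frac{\hbar^2}{m}\int d\mathbf{r}\,\psi_{n\mathbf{k}}^{*}\,\frac{1}{i}\nabla\,\psi_{n\mathbf{k}}.
\tag{E.7}
\]

Since $(1/m)(\hbar/i)\nabla$ is the velocity operator,1 this establishes that $(1/\hbar)(\partial\mathcal{E}_n(\mathbf{k})/\partial\mathbf{k})$ is the
mean velocity of an electron in the Bloch level given by n, k.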


1 The velocity operator is $\mathbf{v} = d\mathbf{r}/dt = (1/i\hbar)[\mathbf{r}, H] = \mathbf{p}/m = \hbar\nabla/mi$.
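To calculate $\partial^2\mathcal{E}_n/\partial k_i\,\partial k_j$ we need $\mathcal{E}_n(\mathbf{k} + \mathbf{q})$ to second order in q. Equations (E.2)
and (E.4) give2

\[
\sum_{ij} \frac{1}{2}\,\frac{\partial^2 \mathcal{E}_n}{\partial k_i\,\partial k_j}\,q_i q_j
= \frac{\hbar^2}{2m}\,q^2
+ \sum_{n' \ne n} \frac{\left|\int d\mathbf{r}\,u_{n\mathbf{k}}^{*}\,\frac{\hbar^2}{m}\,\mathbf{q}\cdot\left(\frac{1}{i}\nabla + \mathbf{k}\right)u_{n'\mathbf{k}}\right|^2}{\mathcal{E}_{n\mathbf{k}} - \mathcal{E}_{n'\mathbf{k}}}.
\tag{E.8}
\]

Again using (8.3), we can express (E.8) in terms of Bloch functions:

\[
\sum_{ij} \frac{1}{2}\,\frac{\partial^2 \mathcal{E}_n}{\partial k_i\,\partial k_j}\,q_i q_j
= \frac{\hbar^2}{2m}\,q^2
+ \sum_{n' \ne n} \frac{\left|\langle n\mathbf{k}|\,\frac{\hbar^2}{mi}\,\mathbf{q}\cdot\nabla\,|n'\mathbf{k}\rangle\right|^2}{\mathcal{E}_{n\mathbf{k}} - \mathcal{E}_{n'\mathbf{k}}},
\tag{E.9}
\]

where we have used the notation:

\[
\int d\mathbf{r}\,\psi_{n\mathbf{k}}^{*}\,X\,\psi_{n'\mathbf{k}} = \langle n\mathbf{k}|X|n'\mathbf{k}\rangle.
\tag{E.10}
\]

Consequently,

\[
\frac{\partial^2 \mathcal{E}_n(\mathbf{k})}{\partial k_i\,\partial k_j}
= \frac{\hbar^2}{m}\,\delta_{ij}
+ \left(\frac{\hbar^2}{m}\right)^2 \sum_{n' \ne n}
\frac{\langle n\mathbf{k}|\frac{1}{i}\nabla_i|n'\mathbf{k}\rangle\langle n'\mathbf{k}|\frac{1}{i}\nabla_j|n\mathbf{k}\rangle
+ \langle n\mathbf{k}|\frac{1}{i}\nabla_j|n'\mathbf{k}\rangle\langle n'\mathbf{k}|\frac{1}{i}\nabla_i|n\mathbf{k}\rangle}
{\mathcal{E}_n(\mathbf{k}) - \mathcal{E}_{n'}(\mathbf{k})}.
\tag{E.11}
\]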

The quantity on the right side of (E.11) (times a factor $1/\hbar^2$) is the inverse “effective
mass tensor” (page 228) and the formula (E.11) is often called “the effective mass
theorem.”


2 The first term on the right of (E.8) comes from placing the second-order term in $H_{\mathbf{k}+\mathbf{q}}$ (Eq. (E.2))
into the first-order term in the perturbation theory formula (Eq. (E.4)). The second term on the right comes
from placing the first-order term in $H_{\mathbf{k}+\mathbf{q}}$ into the second-order term in the perturbation theory formula.


Appendix  F 


Some  Identities  Related  to 
Fourier  Analysis  of  Periodic  Systems 

To  derive  Fourier  inversion  formulas  it  suffices  to  establish  the  identity 


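\[
\sum_{\mathbf{R}} e^{i\mathbf{k}\cdot\mathbf{R}} = N\delta_{\mathbf{k},0},
\tag{F.1}
\]

where R runs through the N sites of the Bravais lattice

\[
\mathbf{R} = \sum_{i=1}^{3} n_i\mathbf{a}_i, \qquad 0 \le n_i < N_i, \qquad N_1N_2N_3 = N,
\tag{F.2}
\]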


and  k  is  any  vector  in  the  first  Brillouin  zone  consistent  with  the  Born-von  Karman 
boundary  condition  appropriate  to  the  N  points  specified  by  (F.2). 

The  identity  is  most  simply  proved  by  noting  that  because  k  is  consistent  with 
the  Born-von  Karman  periodic  boundary  condition,  the  value  of  the  sum  in  (F.l) 
is  unchanged  if  every  R  is  displaced  by  the  same  R0,  where  R0  is  itself  any  vector 
of  the  form  (F.2): 


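\[
\sum_{\mathbf{R}} e^{i\mathbf{k}\cdot\mathbf{R}} = \sum_{\mathbf{R}} e^{i\mathbf{k}\cdot(\mathbf{R}+\mathbf{R}_0)}
= e^{i\mathbf{k}\cdot\mathbf{R}_0}\sum_{\mathbf{R}} e^{i\mathbf{k}\cdot\mathbf{R}}.
\tag{F.3}
\]

Consequently the sum must vanish unless $e^{i\mathbf{k}\cdot\mathbf{R}_0} = 1$ for all $\mathbf{R}_0$ of the form (F.2),
i.e., for all vectors $\mathbf{R}_0$ of the Bravais lattice. This is possible only if k is a reciprocal
lattice vector. But k = 0 is the only reciprocal lattice vector in the first Brillouin
zone.1 Hence the left side of (F.1) does indeed vanish if $\mathbf{k} \ne 0$, and is trivially equal
to N, when k = 0.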

A closely related identity of similar importance is

\[
\sum_{\mathbf{k}} e^{i\mathbf{k}\cdot\mathbf{R}} = N\delta_{\mathbf{R},0},
\tag{F.4}
\]

where R is any vector of the form (F.2), and the sum on k runs through all sites in
the first Brillouin zone consistent with the Born-von Karman boundary condition.
The sum in (F.4) is now unchanged if every k is translated by the same vector $\mathbf{k}_0$
lying in the first zone and consistent with the Born-von Karman boundary condition,
for the primitive cell constructed by shifting the entire first zone by $\mathbf{k}_0$ can be
reassembled into the first zone by shifting appropriate pieces of it through reciprocal
lattice vectors. Since no term of the form $e^{i\mathbf{k}\cdot\mathbf{R}}$ is changed when k is shifted by a


1  If  k  is  not  restricted  to  the  first  zone,  the  sum  in  (F.l)  will  therefore  vanish  unless  k  is  a  reciprocal 
lattice  vector  K,  in  which  case  it  will  be  equal  to  N. 
reciprocal lattice vector, the sum over the shifted zone is identical to the sum over
the original zone. Thus:

\[
\sum_{\mathbf{k}} e^{i\mathbf{k}\cdot\mathbf{R}} = \sum_{\mathbf{k}} e^{i(\mathbf{k}+\mathbf{k}_0)\cdot\mathbf{R}}
= e^{i\mathbf{k}_0\cdot\mathbf{R}}\sum_{\mathbf{k}} e^{i\mathbf{k}\cdot\mathbf{R}},
\tag{F.5}
\]

and therefore the sum on the left side of (F.4) must vanish unless $e^{i\mathbf{k}_0\cdot\mathbf{R}}$ is unity for
all $\mathbf{k}_0$ consistent with the Born-von Karman boundary condition. The only R of
the form (F.2) for which this is possible is R = 0. And when R = 0, the sum in (F.4)
is trivially equal to N.
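
Both identities are easy to confirm numerically in one dimension. The sketch below considers a chain of N sites $R = ra$ (r = 0, ..., N − 1) with the allowed wave vectors $k = 2\pi m/Na$, so that $kR = 2\pi mr/N$; the one-dimensional setting and the function name are simplifying assumptions made for illustration. In one dimension the sums in (F.1) and (F.4) have the same form, with the roles of m and r interchanged.

```python
import cmath
import math


def lattice_sum(m, n_sites):
    """Sum over r = 0..N-1 of exp(2 pi i m r / N), i.e. the sum of exp(i k R)."""
    return sum(cmath.exp(2j * math.pi * m * r / n_sites) for r in range(n_sites))


if __name__ == "__main__":
    n_sites = 12
    for m in range(-2, 3):
        print(m, abs(lattice_sum(m, n_sites)))
    # m = N corresponds to k equal to a reciprocal lattice vector: the sum is again N.
    print(n_sites, abs(lattice_sum(n_sites, n_sites)))
```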


Appendix  G 


The Variational Principle for
Schrödinger's Equation

We wish to show that the functional $E[\psi]$ (Eq. (11.17)) is made stationary over all
differentiable functions $\psi$ satisfying the Bloch condition with wave vector k, by the
$\psi_{\mathbf{k}}$ that satisfy the Schrödinger equation:

\[
-\frac{\hbar^2}{2m}\nabla^2\psi_{\mathbf{k}} + U(\mathbf{r})\psi_{\mathbf{k}} = \mathcal{E}_{\mathbf{k}}\psi_{\mathbf{k}}.
\tag{G.1}
\]

By this we mean the following: Let $\psi$ be close to one of the $\psi_{\mathbf{k}}$, so that

\[
\psi = \psi_{\mathbf{k}} + \delta\psi,
\tag{G.2}
\]

where $\delta\psi$ is small. Let $\psi$ satisfy the Bloch condition with wave vector k, so that $\delta\psi$
does as well. Then

\[
E[\psi] = E[\psi_{\mathbf{k}}] + O(\delta\psi)^2.
\tag{G.3}
\]

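To keep notation simple in the proof it helps to define

\[
F[\phi, \chi] = \int d\mathbf{r}\left(\frac{\hbar^2}{2m}\nabla\phi^{*}\cdot\nabla\chi + U(\mathbf{r})\phi^{*}\chi\right)
\tag{G.4}
\]

and to use the standard notation

\[
(\phi, \chi) = \int d\mathbf{r}\,\phi^{*}\chi.
\tag{G.5}
\]

Note that $E[\psi]$ is then given by

\[
E[\psi] = \frac{F[\psi, \psi]}{(\psi, \psi)}.
\tag{G.6}
\]

The variational principle follows directly from the fact that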


\[
F[\phi, \psi_{\mathbf{k}}] = \mathcal{E}_{\mathbf{k}}(\phi, \psi_{\mathbf{k}}), \qquad
F[\psi_{\mathbf{k}}, \phi] = \mathcal{E}_{\mathbf{k}}(\psi_{\mathbf{k}}, \phi),
\tag{G.7}
\]

for arbitrary $\phi$ that satisfy the Bloch condition with wave vector k. This is because
the Bloch condition requires the integrands in (G.7) to have the periodicity of the
lattice. Consequently, one can use the integration-by-parts formulas of Appendix I
to transfer both gradients onto $\psi_{\mathbf{k}}$; (G.7) then follows at once from the fact that $\psi_{\mathbf{k}}$
satisfies the Schrödinger equation (G.1).

We  can  now  write 


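\[
\begin{aligned}
F[\psi, \psi] &= F[\psi_{\mathbf{k}} + \delta\psi,\ \psi_{\mathbf{k}} + \delta\psi] \\
&= F[\psi_{\mathbf{k}}, \psi_{\mathbf{k}}] + F[\delta\psi, \psi_{\mathbf{k}}] + F[\psi_{\mathbf{k}}, \delta\psi] + O(\delta\psi)^2 \\
&= \mathcal{E}_{\mathbf{k}}\left\{(\psi_{\mathbf{k}}, \psi_{\mathbf{k}}) + (\psi_{\mathbf{k}}, \delta\psi) + (\delta\psi, \psi_{\mathbf{k}})\right\} + O(\delta\psi)^2.
\end{aligned}
\tag{G.8}
\]

Furthermore,

\[
(\psi, \psi) = (\psi_{\mathbf{k}}, \psi_{\mathbf{k}}) + (\psi_{\mathbf{k}}, \delta\psi) + (\delta\psi, \psi_{\mathbf{k}}) + O(\delta\psi)^2.
\tag{G.9}
\]

Dividing (G.8) by (G.9), we have

\[
E[\psi] = \frac{F[\psi, \psi]}{(\psi, \psi)} = \mathcal{E}_{\mathbf{k}} + O(\delta\psi)^2,
\tag{G.10}
\]

which establishes the variational principle. (The fact that $E[\psi_{\mathbf{k}}] = \mathcal{E}_{\mathbf{k}}$ follows at once
from (G.10) by setting $\delta\psi$ equal to zero.)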

Note that nothing in the derivation requires $\psi$ to have a continuous first derivative.
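
The content of the variational principle, that a first-order error in the trial function produces only a second-order error in the energy, can be seen in a finite-dimensional analogue. The sketch below replaces the Schrödinger operator by a 2 × 2 real symmetric matrix, which is an assumption made purely for illustration (as are the names used); the ratio $F[\psi,\psi]/(\psi,\psi)$ then becomes the familiar Rayleigh quotient.

```python
def rayleigh_quotient(h, psi):
    """E[psi] = (psi, H psi) / (psi, psi) for a real symmetric matrix h (list of rows)."""
    size = len(psi)
    h_psi = [sum(h[i][j] * psi[j] for j in range(size)) for i in range(size)]
    numerator = sum(p * q for p, q in zip(psi, h_psi))
    denominator = sum(p * p for p in psi)
    return numerator / denominator


# Model "Hamiltonian": eigenvector (1, 1) with eigenvalue 3, (1, -1) with eigenvalue 1.
H_MODEL = [[2.0, 1.0], [1.0, 2.0]]


def energy_error(eps):
    """Error in the energy when the eigenvector (1, 1) is perturbed by eps * (1, -1)."""
    trial = [1.0 + eps, 1.0 - eps]
    return rayleigh_quotient(H_MODEL, trial) - 3.0


if __name__ == "__main__":
    # Reducing the perturbation tenfold reduces the energy error a hundredfold.
    for eps in (1.0e-1, 1.0e-2, 1.0e-3):
        print(eps, energy_error(eps))
```

In this example the error is negative, a reminder that the principle asserts stationarity and not necessarily a minimum.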


Appendix  H 


Hamiltonian Formulation of the
Semiclassical Equations of Motion,
and Liouville’s Theorem


The  semiclassical  equations  of  motion  (12.6a)  and  (12.6b)  can  be  written  in  the 
canonical  Hamiltonian  form 

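\[
\dot{\mathbf{r}} = \frac{\partial H}{\partial \mathbf{p}}, \qquad \dot{\mathbf{p}} = -\frac{\partial H}{\partial \mathbf{r}},
\tag{H.1}
\]

where the Hamiltonian for electrons in the nth band is

\[
H(\mathbf{r}, \mathbf{p}) = \mathcal{E}_n\!\left(\frac{1}{\hbar}\left[\mathbf{p} + \frac{e}{c}\mathbf{A}(\mathbf{r}, t)\right]\right) - e\phi(\mathbf{r}, t),
\tag{H.2}
\]

the fields are given in terms of the vector and scalar potentials by

\[
\mathbf{H} = \nabla \times \mathbf{A}, \qquad \mathbf{E} = -\nabla\phi - \frac{1}{c}\frac{\partial \mathbf{A}}{\partial t},
\tag{H.3}
\]

and the variable k appearing in (12.6a) and (12.6b) is defined to be

\[
\hbar\mathbf{k} = \mathbf{p} + \frac{e}{c}\mathbf{A}(\mathbf{r}, t).
\tag{H.4}
\]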


To  verify  that  (12.6a)  and  (12.6b)  follow  from  (H.l)  through  (H.4)  is  a  somewhat 
complicated,  but  conceptually  straightforward,  exercise  in  differentiation  (just  as  it 
is  in  the  free  electron  case). 

Note  that  the  canonical  crystal  momentum  (the  variable  that  plays  the  role  of  the 
canonical momentum in the Hamiltonian formulation) is not $\hbar\mathbf{k}$, but (from Eq. (H.4)),

\[
\mathbf{p} = \hbar\mathbf{k} - \frac{e}{c}\mathbf{A}(\mathbf{r}, t).
\tag{H.5}
\]

Because  the  semiclassical  equations  for  each  band  have  the  canonical  Hamiltonian 
form,  Liouville’s  theorem1  implies  that  regions  of  six-dimensional  rp-space  evolve  in 
time  in  such  a  way  as  to  preserve  their  volume.  Because,  however,  k  differs  from  p 
only  by  an  additive  vector  that  does  not  depend  on  p,  any  region  in  rp-space  has  the 
same volume as the corresponding region in rk-space.2 This establishes Liouville’s
theorem  in  the  form  used  in  Chapters  12  and  13. 


1 The proof depends only on the equations of motion having the form (H.1). See, for example, Keith
R. Symon, Mechanics, 3rd ed., Addison-Wesley, Reading, Mass., 1971, p. 395.

2 Formally, the Jacobian $\partial(\mathbf{r}, \mathbf{p})/\partial(\mathbf{r}, \mathbf{k})$ is unity.
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Green’s  Theorem  for  Periodic  Functions 

If  u(r)  and  t(r)  both  have  the  periodicity  of  a  Bravais  lattice,1  then  the  following 
identities  hold  for  integrals  taken  over  a  primitive  cell  C : 


J*  dr  u  Vp  =  —  J  dr  v  Vu, 

/»  /» 

(1-1) 

J  dr  u  V2u  =  1  dr  v  V2u. 

(1-2) 

These  are  proved  as  follows: 

Let/(r)  be  any  function  with  the  periodicity  of  the  Bravais  lattice.  Since  C  is  a 
primitive  cell,  the  integral 

f(r')  =  j*/(r  +  0 

(1.3) 

is  independent  of  r'.  Therefore,  in  particular. 

^T(r')  =  j"  dr  \'f( r  +  r')  =  J  dr  V/(r  +  r')  =  0, 

(1.4) 

V'2/(r')  =  J"  dr  V'2/(r  f  r')  =  j"  dr  V2/(r  +  r')  =  0. 

(1.5) 

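Evaluating these at $\mathbf{r}' = 0$ we find that any periodic $f$ satisfies

$$\int_C d\mathbf{r}\; \nabla f(\mathbf{r}) = 0, \tag{I.6}$$

$$\int_C d\mathbf{r}\; \nabla^2 f(\mathbf{r}) = 0. \tag{I.7}$$

Equation (I.1) follows directly from (I.6) applied to the case $f = uv$. To derive (I.2), set $f = uv$ in (I.7) to find

$$\int_C d\mathbf{r}\; (\nabla^2 u)\,v + \int_C d\mathbf{r}\; u\,(\nabla^2 v) + 2\int_C d\mathbf{r}\; \nabla u \cdot \nabla v = 0. \tag{I.8}$$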

We can apply (I.1) to the last term in (I.8), taking the two periodic functions in (I.1) to be $v$ and the various components of the gradient of $u$. This gives

$$2\int_C d\mathbf{r}\; \nabla u \cdot \nabla v = -2\int_C d\mathbf{r}\; u\,\nabla^2 v, \tag{I.9}$$

which reduces (I.8) to (I.2).
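The two identities can be spot-checked numerically. The sketch below is only an illustration under stated assumptions: it works in one dimension, the cell length `a` and the two trigonometric test functions `u` and `v` are hypothetical choices, and the cell integral is a plain equally spaced sum (which is accurate to rounding error for low-order trigonometric polynomials).

```python
import math

a = 2.0                    # length of the 1D primitive cell (arbitrary choice)
g = 2.0 * math.pi / a      # smallest reciprocal lattice vector

# Hypothetical periodic test functions with hand-written derivatives.
def u(x):   return 1.0 + math.cos(g * x) + 0.5 * math.sin(2 * g * x)
def du(x):  return -g * math.sin(g * x) + g * math.cos(2 * g * x)
def d2u(x): return -g**2 * math.cos(g * x) - 2 * g**2 * math.sin(2 * g * x)

def v(x):   return math.sin(g * x) + 0.7 * math.cos(g * x) + 0.3 * math.sin(2 * g * x)
def dv(x):  return g * math.cos(g * x) - 0.7 * g * math.sin(g * x) + 0.6 * g * math.cos(2 * g * x)
def d2v(x): return -g**2 * math.sin(g * x) - 0.7 * g**2 * math.cos(g * x) - 1.2 * g**2 * math.sin(2 * g * x)

def cell_integral(f, x0=0.0, n=64):
    """Integrate f over one cell [x0, x0 + a) with n equally spaced samples."""
    h = a / n
    return h * sum(f(x0 + j * h) for j in range(n))

# First-derivative identity: integral of u v' equals minus integral of v u'.
lhs_1 = cell_integral(lambda x: u(x) * dv(x))
rhs_1 = -cell_integral(lambda x: v(x) * du(x))

# Second-derivative identity: integral of u v'' equals integral of v u''.
lhs_2 = cell_integral(lambda x: u(x) * d2v(x))
rhs_2 = cell_integral(lambda x: v(x) * d2u(x))

# The cell integral of a periodic function does not depend on the cell origin.
shifted_1 = cell_integral(lambda x: u(x) * dv(x), x0=0.37)
```

With these choices both sides of each identity are nonzero, so the agreement is not a trivial 0 = 0, and moving the origin of the cell leaves the integral unchanged, which is the property the proof starts from.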


1 We use a real-space notation, although the theorem, of course, also holds for periodic functions in k-space.

Conditions for the Absence of
Interband Transitions in
Uniform Electric or Magnetic Fields

The theories of electric breakdown and magnetic breakthrough that underlie the conditions (12.8) and (12.9) are quite intricate. In this appendix we present some very crude ways of understanding the conditions.

In  the  limit  of  vanishingly  small  periodic  potential,  electric  breakdown  occurs 
whenever  the  wave  vector  of  an  electron  crosses  a  Bragg  plane  (page  219).  When 
the  periodic  potential  is  weak,  but  not  zero,  we  can  ask  why  breakdown  might  still 
be  expected  to  occur  near  a  Bragg  plane,  and  how  strong  the  potential  must  be  for 
this  possibility  to  be  excluded. 

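In a weak periodic potential, points near Bragg planes are characterized by $\mathcal{E}(\mathbf{k})$ having a large curvature (see, for example, Figure 9.3). As a result, near a Bragg plane a small spread in wave vector can cause a large spread in velocity, since

$$\Delta v(\mathbf{k}) = \frac{\partial v}{\partial k}\,\Delta k \approx \frac{1}{\hbar}\left(\frac{\partial^2 \mathcal{E}}{\partial k^2}\right)\Delta k. \tag{J.1}$$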


For the semiclassical picture to remain valid, the uncertainty in velocity must remain small compared with a typical electronic velocity $v_F$. This sets an upper limit on $\Delta k$:

$$\Delta k \ll \frac{\hbar v_F}{\partial^2 \mathcal{E}/\partial k^2}. \tag{J.2}$$


Since the periodic potential is weak, we can estimate the maximum value of $\partial^2 \mathcal{E}/\partial k^2$ by differentiating the nearly free electron result (9.26) in a direction normal to the Bragg plane, and evaluating the result on the plane:

$$\frac{\partial^2 \mathcal{E}}{\partial k^2} \approx \frac{(\hbar^2 K/m)^2}{|U_{\mathbf{K}}|}. \tag{J.3}$$

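Since $\hbar K/m \approx v_F$, and $\mathcal{E}_{\text{gap}} \approx |U_{\mathbf{K}}|$ (see Eq. (9.27)), Eq. (J.2) becomes

$$\Delta k \ll \frac{\mathcal{E}_{\text{gap}}}{\hbar v_F}, \tag{J.4}$$

which sets a lower limit on the uncertainty in the electron’s position,

$$\Delta x \sim \frac{1}{\Delta k} \gg \frac{\hbar v_F}{\mathcal{E}_{\text{gap}}}. \tag{J.5}$$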

As a result of this uncertainty in position the potential energy of the electron in the applied field is uncertain by

$$e\,\Delta\phi = eE\,\Delta x \gg \frac{eE\hbar v_F}{\mathcal{E}_{\text{gap}}}. \tag{J.6}$$

If this uncertainty in potential energy becomes comparable to the band gap, an interband transition can occur without violating energy conservation. To prohibit this we must have

$$\mathcal{E}_{\text{gap}} \gg \frac{eE\hbar v_F}{\mathcal{E}_{\text{gap}}}. \tag{J.7}$$


Since $\hbar v_F/a \approx \mathcal{E}_F$, where $a$ is the lattice constant, (J.7) can also be written in the form

$$\frac{\mathcal{E}_{\text{gap}}^2}{\mathcal{E}_F} \gg eEa. \tag{J.8}$$
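To get a feel for the sizes involved in (J.8), the following sketch evaluates both sides for one set of illustrative numbers. Everything numerical here is an assumption made for the example: the field, lattice constant, gap, and Fermi energy are hypothetical round values, and the arithmetic is done in SI units with energies in electron volts (so that $eEa$ in eV is simply the field in V/m times $a$ in m), whereas the text uses Gaussian units.

```python
def breakdown_ratio(field_v_per_m, lattice_m, gap_ev, fermi_ev):
    """Return eEa / (E_gap^2 / E_F); the condition in the text asks for this to be << 1."""
    e_field_energy_ev = field_v_per_m * lattice_m   # e*E*a expressed in eV
    return e_field_energy_ev / (gap_ev ** 2 / fermi_ev)

def threshold_field(lattice_m, gap_ev, fermi_ev):
    """Field in V/m at which eEa would equal E_gap^2 / E_F."""
    return gap_ev ** 2 / (fermi_ev * lattice_m)

# Illustrative values: E = 1 V/cm, a = 3 angstroms, gap = 0.1 eV, E_F = 5 eV.
ratio = breakdown_ratio(1.0e2, 3.0e-10, 0.1, 5.0)
e_threshold = threshold_field(3.0e-10, 0.1, 5.0)
```

For these inputs the ratio is of order $10^{-5}$, and the field at which the two sides would become equal is several million volts per meter, far above the field assumed here.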


A similarly rough argument gives the condition for magnetic breakthrough. Since energy cannot be gained from a magnetic field, breakthrough requires a lack of definition in an electron’s wave vector comparable to the distance between points of equal energy on two different branches of the Fermi surface. The k-space distance to be bridged is of order $\mathcal{E}_{\text{gap}}/|\partial\mathcal{E}/\partial k|$ (Fig. J.1), which is of order $\mathcal{E}_{\text{gap}}/\hbar v_F$. A condition prohibiting breakthrough is therefore

$$\Delta k \ll \frac{\mathcal{E}_{\text{gap}}}{\hbar v_F}. \tag{J.9}$$

Figure J.1

When two bands come close together, the separation in k-space of points with the same energy is roughly $\Delta k = \mathcal{E}_{\text{gap}}/|\partial\mathcal{E}/\partial k|$. Here $\mathcal{E}_{\text{gap}}$ is the minimum vertical separation between the two bands.


On page 230 we found that in a magnetic field the semiclassical orbit in real space is given by rotating the k-space orbit through 90° about the field direction and scaling it with the factor $\hbar c/eH$. Consequently the uncertainty relation

$$\Delta k_y\,\Delta y > 1 \tag{J.10}$$

implies a k-space uncertainty relation:

$$\Delta k_y\,\Delta k_x > \frac{eH}{\hbar c}. \tag{J.11}$$

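Thus in a magnetic field an electron cannot be localized in k-space to better than a region of dimensions

$$\Delta k \approx \left(\frac{eH}{\hbar c}\right)^{1/2}. \tag{J.12}$$

In conjunction with (J.9) this means that to avoid breakthrough it is necessary that

$$\left(\frac{eH}{\hbar c}\right)^{1/2} \ll \frac{\mathcal{E}_{\text{gap}}}{\hbar v_F}. \tag{J.13}$$

Taking $\mathcal{E}_F = \frac{1}{2}mv_F^2$, we can rewrite this condition (with $\omega_c = eH/mc$, and dropping a numerical factor of order unity) as

$$\hbar\omega_c \ll \frac{\mathcal{E}_{\text{gap}}^2}{\mathcal{E}_F}. \tag{J.14}$$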
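The breakthrough condition can be evaluated in the same spirit. The sketch below is illustrative only: it uses the SI form $\omega_c = eB/m$ of the cyclotron frequency, the free electron mass, and hypothetical values for the field, the gap, and the Fermi energy.

```python
HBAR = 1.054571817e-34      # J s
M_E = 9.1093837015e-31      # kg, free electron mass (an assumption; band masses differ)

def cyclotron_energy_ev(b_tesla, mass_kg=M_E):
    """hbar * omega_c in eV, with omega_c = eB/m; the charge e cancels in the conversion to eV."""
    return HBAR * b_tesla / mass_kg

def breakthrough_ratio(b_tesla, gap_ev, fermi_ev):
    """Return hbar*omega_c / (E_gap^2 / E_F); the condition in the text asks for << 1."""
    return cyclotron_energy_ev(b_tesla) / (gap_ev ** 2 / fermi_ev)

# Illustrative values: B = 1 T (10^4 gauss), E_F = 5 eV, and two different gaps.
ratio_large_gap = breakthrough_ratio(1.0, 0.1, 5.0)
ratio_small_gap = breakthrough_ratio(1.0, 0.01, 5.0)
```

With a gap of 0.1 eV the ratio is a few percent, so the condition is comfortably met; with a gap of 0.01 eV it exceeds unity, which is the regime in which the rough argument above says breakthrough cannot be excluded.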
Optical Properties of Solids

Consider a plane electromagnetic wave with angular frequency $\omega$ propagating along the z-axis through a medium with conductivity $\sigma(\omega)$ and dielectric constant1 $\epsilon^0(\omega)$. (We ignore in this discussion the case of magnetic media, assuming that the magnetic permeability $\mu$ is unity; i.e., we take $\mathbf{B}$ equal to $\mathbf{H}$ in Maxwell’s equations.) If we define $\mathbf{D}(z, \omega)$, $\mathbf{E}(z, \omega)$, and $\mathbf{j}(z, \omega)$ in the usual way:

$$\mathbf{j}(z, t) = \mathrm{Re}\,[\mathbf{j}(z, \omega)e^{-i\omega t}],\ \text{etc.}, \tag{K.1}$$

then the electric displacement and current density are related to the electric field by:

$$\mathbf{j}(z, \omega) = \sigma(\omega)\mathbf{E}(z, \omega); \qquad \mathbf{D}(z, \omega) = \epsilon^0(\omega)\mathbf{E}(z, \omega). \tag{K.2}$$

ASSUMPTION  OF  LOCALITY 

Equation (K.2) is a local relation; i.e., the current or displacement at a point is entirely determined by the value of the electric field at the same point. This assumption is valid (see pages 16-17) provided that the fields vary little over distances comparable to the electronic mean free path in the medium. Once the fields have been calculated assuming locality, the assumption is easily checked and found to be valid at optical frequencies.


ASSUMPTION  OF  ISOTROPY 

For simplicity we assume the medium is sufficiently simple that $\sigma_{ij}(\omega) = \sigma(\omega)\delta_{ij}$, $\epsilon^0_{ij}(\omega) = \epsilon^0(\omega)\delta_{ij}$; i.e., $\mathbf{D}$ and $\mathbf{j}$ are parallel to $\mathbf{E}$. This will be true, for example, in any crystal with cubic symmetry or in any polycrystalline specimen. To study birefringent crystals the assumption must be dropped.


CONVENTIONAL NATURE OF THE
DISTINCTION BETWEEN $\epsilon^0(\omega)$ AND $\sigma(\omega)$

The dielectric constant and conductivity enter into a determination of the optical properties of a solid only in the combination:

$$\epsilon(\omega) = \epsilon^0(\omega) + \frac{4\pi i\,\sigma(\omega)}{\omega}. \tag{K.3}$$


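As a result, one is free to redefine $\epsilon^0$ by adding to it an arbitrary function of frequency, provided one makes a corresponding redefinition of $\sigma$ so that the combination (K.3) is preserved:

$$\epsilon^0(\omega) \to \epsilon^0(\omega) + \delta\epsilon(\omega), \qquad \sigma(\omega) \to \sigma(\omega) - \frac{\omega}{4\pi i}\,\delta\epsilon(\omega). \tag{K.4}$$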
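A quick numerical check makes the bookkeeping in this redefinition explicit. The sketch below is only an illustration: the function names and the sample complex values of $\epsilon^0$, $\sigma$, $\omega$, and $\delta\epsilon$ are hypothetical, chosen just to show that the combined dielectric constant is unchanged.

```python
import math

def total_epsilon(eps0, sigma, omega):
    """Combined dielectric constant eps0 + 4*pi*i*sigma/omega (Gaussian units)."""
    return eps0 + 4j * math.pi * sigma / omega

def redefine(eps0, sigma, omega, delta_eps):
    """Shift eps0 by delta_eps and compensate in sigma so the combination is preserved."""
    return eps0 + delta_eps, sigma - omega * delta_eps / (4j * math.pi)

omega = 3.0e15                      # rad/s, an arbitrary optical frequency
eps0 = 2.5 + 0.1j                   # arbitrary sample value
sigma = 1.0e16 + 4.0e15j            # arbitrary sample value, in 1/s

before = total_epsilon(eps0, sigma, omega)
eps0_new, sigma_new = redefine(eps0, sigma, omega, 0.7 - 0.2j)
after = total_epsilon(eps0_new, sigma_new, omega)
```

Here `eps0_new` and `sigma_new` differ from the original pair, yet `before` and `after` agree to rounding error, which is all the optical properties depend on.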


1 We use Gaussian units, in which the dielectric constant of empty space is unity. We use $\epsilon^0$ for the dielectric constant of the medium.

This freedom of choice reflects a genuine ambiguity in the physical definitions of $\epsilon^0$ and $\sigma$, which describe distinguishable physical processes only in the DC case, where $\sigma$ describes the “free charges” (those that can move freely over arbitrary distances in response to the DC field) and $\epsilon^0$ describes the “bound charges” (those that are bound to equilibrium positions and only stretched to new equilibrium positions by the DC field; see Figure K.1).

[Figure K.1: two panels, “Free charge” (labeled with the Drude drift velocity $eE\tau/m$) and “Bound charge”, each shown before the field is applied and in the presence of the field.]

Figure K.1

The response of “free” and “bound” charges to a DC electric field. The free charge moves for as long as the field acts, but the bound charge is constrained by restoring forces, and can be displaced (“polarized”) only to a new equilibrium position.


In the case of an AC field the distinction blurs. The free charges do not move arbitrarily far, but oscillate back and forth with the frequency of the field, whereas the bound charges no longer come to rest at new equilibrium positions, but also oscillate at the field frequency.

If the frequency of the field is sufficiently low ($\omega \ll 1/\tau$) the distinction can still be preserved, but on rather different grounds: The free charge velocities will respond in phase with the field (i.e., $\sigma(\omega)$ will be predominantly real) while the bound charge velocities will respond out of phase with the field (i.e., $\epsilon^0(\omega)$ will be predominantly real). At higher frequencies even this distinction disappears: The free charges can have a substantial out-of-phase response (indeed in metals, at optical frequencies, $\sigma(\omega)$ is predominantly imaginary) and the bound charges can have a considerable in-phase response (which may or may not be present at optical frequencies, depending on the material).
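The change from a predominantly real to a predominantly imaginary free-charge response can be illustrated with the Drude form of the AC conductivity, $\sigma(\omega) = \sigma_0/(1 - i\omega\tau)$. That formula is brought in here as an assumption (it is not derived in this appendix), and the units in the sketch are arbitrary, with $\sigma_0 = \tau = 1$.

```python
def drude_sigma(omega, tau, sigma0=1.0):
    """Drude AC conductivity sigma0 / (1 - i*omega*tau), assumed for illustration."""
    return sigma0 / (1.0 - 1j * omega * tau)

low = drude_sigma(0.01, 1.0)     # omega * tau = 0.01, the low-frequency regime
high = drude_sigma(100.0, 1.0)   # omega * tau = 100, the high-frequency regime

# Ratio of out-of-phase (imaginary) to in-phase (real) response; equals omega * tau.
low_ratio = low.imag / low.real
high_ratio = high.imag / high.real
```

In this model the ratio of the imaginary to the real part is just $\omega\tau$, so the response is almost purely in phase at low frequency and almost purely out of phase when $\omega\tau \gg 1$.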

Thus at high frequencies the distinction between free and bound charges (and hence between $\sigma(\omega)$ and $\epsilon^0(\omega)$) is entirely a conventional one. In discussing metals, at least two different conventions are widely used:

1. $\sigma$ is reserved for the response of electrons in partially filled bands (i.e., the conduction electrons) and $\epsilon^0$ describes the response of the electrons in completely filled bands (e.g., the core electrons).2

2. The response of all electrons (core and conduction) is lumped into the single dielectric constant,

$$\epsilon(\omega) = \epsilon^0(\omega) + \frac{4\pi i\,\sigma(\omega)}{\omega}, \tag{K.5}$$

where $\epsilon(\omega)$ now includes the contributions of both partially filled and filled bands to the current. This convention enables the optical theories of metals to use the same notation as is used for insulators, where, by universal convention, all charge is considered bound.

2 From the point of view of the semiclassical theory of Chapters 12 and 13, the filled bands are inert, and do not respond to the fields. However, corrections to the semiclassical theory do predict some core polarization.


THE  REFLECTIVITY 


When a plane wave is normally incident from vacuum on a medium with dielectric constant $\epsilon$ (in the sense of Eq. (K.5)) the fraction $r$ of power reflected (the reflectivity) is given by3

$$r = \frac{|E^r|^2}{|E^i|^2} = \left|\frac{1 - K}{1 + K}\right|^2 = \frac{(1 - n)^2 + k^2}{(1 + n)^2 + k^2}, \tag{K.6}$$

where

$$K = \sqrt{\epsilon}, \qquad n = \mathrm{Re}\,K, \qquad k = \mathrm{Im}\,K. \tag{K.7}$$

DETERMINATION OF $\epsilon(\omega)$ FROM THE
MEASURED REFLECTIVITY

To determine $n$ and $k$ (and hence $\epsilon = (n + ik)^2$) from the reflectivity (K.6), more information is required. Two approaches are followed:

1. One can exploit the fact that $n$ and $k$ are related by the Kramers-Kronig relations:4

$$n(\omega) = 1 + P\int_{-\infty}^{\infty} \frac{d\omega'}{\pi}\,\frac{k(\omega')}{\omega' - \omega}, \qquad k(\omega) = -P\int_{-\infty}^{\infty} \frac{d\omega'}{\pi}\,\frac{n(\omega') - 1}{\omega' - \omega}. \tag{K.8}$$

Either of these equations, plus a knowledge of $r(\omega)$ at all frequencies, permits one, in principle, to disentangle the separate values of $n(\omega)$ and $k(\omega)$. In practice the numerical analysis can be quite complicated, and the method has the disadvantage of requiring measurements to be made at enough frequencies to give reliable extrapolations to the entire frequency range, as required in (K.8).

2. One can use the generalization of (K.6) to nonnormal angles of incidence.5 One then obtains a second expression for the reflectivity at angle of incidence $\theta$, involving $\theta$, $n(\omega)$, $k(\omega)$, and the polarization of the incident radiation. By comparing this with the measured reflectivity at angle $\theta$, one obtains a second equation involving $n(\omega)$ and $k(\omega)$, and the two can then be extracted.
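The principal-value integrals in the first approach can be tried out on a toy model. The sketch below is an illustration under explicit assumptions: $n(\omega) - 1 + ik(\omega)$ is taken to be a single damped-oscillator response with hypothetical parameters `A`, `W0`, `GAMMA` (chosen only because it is analytic in the upper half-plane and falls off at large frequency), and the principal value is approximated by a midpoint sum placed symmetrically about the singular point so that the divergent contributions cancel.

```python
import math

A, W0, GAMMA = 1.0, 1.0, 0.5   # hypothetical strength, resonance frequency, damping

def response(w):
    """Toy model for n(w) - 1 + i k(w); its poles lie in the lower half-plane."""
    return A / (W0 ** 2 - w ** 2 - 1j * GAMMA * w)

def n_minus_1(w):
    return response(w).real

def k(w):
    return response(w).imag

def principal_value(f, w, steps=20000, h=0.01):
    """Approximate P-integral of f(w') / (w' - w) using midpoints symmetric about w."""
    total = 0.0
    for j in range(steps):
        x = (j + 0.5) * h
        # Points w + x and w - x contribute f(w + x)/x and f(w - x)/(-x).
        total += (f(w + x) - f(w - x)) / x
    return total * h

w = 0.8
n_from_k = 1.0 + principal_value(k, w) / math.pi
k_from_n = -principal_value(n_minus_1, w) / math.pi
```

For this model `n_from_k` reproduces `1 + n_minus_1(w)` and `k_from_n` reproduces `k(w)` closely. With measured data the difficulty noted in the text remains: the integrals need values over the whole frequency range, which here is supplied by the model itself.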


THE RELATION BETWEEN $\epsilon$ AND THE INTERBAND
ABSORPTION IN A METAL

In a metal the core polarization generally leads to a real contribution to $\epsilon^0$, and Eq. (K.5) gives, for the imaginary part,

$$\mathrm{Im}\,\epsilon = \frac{4\pi}{\omega}\,\mathrm{Re}\,\sigma. \tag{K.8}$$


3 This result is proved in most texts on electromagnetism. See, for example, L. D. Landau and E. M. Lifshitz, Electrodynamics of Continuous Media, Addison-Wesley, Reading, Mass., 1960, p. 274. The quantities $E^i$ and $E^r$ are the amplitudes of the incident and reflected electric fields. The square root in (K.7) is the one yielding non-negative $n$.

4 Landau and Lifshitz, op. cit., p. 259. (The $P$ indicates a principal value integral.)

5 Ibid., pp. 272-277.




In the semiclassical theory the AC conductivity (13.34) has a real part only by virtue of collisions. In the absence of collisions the semiclassical conductivity is purely imaginary and $\epsilon$ entirely real, signifying no dissipation of electromagnetic energy within the metal. If, however, we use the correction to the semiclassical result (13.37), then the conductivity has a real part even in the absence of collisions, and we find that6


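$$\mathrm{Im}\,\epsilon = \frac{4\pi^2 e^2}{\omega}\int \frac{d\mathbf{k}}{(2\pi)^3}\sum_{nn'} D_{nn'}(\mathbf{k})\,\delta\!\left(\frac{\mathcal{E}_n(\mathbf{k}) - \mathcal{E}_{n'}(\mathbf{k})}{\hbar} - \omega\right), \tag{K.9}$$

where

$$D_{nn'}(\mathbf{k}) = \frac{f(\mathcal{E}_{n'}(\mathbf{k})) - f(\mathcal{E}_n(\mathbf{k}))}{\mathcal{E}_n(\mathbf{k}) - \mathcal{E}_{n'}(\mathbf{k})}\;\frac{\hbar^2}{3m^2}\,\bigl|\langle n\mathbf{k}|\nabla|n'\mathbf{k}\rangle\bigr|^2. \tag{K.10}$$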


The quantity $D_{nn'}(\mathbf{k})$ is nonnegative, and negligibly small (when $k_BT \ll \mathcal{E}_F$) unless one of the pair of levels $n\mathbf{k}$, $n'\mathbf{k}$ is occupied and the other is unoccupied. Therefore the additional contribution to the real (absorptive) part of the conductivity arises whenever $\hbar\omega$ is equal to the energy difference between two levels with the same $\mathbf{k}$, one occupied and the other empty. This is precisely the condition for interband absorption suggested by intuitive arguments about photons (see page 293).


6 We specialize to the case of a metal with cubic symmetry. The frequency appearing in the denominator of (13.37) is to be evaluated in the limit of a vanishingly small positive imaginary part $\eta$, and we use the identity:

$$\lim_{\eta \to 0}\,\mathrm{Im}\,\frac{1}{x + i\eta} = -\pi\delta(x).$$

Quantum Theory of the
Harmonic Crystal

We  first  summarize  the  quantum  theory  of  a  single  (one-dimensional)  harmonic 
oscillator  with  Hamiltonian 

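$$h = \frac{p^2}{2m} + \frac{1}{2}m\omega^2 q^2. \tag{L.1}$$

The structure of this Hamiltonian is simplified by defining the “lowering” operator

$$a = \sqrt{\frac{m\omega}{2\hbar}}\,q + i\sqrt{\frac{1}{2\hbar m\omega}}\,p, \tag{L.2}$$

and its adjoint, the “raising” operator

$$a^\dagger = \sqrt{\frac{m\omega}{2\hbar}}\,q - i\sqrt{\frac{1}{2\hbar m\omega}}\,p. \tag{L.3}$$

The canonical commutation relations $[q, p] = i\hbar$ imply that

$$[a, a^\dagger] = 1. \tag{L.4}$$

If the Hamiltonian is expressed in terms of $a$ and $a^\dagger$ rather than $q$ and $p$, it takes on the simple form:

$$h = \hbar\omega\,(a^\dagger a + \tfrac{1}{2}). \tag{L.5}$$

It is not difficult to show1 that the commutation relations (L.4) imply that the eigenvalues of (L.5) are of the form $(n + \frac{1}{2})\hbar\omega$, where $n = 0, 1, 2, \ldots$. If the ground state of $h$ is denoted by $|0\rangle$, then the $n$th excited state $|n\rangle$ is

$$|n\rangle = \frac{1}{\sqrt{n!}}\,(a^\dagger)^n|0\rangle, \tag{L.6}$$

and satisfies

$$a^\dagger a|n\rangle = n|n\rangle, \qquad h|n\rangle = (n + \tfrac{1}{2})\hbar\omega|n\rangle. \tag{L.7}$$

The matrix elements of $a$ and $a^\dagger$ in this complete set of states are given by:

$$\langle n'|a|n\rangle = 0, \quad n' \neq n - 1,$$
$$\langle n - 1|a|n\rangle = \sqrt{n},$$
$$\langle n'|a^\dagger|n\rangle = \langle n|a|n'\rangle. \tag{L.8}$$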

All  these  results  follow  in  a  straightforward  way  from  Eqs.  (L.4)  and  (L.5). 
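The matrix elements above can be written out as explicit matrices and the operator relations checked directly. The sketch below is an illustration in a truncated basis: the cutoff `D` is a hypothetical choice, and because the basis is cut off the commutator comes out as the unit matrix everywhere except in its last diagonal entry.

```python
import math

D = 6  # truncation: keep the states |0>, ..., |D-1>

# a[r][c] = <r|a|c>, nonzero only for r = c - 1, where it equals sqrt(c).
a = [[math.sqrt(c) if r == c - 1 else 0.0 for c in range(D)] for r in range(D)]
# The matrix is real, so the adjoint is just the transpose.
a_dag = [[a[c][r] for c in range(D)] for r in range(D)]

def matmul(x, y):
    """Product of two D-by-D matrices stored as nested lists."""
    return [[sum(x[i][m] * y[m][j] for m in range(D)) for j in range(D)] for i in range(D)]

number = matmul(a_dag, a)            # should be diagonal with entries 0, 1, 2, ...
aa_dag = matmul(a, a_dag)
commutator = [[aa_dag[i][j] - number[i][j] for j in range(D)] for i in range(D)]

# Eigenvalues of h in units of hbar*omega: n + 1/2.
energies = [number[n][n] + 0.5 for n in range(D)]
```

The number operator is diagonal with entries $0, 1, 2, \ldots$, the energies are $n + \frac{1}{2}$ in units of $\hbar\omega$, and the one anomalous commutator entry (equal to $-(D-1)$) is purely an artifact of truncating an infinite-dimensional space.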

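The procedure in the case of a harmonic crystal is similar. The Hamiltonian is now given by Eq. (23.2).2 Let $\omega_s(\mathbf{k})$ and $\boldsymbol{\epsilon}_s(\mathbf{k})$ be the frequency and polarization vector for the classical normal mode with polarization $s$ and wave vector $\mathbf{k}$, as described on page 439. In analogy to (L.2) we now define3 the “phonon annihilation operator”

$$a_{\mathbf{k}s} = \frac{1}{\sqrt{N}}\sum_{\mathbf{R}} e^{-i\mathbf{k}\cdot\mathbf{R}}\,\boldsymbol{\epsilon}_s(\mathbf{k})\cdot\left[\sqrt{\frac{M\omega_s(\mathbf{k})}{2\hbar}}\,\mathbf{u}(\mathbf{R}) + i\sqrt{\frac{1}{2\hbar M\omega_s(\mathbf{k})}}\,\mathbf{P}(\mathbf{R})\right], \tag{L.9}$$

and its adjoint, the “phonon creation operator”

$$a^\dagger_{\mathbf{k}s} = \frac{1}{\sqrt{N}}\sum_{\mathbf{R}} e^{i\mathbf{k}\cdot\mathbf{R}}\,\boldsymbol{\epsilon}_s(\mathbf{k})\cdot\left[\sqrt{\frac{M\omega_s(\mathbf{k})}{2\hbar}}\,\mathbf{u}(\mathbf{R}) - i\sqrt{\frac{1}{2\hbar M\omega_s(\mathbf{k})}}\,\mathbf{P}(\mathbf{R})\right]. \tag{L.10}$$

1 See, for example, D. Park, Introduction to the Quantum Theory, McGraw-Hill, New York, 1964, p. 110.

2 We summarize the derivation only for monatomic Bravais lattices, stating below how the conclusions are to be generalized when there is a polyatomic basis.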

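The canonical commutation relations

$$[u_\mu(\mathbf{R}), P_\nu(\mathbf{R}')] = i\hbar\,\delta_{\mu\nu}\,\delta_{\mathbf{R},\mathbf{R}'},$$
$$[u_\mu(\mathbf{R}), u_\nu(\mathbf{R}')] = [P_\mu(\mathbf{R}), P_\nu(\mathbf{R}')] = 0, \tag{L.11}$$

the identity4

$$\sum_{\mathbf{R}} e^{i\mathbf{k}\cdot\mathbf{R}} = \begin{cases} 0, & \mathbf{k}\ \text{not a reciprocal lattice vector}, \\ N, & \mathbf{k}\ \text{a reciprocal lattice vector}, \end{cases} \tag{L.12}$$

and the orthonormality of the polarization vectors (Eq. (22.61)) yield the commutation relations

$$[a_{\mathbf{k}s}, a^\dagger_{\mathbf{k}'s'}] = \delta_{\mathbf{k}\mathbf{k}'}\,\delta_{ss'}, \qquad [a_{\mathbf{k}s}, a_{\mathbf{k}'s'}] = [a^\dagger_{\mathbf{k}s}, a^\dagger_{\mathbf{k}'s'}] = 0, \tag{L.13}$$

which are analogous to (L.4).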

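One can invert (L.9) to express the original coordinates and momenta in terms of the $a_{\mathbf{k}s}$ and $a^\dagger_{\mathbf{k}s}$:

$$\mathbf{u}(\mathbf{R}) = \frac{1}{\sqrt{N}}\sum_{\mathbf{k}s}\sqrt{\frac{\hbar}{2M\omega_s(\mathbf{k})}}\,(a_{\mathbf{k}s} + a^\dagger_{-\mathbf{k}s})\,\boldsymbol{\epsilon}_s(\mathbf{k})\,e^{i\mathbf{k}\cdot\mathbf{R}},$$
$$\mathbf{P}(\mathbf{R}) = \frac{-i}{\sqrt{N}}\sum_{\mathbf{k}s}\sqrt{\frac{\hbar M\omega_s(\mathbf{k})}{2}}\,(a_{\mathbf{k}s} - a^\dagger_{-\mathbf{k}s})\,\boldsymbol{\epsilon}_s(\mathbf{k})\,e^{i\mathbf{k}\cdot\mathbf{R}}. \tag{L.14}$$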


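Equation (L.14) can be verified by direct substitution of (L.9) and (L.10), and by use of the “completeness relation” that holds for any complete set of real orthogonal vectors,

$$\sum_{s=1}^{3} [\boldsymbol{\epsilon}_s(\mathbf{k})]_\mu\,[\boldsymbol{\epsilon}_s(\mathbf{k})]_\nu = \delta_{\mu\nu}, \tag{L.15}$$

together with the identity4,5

$$\sum_{\mathbf{k}} e^{i\mathbf{k}\cdot\mathbf{R}} = 0, \qquad \mathbf{R} \neq 0. \tag{L.16}$$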
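Both lattice-sum identities used above are easy to confirm on a small example. The sketch below assumes a one-dimensional chain of `N` sites with lattice constant `a` and the `N` wave vectors allowed by periodic boundary conditions; these specific choices are illustrative and are not taken from the text.

```python
import cmath
import math

N = 8        # number of sites in the chain (illustrative)
a = 1.0      # lattice constant (illustrative)

sites = [n * a for n in range(N)]
# Wave vectors allowed by periodic boundary conditions, one per site.
allowed_k = [2.0 * math.pi * m / (N * a) for m in range(N)]

def sum_over_sites(k):
    """Sum of exp(i k R) over all lattice sites R."""
    return sum(cmath.exp(1j * k * R) for R in sites)

def sum_over_k(R):
    """Sum of exp(i k R) over all allowed wave vectors k."""
    return sum(cmath.exp(1j * k * R) for k in allowed_k)

at_zero = sum_over_sites(0.0)                        # k = 0 is a reciprocal lattice vector
at_reciprocal = sum_over_sites(2.0 * math.pi / a)    # so is k = 2*pi/a
at_generic = sum_over_sites(allowed_k[3])            # not a reciprocal lattice vector
k_sum_nonzero_R = sum_over_k(3 * a)                  # R != 0
```

The site sum equals `N` when `k` is a reciprocal lattice vector and vanishes for the other allowed wave vectors, while the sum over wave vectors vanishes for any nonzero lattice vector within the chain.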


3 If $\omega_s(\mathbf{k}) = 0$, this definition fails. The problem occurs for only three of the normal modes out of $3N$ (the $\mathbf{k} = 0$ acoustic modes) and can usually be ignored. It is a reflection of the fact that the three degrees of freedom describing translations of the entire crystal as a whole cannot be described as oscillator degrees of freedom. Only in problems in which one wishes to consider translations of the crystal as a whole, or the total momentum of the crystal, does it become important to treat these degrees of freedom correctly as well. The problem is discussed further in Appendix M.

4 See Appendix F.

5 Use has also been made of the fact that in a monatomic Bravais lattice $\omega_s(\mathbf{k}) = \omega_s(-\mathbf{k})$, and $\boldsymbol{\epsilon}_s(\mathbf{k}) = \boldsymbol{\epsilon}_s(-\mathbf{k})$.

We can express the harmonic Hamiltonian in terms of the new oscillator variables
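by substituting (L.14) into (23.2). If the identity (L.16) and the orthonormality of the polarization vectors of a given $\mathbf{k}$ are used, it can be shown that the kinetic energy is given by:

$$\frac{1}{2M}\sum_{\mathbf{R}} \mathbf{P}(\mathbf{R})^2 = \frac{1}{4}\sum_{\mathbf{k}s} \hbar\omega_s(\mathbf{k})\,(a_{\mathbf{k}s} - a^\dagger_{-\mathbf{k}s})(a^\dagger_{\mathbf{k}s} - a_{-\mathbf{k}s}). \tag{L.17}$$

The potential energy takes on a similar structure when one exploits the fact that the polarization vectors are eigenvectors of the dynamical matrix $\mathbf{D}(\mathbf{k})$ (see (22.57)):

$$U = \frac{1}{4}\sum_{\mathbf{k}s} \hbar\omega_s(\mathbf{k})\,(a_{\mathbf{k}s} + a^\dagger_{-\mathbf{k}s})(a_{-\mathbf{k}s} + a^\dagger_{\mathbf{k}s}). \tag{L.18}$$

Adding these together, we find that

$$H = \frac{1}{2}\sum_{\mathbf{k}s} \hbar\omega_s(\mathbf{k})\,(a_{\mathbf{k}s}a^\dagger_{\mathbf{k}s} + a^\dagger_{\mathbf{k}s}a_{\mathbf{k}s}), \tag{L.19}$$

or, using the commutation relations (L.13),

$$H = \sum_{\mathbf{k}s} \hbar\omega_s(\mathbf{k})\,(a^\dagger_{\mathbf{k}s}a_{\mathbf{k}s} + \tfrac{1}{2}). \tag{L.20}$$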

This is nothing more than the sum of $3N$ independent oscillator Hamiltonians, one for each wave vector and polarization. When a Hamiltonian separates into a sum of commuting sub-Hamiltonians, its eigenstates are simply all products of the eigenstates of each of the separate sub-Hamiltonians, and the corresponding eigenvalues are the sum of the individual eigenvalues of the sub-Hamiltonians. We can therefore specify an eigenstate of $H$ by giving a set of $3N$ quantum numbers $n_{\mathbf{k}s}$, one for each of the $3N$ independent oscillator Hamiltonians $\hbar\omega_s(\mathbf{k})(a^\dagger_{\mathbf{k}s}a_{\mathbf{k}s} + \frac{1}{2})$. The energy of such a state is just

$$E = \sum_{\mathbf{k}s} (n_{\mathbf{k}s} + \tfrac{1}{2})\,\hbar\omega_s(\mathbf{k}). \tag{L.21}$$
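As a concrete instance of this energy formula, the sketch below evaluates it for a short one-dimensional monatomic chain. Several things are assumptions made for the illustration: the nearest-neighbor dispersion $\omega(k) = 2\sqrt{K/M}\,|\sin(ka/2)|$ is quoted rather than derived here, there is a single polarization (so $N$ modes instead of $3N$), and units are chosen with $\hbar = K = M = a = 1$.

```python
import math

def chain_frequencies(n_sites, spring=1.0, mass=1.0, a=1.0):
    """Normal-mode frequencies of a 1D nearest-neighbor chain (assumed dispersion)."""
    ks = [2.0 * math.pi * m / (n_sites * a) for m in range(n_sites)]
    return [2.0 * math.sqrt(spring / mass) * abs(math.sin(k * a / 2.0)) for k in ks]

def energy(occupations, freqs, hbar=1.0):
    """Sum over modes of (n + 1/2) * hbar * omega for the given occupation numbers."""
    return sum((n + 0.5) * hbar * w for n, w in zip(occupations, freqs))

freqs = chain_frequencies(4)
zero_point = energy([0, 0, 0, 0], freqs)
excited = energy([0, 2, 1, 0], freqs)     # two phonons in mode 1, one in mode 2
excitation_energy = excited - zero_point
```

The $k = 0$ mode has zero frequency and contributes nothing, in line with the footnote on the uniform translation modes; the excitation energy above the zero-point value is just the sum of $n\hbar\omega$ over the occupied modes.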

In many applications (such as those in Chapter 23) it is necessary to know only the form (L.21) of the eigenvalues of $H$. However, in problems involving the interaction of lattice vibrations with external radiation or with each other (i.e., in problems where anharmonic terms are important) it is essential to use the relations (L.14), for it is the $\mathbf{u}$’s and $\mathbf{P}$’s in terms of which physical interactions are simply expressed, but the $a$’s and $a^\dagger$’s that have simple matrix elements in the harmonic stationary states.

A  similar  procedure  is  used  to  transform  the  Hamiltonian  for  a  lattice  with  a 
polyatomic  basis.  We  quote  here  only  the  final  result: 

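The definitions (L.9) and (L.10) (which now define the $a_{\mathbf{k}s}$ and $a^\dagger_{\mathbf{k}s}$ for $s = 1, \ldots, 3p$, where $p$ is the number of ions in the basis) remain valid provided one makes the replacements

$$\mathbf{u}(\mathbf{R}) \to \mathbf{u}^i(\mathbf{R}),$$
$$\mathbf{P}(\mathbf{R}) \to \mathbf{P}^i(\mathbf{R}),$$
$$M \to M^i,$$
$$\boldsymbol{\epsilon}_s(\mathbf{k}) \to \sqrt{M^i}\,\boldsymbol{\epsilon}^i_s(\mathbf{k})\ \text{in (L.9)},$$
$$\boldsymbol{\epsilon}_s(\mathbf{k}) \to \sqrt{M^i}\,\boldsymbol{\epsilon}^i_s(\mathbf{k})^*\ \text{in (L.10)}, \tag{L.22}$$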

and sums over the index $i$ (which specifies the type of basis ion). The $\boldsymbol{\epsilon}^i_s(\mathbf{k})$ are now the polarization vectors of the classical normal modes as defined in Eq. (22.67), obeying the orthonormality relation (22.68), the completeness relation

$$\sum_{s=1}^{3p} [\boldsymbol{\epsilon}^i_s(\mathbf{k})^*]_\mu\,[\boldsymbol{\epsilon}^j_s(\mathbf{k})]_\nu = \frac{1}{M^i}\,\delta_{ij}\,\delta_{\mu\nu}, \tag{L.23}$$

and the condition6

$$\boldsymbol{\epsilon}^i_s(-\mathbf{k}) = \boldsymbol{\epsilon}^i_s(\mathbf{k})^*. \tag{L.24}$$

The  inversion  formulas  (L.14)  remain  valid  provided  the  replacements7  specified  in 
(L.22)  are  made,  and  the  commutation  relations  (L.13)  and  the  form  (L.20)  of  the 
harmonic  Hamiltonian  are  unchanged. 


6 Together with the condition $\omega_s(\mathbf{k}) = \omega_s(-\mathbf{k})$ this holds quite generally. However, in the polyatomic case the polarization vectors are not, in general, real.

7 The first of the two prescriptions for the polarization vectors is to be used in generalizing both of equations (L.14).


Appendix  M 


Conservation  of  Crystal  Momentum 

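Associated with every symmetry of a Hamiltonian is a conservation law. The Hamiltonian of a crystal possesses a symmetry, closely related to the translational symmetry of the Bravais lattice, that leads to a very general conservation law known as the conservation of crystal momentum. Consider the Hamiltonian:

$$H = \sum_{\mathbf{R}} \frac{\mathbf{P}(\mathbf{R})^2}{2M} + \frac{1}{2}\sum_{\mathbf{R},\mathbf{R}'} \phi[\mathbf{R} + \mathbf{u}(\mathbf{R}) - \mathbf{R}' - \mathbf{u}(\mathbf{R}')] + \sum_{i=1}^{n} \frac{\mathbf{p}_i^2}{2m_i} + \frac{1}{2}\sum_{i \neq j} v_{ij}(\mathbf{r}_i - \mathbf{r}_j) + \sum_{\mathbf{R},i} w_i(\mathbf{r}_i - \mathbf{R} - \mathbf{u}(\mathbf{R})). \tag{M.1}$$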

The  first  two  terms  are  the  Hamiltonian  of  the  ion  cores.  Note  that  we  have  not 
made  the  harmonic  approximation,1  but  have  represented  the  ion-ion  interaction  by 
a  general  sum  of  pair  potentials.2  The  next  two  terms  are  the  Hamiltonian  of  n 
additional  particles,  and  the  last  term  gives  the  interaction  of  these  particles  with 
the  ions.  To  keep  the  discussion  general,  we  do  not  specify  the  nature  of  the  n  particles, 
though  the  following  possibilities  are  of  interest: 

1.  If  n  =  0,  we  are  discussing  an  isolated  insulating  crystal. 

2. If $n = 1$, we can apply the discussion to the scattering of a single particle, say a neutron, by an insulating crystal.

3. If we wish to discuss an isolated metal, we can let the $n$ particles be the conduction electrons ($n \approx 10^{23}$), in which case all the $m_i$ would be the electron mass $m$ and the $v_{ij}(\mathbf{r})$ would all be the same function of $\mathbf{r}$.

4.  We  can  let  the  n  particles  be  the  conduction  electrons  and  an  incident  external 
particle,  if  we  wish  to  discuss  neutron  scattering  by  a  metal. 

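Note that the Hamiltonian (M.1) is invariant under the uniform translation:

$$\mathbf{r}_i \to \mathbf{r}_i + \mathbf{r}_0, \quad i = 1, \ldots, n,$$
$$\mathbf{u}(\mathbf{R}) \to \mathbf{u}(\mathbf{R}) + \mathbf{r}_0, \quad \text{for all } \mathbf{R}. \tag{M.2}$$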

This familiar symmetry leads to conservation of the total momentum of the ions and particles, and is not the symmetry we are interested in. Crystal momentum conservation arises from the fact that when the translation vector $\mathbf{r}_0$ is a Bravais lattice vector $\mathbf{R}_0$, then it is possible to simulate the translation of the ions by a simple permutation of the ionic variables; i.e., the Hamiltonian (M.1) is also invariant under the transformation:

$$\mathbf{r}_i \to \mathbf{r}_i + \mathbf{R}_0, \quad i = 1, \ldots, n,$$
$$\mathbf{u}(\mathbf{R}) \to \mathbf{u}(\mathbf{R} - \mathbf{R}_0), \quad \mathbf{P}(\mathbf{R}) \to \mathbf{P}(\mathbf{R} - \mathbf{R}_0), \quad \text{for all } \mathbf{R}, \tag{M.3}$$

as can be verified explicitly by direct substitution into (M.1).

1 We shall, however, be introducing operators defined in terms of the phonon operators $a$ and $a^\dagger$ (Appendix L). These will not be well defined if the system is so unlike a harmonic crystal with Bravais lattice $\{\mathbf{R}\}$ as to lie in a completely different Hilbert space. Thus although the procedure we shall follow makes formal sense for any system (e.g., a liquid, or a crystal with a Bravais lattice different from that specified by the $\{\mathbf{R}\}$), the conclusions can be meaningfully applied only to the case of a crystal with the Bravais lattice $\{\mathbf{R}\}$.

2 Even the assumption of pair interactions is unnecessary. It is made only to give $H$ a concrete form that is not too complex to obscure the argument.

To emphasize the difference between symmetry (M.2) (with $\mathbf{r}_0 = \mathbf{R}_0$) and symmetry (M.3), consider the following two symmetry-breaking terms one might add to the Hamiltonian (M.1):

1. We might add a term:

$$\frac{1}{2}K\sum_{\mathbf{R}} [\mathbf{u}(\mathbf{R})]^2, \tag{M.4}$$

which ties each ion to its equilibrium site with a harmonic spring. This term destroys the translational symmetry (M.2) of the Hamiltonian, and momentum will not be conserved in its presence. However, the term (M.4) is invariant under the permutation symmetry (M.3), so its addition will not destroy the conservation of crystal momentum.

2. Suppose, on the other hand, we alter the Hamiltonian (M.1) so as to maintain the translational symmetry but destroy the permutation symmetry. We might, for example, give each ion a different mass, replacing the ionic kinetic-energy term by:

$$\sum_{\mathbf{R}} \frac{\mathbf{P}(\mathbf{R})^2}{2M(\mathbf{R})}. \tag{M.5}$$

The resulting Hamiltonian will continue to be invariant under the spatial translation (M.2) so total momentum will continue to be conserved. However, it will no longer be invariant under the permutation (M.3), and crystal momentum will not be conserved.
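The contrast between the two symmetries can be seen on a small classical example. The sketch below is a simplified stand-in for the Hamiltonian above, under stated assumptions: a one-dimensional ring of five ions, no extra particles ($n = 0$), a nearest-neighbor pair potential `phi` of a hypothetical anharmonic form, and arbitrary sample displacements and momenta. Masses are attached to the sites and are not permuted.

```python
import math

A_LAT = 1.0   # lattice constant, in arbitrary units

def phi(x):
    """Hypothetical anharmonic pair potential with its minimum at x = A_LAT."""
    return (1.0 - math.exp(-(x - A_LAT))) ** 2

def hamiltonian(u, p, masses, spring=0.0):
    """Ring of ions with nearest-neighbor pair potential and optional on-site springs."""
    n = len(u)
    kinetic = sum(p[j] ** 2 / (2.0 * masses[j]) for j in range(n))
    pair = sum(phi(A_LAT + u[(j + 1) % n] - u[j]) for j in range(n))
    onsite = 0.5 * spring * sum(x * x for x in u)
    return kinetic + pair + onsite

def translate(u, r0):
    """Uniform translation of every ionic displacement by r0."""
    return [x + r0 for x in u]

def permute(values, shift):
    """Permutation: the value at site j becomes the old value at site j - shift."""
    n = len(values)
    return [values[(j - shift) % n] for j in range(n)]

u = [0.05, -0.02, 0.07, 0.01, -0.04]
p = [0.3, -0.1, 0.2, -0.25, 0.05]
equal = [1.0] * 5
unequal = [1.0, 2.0, 1.5, 0.7, 1.2]

# Unmodified Hamiltonian: both symmetries hold.
h0 = hamiltonian(u, p, equal)
h_translated = hamiltonian(translate(u, 0.3), p, equal)
h_permuted = hamiltonian(permute(u, 1), permute(p, 1), equal)

# On-site springs: translation symmetry is lost, permutation symmetry survives.
h0_spring = hamiltonian(u, p, equal, spring=2.0)
h_spring_translated = hamiltonian(translate(u, 0.3), p, equal, spring=2.0)
h_spring_permuted = hamiltonian(permute(u, 1), permute(p, 1), equal, spring=2.0)

# Site-dependent masses: translation symmetry survives, permutation symmetry is lost.
h0_mass = hamiltonian(u, p, unequal)
h_mass_translated = hamiltonian(translate(u, 0.3), p, unequal)
h_mass_permuted = hamiltonian(permute(u, 1), permute(p, 1), unequal)
```

The three groups of numbers mirror the three cases in the text: the plain Hamiltonian is unchanged by both operations, the spring term changes the energy under translation but not under permutation, and unequal masses do the reverse.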


Of  the  two  conservation  laws,  crystal  momentum  conservation  is  by  far  the  more 
important.  In  practice,  crystals  are  not  free  to  recoil  as  a  whole,  and  even  if  they  were, 
the  minute  changes  in  the  total  momentum  of  the  entire  crystal  produced  by  the 
scattering  of  a  single  neutron  would  be  impossible  to  measure  directly. 


DERIVATION  OF  THE  CONSERVATION  LAW 


To derive the conservation law implied by the symmetry (M.3) we must describe the quantum-mechanical operators that produce this transformation. That part of the transformation affecting the particle coordinates, $\mathbf{r}_i \to \mathbf{r}_i + \mathbf{R}_0$, is effected by the particle translational operator $T_{\mathbf{R}_0}$ (see Chapter 8). It is a fundamental result of quantum mechanics that this transformation can be written as a unitary transformation involving the total momentum operator for the particles:3

$$\mathbf{r}_i \to \mathbf{r}_i + \mathbf{R}_0 = T_{\mathbf{R}_0}\,\mathbf{r}_i\,T_{\mathbf{R}_0}^{-1} = e^{(i/\hbar)\mathbf{P}\cdot\mathbf{R}_0}\,\mathbf{r}_i\,e^{-(i/\hbar)\mathbf{P}\cdot\mathbf{R}_0}, \qquad \mathbf{P} = \sum_{i=1}^{n} \mathbf{p}_i. \tag{M.6}$$


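3 See, for example, K. Gottfried, Quantum Mechanics, vol. I, W. A. Benjamin, Menlo Park, Calif., 1966, p. 245.

In addition, we need the operator that produces the transformation (M.3) on the ionic variables. Crystal momentum conservation takes on a simple form primarily because this transformation has a structure very much like Eq. (M.6):

$$\mathbf{u}(\mathbf{R}) \to \mathbf{u}(\mathbf{R} - \mathbf{R}_0) = \mathcal{T}_{\mathbf{R}_0}\,\mathbf{u}(\mathbf{R})\,\mathcal{T}_{\mathbf{R}_0}^{-1} = e^{i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}\,\mathbf{u}(\mathbf{R})\,e^{-i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0},$$
$$\mathbf{P}(\mathbf{R}) \to \mathbf{P}(\mathbf{R} - \mathbf{R}_0) = \mathcal{T}_{\mathbf{R}_0}\,\mathbf{P}(\mathbf{R})\,\mathcal{T}_{\mathbf{R}_0}^{-1} = e^{i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}\,\mathbf{P}(\mathbf{R})\,e^{-i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}. \tag{M.7}$$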

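The operator $\boldsymbol{\mathcal{K}}$ is not related in any way to the total momentum operator $\mathbf{P} = \sum \mathbf{P}(\mathbf{R})$ for the ions, but is specified4 by taking the eigenstates of $\boldsymbol{\mathcal{K}}$ to be the eigenstates of the harmonic part of the ion-ion Hamiltonian, and its eigenvalue in a state with phonon occupation numbers $n_{\mathbf{k}s}$ to be given by

$$\boldsymbol{\mathcal{K}}\,|\{n_{\mathbf{k}s}\}\rangle = \Bigl(\sum_{\mathbf{k}s} \mathbf{k}\,n_{\mathbf{k}s}\Bigr)|\{n_{\mathbf{k}s}\}\rangle. \tag{M.8}$$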


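To verify that the operator $\boldsymbol{\mathcal{K}}$ defined by (M.8) does produce the transformation (M.7), we must use the representation (L.14) that gives $\mathbf{u}(\mathbf{R})$ and $\mathbf{P}(\mathbf{R})$ in terms of the oscillator raising and lowering operators for each normal mode:

$$\mathbf{u}(\mathbf{R}) = \frac{1}{\sqrt{N}}\sum_{\mathbf{k}s}\sqrt{\frac{\hbar}{2M\omega_s(\mathbf{k})}}\,(a_{\mathbf{k}s} + a^\dagger_{-\mathbf{k}s})\,e^{i\mathbf{k}\cdot\mathbf{R}}\,\boldsymbol{\epsilon}_s(\mathbf{k}). \tag{M.9}$$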


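(We consider explicitly only the $\mathbf{u}(\mathbf{R})$. The argument for the $\mathbf{P}(\mathbf{R})$ is virtually the same.) Since the only operators in (M.9) are $a_{\mathbf{k}s}$ and $a^\dagger_{-\mathbf{k}s}$, we have

$$e^{i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}\,\mathbf{u}(\mathbf{R})\,e^{-i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0} = \frac{1}{\sqrt{N}}\sum_{\mathbf{k}s}\sqrt{\frac{\hbar}{2M\omega_s(\mathbf{k})}}\,\bigl(e^{i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}\,a_{\mathbf{k}s}\,e^{-i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0} + e^{i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}\,a^\dagger_{-\mathbf{k}s}\,e^{-i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}\bigr)\,e^{i\mathbf{k}\cdot\mathbf{R}}\,\boldsymbol{\epsilon}_s(\mathbf{k}). \tag{M.10}$$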


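We will therefore have established (M.7) when we prove that

$$e^{i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}\,a_{\mathbf{k}s}\,e^{-i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0} = e^{-i\mathbf{k}\cdot\mathbf{R}_0}\,a_{\mathbf{k}s}, \qquad e^{i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}\,a^\dagger_{-\mathbf{k}s}\,e^{-i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0} = e^{-i\mathbf{k}\cdot\mathbf{R}_0}\,a^\dagger_{-\mathbf{k}s}, \tag{M.11}$$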


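for substitution of (M.11) into (M.10) reduces it back to the form (M.9) with $\mathbf{R}$ replaced by $\mathbf{R} - \mathbf{R}_0$.

Both results in (M.11) will follow if we can establish the single identity:

$$e^{i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}\,a^\dagger_{\mathbf{k}s}\,e^{-i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0} = e^{i\mathbf{k}\cdot\mathbf{R}_0}\,a^\dagger_{\mathbf{k}s}, \tag{M.12}$$

for the first of (M.11) is just the adjoint of (M.12) and the second follows from letting $\mathbf{k} \to -\mathbf{k}$. Equation (M.12) will be established if we can demonstrate that the operators on both sides of the equation have the same effect on a complete set of states, for they will then have the same effect on any linear combination of states from the complete set, and thus on any state whatsoever. We again choose the complete set to be the eigenstates of the harmonic Hamiltonian. The operator $a^\dagger_{\mathbf{k}s}$, being an oscillator raising operator for the $\mathbf{k}s$ normal mode, simply acts on a state with a particular set of phonon occupation numbers to produce (to within a normalization constant) a state in which the occupation number for the $\mathbf{k}s$ normal mode is increased by one, and all other occupation numbers remain the same. We may, to begin with, write

$$e^{i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}\,a^\dagger_{\mathbf{k}s}\,e^{-i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}\,|\{n_{\mathbf{k}'s'}\}\rangle = \exp\Bigl(-i\sum_{\mathbf{k}'s'} \mathbf{k}'n_{\mathbf{k}'s'}\cdot\mathbf{R}_0\Bigr)\,e^{i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}\,a^\dagger_{\mathbf{k}s}\,|\{n_{\mathbf{k}'s'}\}\rangle, \tag{M.13}$$

which simply exploits (M.8). Since the state $a^\dagger_{\mathbf{k}s}|\{n_{\mathbf{k}'s'}\}\rangle$ differs from $|\{n_{\mathbf{k}'s'}\}\rangle$ only in that the occupation number of mode $\mathbf{k}s$ is increased by one, it will also be an eigenstate of $\boldsymbol{\mathcal{K}}$ with eigenvalue $\sum \mathbf{k}'n_{\mathbf{k}'s'} + \mathbf{k}$. Therefore every term in the eigenvalue of $e^{i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}$ cancels the corresponding term in the eigenvalue of $e^{-i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}$ except for the single extra term $e^{i\mathbf{k}\cdot\mathbf{R}_0}$, and we have:

$$\exp\Bigl(-i\sum_{\mathbf{k}'s'} \mathbf{k}'n_{\mathbf{k}'s'}\cdot\mathbf{R}_0\Bigr)\,e^{i\boldsymbol{\mathcal{K}}\cdot\mathbf{R}_0}\,a^\dagger_{\mathbf{k}s}\,|\{n_{\mathbf{k}'s'}\}\rangle = e^{i\mathbf{k}\cdot\mathbf{R}_0}\,a^\dagger_{\mathbf{k}s}\,|\{n_{\mathbf{k}'s'}\}\rangle. \tag{M.14}$$

Equations (M.14) and (M.13) establish (M.12), and therefore $\boldsymbol{\mathcal{K}}$ gives the desired transformation. The operator $\hbar\boldsymbol{\mathcal{K}}$ is called the crystal momentum operator.

4 An operator is defined by giving a complete set of eigenstates and the corresponding eigenvalues, since any state can be written as a linear combination of eigenstates. Note that here the subtle assumption that the solid is actually crystalline with Bravais lattice $\{\mathbf{R}\}$ appears; if it were not, its states could not be written as linear combinations of eigenstates of a harmonic crystal with Bravais lattice $\{\mathbf{R}\}$.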


APPLICATIONS 

We  illustrate  the  workings  of  crystal  momentum  conservation  in  several  cases: 

1. Isolated Insulator  If only the ions are present, then the invariance (M.3)
implies that the Hamiltonian commutes with the operator $e^{i\mathcal{K}\cdot\mathbf{R}_0}$, since

$$
e^{i\mathcal{K}\cdot\mathbf{R}_0}\, H(\{\mathbf{u}(\mathbf{R}), \mathbf{P}(\mathbf{R})\})\, e^{-i\mathcal{K}\cdot\mathbf{R}_0}
= H(\{\mathbf{u}(\mathbf{R}-\mathbf{R}_0), \mathbf{P}(\mathbf{R}-\mathbf{R}_0)\}) = H,
\quad\text{or}\quad
e^{i\mathcal{K}\cdot\mathbf{R}_0} H = H e^{i\mathcal{K}\cdot\mathbf{R}_0}.
\tag{M.15}
$$

This means that the operator $\mathcal{T}_{\mathbf{R}_0} = e^{i\mathcal{K}\cdot\mathbf{R}_0}$ is a constant of the motion; i.e., if the
crystal is in an eigenstate of $\mathcal{T}_{\mathbf{R}_0}$ at time t = 0, it will remain in an eigenstate at all
subsequent times. In particular, suppose that the crystal is in an eigenstate of the
harmonic Hamiltonian at t = 0 with phonon occupation numbers $n_{\mathbf{k}s}$. Because the
full Hamiltonian is not harmonic, this will not be a stationary state. However, crystal
momentum conservation requires that it remain an eigenstate of $\mathcal{T}_{\mathbf{R}_0}$ for all Bravais
lattice vectors R0. This means that the state at future times can only be a linear
combination of eigenstates of the harmonic Hamiltonian with phonon occupation
numbers $n'_{\mathbf{k}s}$ leading to the same eigenvalue of $\mathcal{T}_{\mathbf{R}_0}$ as the original state:

$$
\exp\Bigl(i\sum \mathbf{k}\, n'_{\mathbf{k}s}\cdot\mathbf{R}_0\Bigr) = \exp\Bigl(i\sum \mathbf{k}\, n_{\mathbf{k}s}\cdot\mathbf{R}_0\Bigr).
\tag{M.16}
$$

Since this must hold for arbitrary Bravais lattice vectors R0 (the $\mathcal{T}_{\mathbf{R}_0}$ commute with
one another for different R0) we have

$$
\exp\Bigl\{i\Bigl[\sum (\mathbf{k}\, n'_{\mathbf{k}s} - \mathbf{k}\, n_{\mathbf{k}s})\cdot\mathbf{R}_0\Bigr]\Bigr\} = 1 \quad\text{for all } \mathbf{R}_0,
\tag{M.17}
$$

which requires that

$$
\sum \mathbf{k}\, n'_{\mathbf{k}s} = \sum \mathbf{k}\, n_{\mathbf{k}s} + \text{reciprocal lattice vector}.
\tag{M.18}
$$

Thus the total phonon wave vector in an anharmonic crystal is conserved to within
an additive reciprocal lattice vector.
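The content of (M.16)-(M.18) can be checked numerically in the simplest setting. The following is a minimal sketch for a one-dimensional chain; the lattice constant `a`, the function names, and the sample wave vectors are illustrative assumptions and do not appear in the text. It compares the eigenvalues exp(iK R0) for two total wave vectors over a range of lattice translations R0 = na.

```python
import cmath
import math


def translation_eigenvalue(total_k, r0):
    """Eigenvalue exp(i K R0) for a state with total phonon wave vector K (1D)."""
    return cmath.exp(1j * total_k * r0)


def conserved_mod_reciprocal(k_initial, k_final, a, n_cells=8, tol=1e-9):
    """True if exp(i K' R0) equals exp(i K R0) for R0 = n*a, n = 0..n_cells-1."""
    return all(
        abs(translation_eigenvalue(k_final, n * a)
            - translation_eigenvalue(k_initial, n * a)) < tol
        for n in range(n_cells)
    )


a = 1.0                      # hypothetical lattice constant
G = 2.0 * math.pi / a        # shortest reciprocal lattice vector of the chain
k0 = 0.3 * G                 # arbitrary initial total wave vector

# A change by a reciprocal lattice vector leaves every eigenvalue unchanged.
print(conserved_mod_reciprocal(k0, k0 + G, a))
# A change by half a reciprocal lattice vector does not.
print(conserved_mod_reciprocal(k0, k0 + 0.5 * G, a))
```

Only the differences of total wave vector that are multiples of 2π/a pass the test, which is the one-dimensional form of (M.18).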

2. Scattering of a Neutron by an Insulator  Suppose that at the start of the
experiment the crystal is in an eigenstate of the harmonic Hamiltonian with phonon
occupation numbers $n_{\mathbf{k}s}$ and the neutron is in a state with real momentum p, satisfying:

$$
T_{\mathbf{R}_0}\psi(\mathbf{r}) = e^{(i/\hbar)\mathbf{p}\cdot\mathbf{R}_0}\,\psi(\mathbf{r})
\quad (\text{i.e., } \psi(\mathbf{r}) = e^{(i/\hbar)\mathbf{p}\cdot\mathbf{r}}),
\tag{M.19}
$$

where $T_{\mathbf{R}}$ is the neutron translation operator. The invariance of the total neutron-ion
Hamiltonian under (M.3) implies that the product of the neutron translation and ion
permutation operators commutes with H for any R0:

$$
[T_{\mathbf{R}_0}\mathcal{T}_{\mathbf{R}_0}, H] = 0.
\tag{M.20}
$$

In the initial state $\Phi$ we have:

$$
T_{\mathbf{R}_0}\mathcal{T}_{\mathbf{R}_0}\Phi = \exp\Bigl[i\Bigl(\mathbf{p}/\hbar + \sum \mathbf{k}\, n_{\mathbf{k}s}\Bigr)\cdot\mathbf{R}_0\Bigr]\Phi,
\tag{M.21}
$$

and therefore subsequent states must continue to be eigenstates with the same
eigenvalue. They can therefore be represented as linear combinations of states in which
the neutron has momentum p' and the crystal has occupation numbers $n'_{\mathbf{k}s}$ with the
restriction that

$$
\mathbf{p}' + \hbar\sum \mathbf{k}\, n'_{\mathbf{k}s} = \mathbf{p} + \hbar\sum \mathbf{k}\, n_{\mathbf{k}s} + \hbar \times \text{reciprocal lattice vector}.
\tag{M.22}
$$

Thus the change in the momentum of the neutron must be balanced by a change in
the crystal momentum5 of the phonons, to within an additive reciprocal lattice vector
times $\hbar$.

3. Isolated Metal  If the particles are conduction electrons then we can consider
at t = 0 a state in which the electrons are in a specified set of Bloch levels. Now each
Bloch level (see Eq. (8.21)) is an eigenstate of the electron translation operator:

$$
T_{\mathbf{R}_0}\psi_{n\mathbf{k}}(\mathbf{r}) = e^{i\mathbf{k}\cdot\mathbf{R}_0}\,\psi_{n\mathbf{k}}(\mathbf{r}).
\tag{M.23}
$$

If, in addition, the crystal is in an eigenstate of the harmonic Hamiltonian at t = 0,
then the combined electron translation and ion permutation operator $T_{\mathbf{R}_0}\mathcal{T}_{\mathbf{R}_0}$ will have
the eigenvalue

$$
\exp\Bigl[i\Bigl(\mathbf{k}_e + \sum \mathbf{k}\, n_{\mathbf{k}s}\Bigr)\cdot\mathbf{R}_0\Bigr],
\tag{M.24}
$$

where $\mathbf{k}_e$ is the sum of the electronic wave vectors of all the occupied Bloch levels
(i.e., $\hbar\mathbf{k}_e$ is the total electronic crystal momentum). Since this operator commutes
with the electron-ion Hamiltonian, the metal must remain in an eigenstate at all
subsequent times. Therefore the change in the total electronic crystal momentum must
be compensated for by a change in the total ionic crystal momentum to within an additive
reciprocal lattice vector.

4. Scattering of a Neutron by a Metal  In the same way, we can deduce that
when neutrons are scattered by a metal, the change in the neutron momentum must be
balanced by a change in the total crystal momentum of the electrons and ions, to within
an additive reciprocal lattice vector times $\hbar$. Neutrons, however, interact only weakly
with electrons, and in practice it is only the lattice crystal momentum that changes.
This case is therefore essentially the same as case 2. Note, however, that X rays do
interact strongly with electrons, so crystal momentum can be lost to the electronic
system in X-ray scattering.

5  The eigenvalue $\hbar\sum \mathbf{k}\, n_{\mathbf{k}s}$ of the crystal momentum operator $\hbar\mathcal{K}$ is called the crystal
momentum (cf. page 472).
Theory of the Scattering of Neutrons by a Crystal

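Let a neutron with momentum p be scattered by a crystal and emerge with momentum
p'. We assume that the only degrees of freedom of the crystal are those associated with
ionic motion, that before the scattering the ions are in an eigenstate of the crystal
Hamiltonian with energy $E_i$, and that after the scattering the ions are in an eigenstate
of the crystal Hamiltonian with energy $E_f$. We describe the initial and final states
and energies of the composite neutron-ion system as follows:

$$
\begin{aligned}
\text{Before scattering:}\quad & \Psi_i = \psi_{\mathbf{p}}(\mathbf{r})\,\Phi_i, \qquad \psi_{\mathbf{p}}(\mathbf{r}) = \frac{1}{\sqrt{V}}\, e^{i\mathbf{p}\cdot\mathbf{r}/\hbar}, \\
& \mathcal{E}_i = E_i + p^2/2M_n; \\
\text{After scattering:}\quad & \Psi_f = \psi_{\mathbf{p}'}(\mathbf{r})\,\Phi_f, \qquad \psi_{\mathbf{p}'}(\mathbf{r}) = \frac{1}{\sqrt{V}}\, e^{i\mathbf{p}'\cdot\mathbf{r}/\hbar}, \\
& \mathcal{E}_f = E_f + p'^2/2M_n.
\end{aligned}
\tag{N.1}
$$

It is convenient to define variables $\omega$ and q in terms of the neutron energy gain and
momentum transfer:

$$
\hbar\omega = \frac{p'^2}{2M_n} - \frac{p^2}{2M_n}, \qquad \hbar\mathbf{q} = \mathbf{p}' - \mathbf{p}.
\tag{N.2}
$$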
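As a worked illustration of (N.2), the sketch below converts incident and final neutron energies and a scattering angle into the energy gain ħω and the magnitude of the wave vector transfer q. The function name, the choice of units (meV and inverse angstroms), and the physical constants are supplied here for illustration and are not part of the text.

```python
import math

HBAR = 1.054571817e-34      # reduced Planck constant, J s
M_N = 1.67492749804e-27     # neutron mass, kg
MEV = 1.602176634e-22       # joules per meV


def neutron_transfer(e_in_mev, e_out_mev, angle_deg):
    """Return (hbar*omega in meV, |q| in 1/angstrom) following Eq. (N.2)."""
    p_in = math.sqrt(2.0 * M_N * e_in_mev * MEV)
    p_out = math.sqrt(2.0 * M_N * e_out_mev * MEV)
    cos_t = math.cos(math.radians(angle_deg))
    # |p' - p|^2 from the law of cosines; clamp tiny negative rounding errors.
    dp2 = max(0.0, p_in * p_in + p_out * p_out - 2.0 * p_in * p_out * cos_t)
    hbar_omega = e_out_mev - e_in_mev        # neutron energy gain
    q = math.sqrt(dp2) / HBAR * 1e-10        # convert 1/m to 1/angstrom
    return hbar_omega, q


# Example: a neutron that loses 5 meV while scattering through 90 degrees.
print(neutron_transfer(20.0, 15.0, 90.0))
```

A negative ħω means the neutron has lost energy to the crystal, which is the sign convention used in the one-phonon results later in this appendix.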


We describe the neutron-ion interaction by

$$
V(\mathbf{r}) = \sum_{\mathbf{R}} v(\mathbf{r} - \mathbf{r}(\mathbf{R})) = \frac{1}{V}\sum_{\mathbf{k},\mathbf{R}} v_{\mathbf{k}}\, e^{i\mathbf{k}\cdot[\mathbf{r} - \mathbf{r}(\mathbf{R})]}.
\tag{N.3}
$$

Because the range of v is of order $10^{-13}$ cm (a typical nuclear dimension), its Fourier
components will vary on the scale of $k \approx 10^{13}$ cm$^{-1}$, and therefore be essentially
independent of k for wave vectors of order $10^{8}$ cm$^{-1}$, the relevant range for
experiments that measure phonon spectra. The constant $v_0$ is conventionally written in
terms of a length a, known as the scattering length, defined so that the total cross
section for scattering of a neutron by a single isolated ion is given in Born
approximation by $4\pi a^2$.1 Eq. (N.3) is thus written

$$
V(\mathbf{r}) = \frac{2\pi\hbar^2 a}{M_n}\,\frac{1}{V}\sum_{\mathbf{k},\mathbf{R}} e^{i\mathbf{k}\cdot[\mathbf{r} - \mathbf{r}(\mathbf{R})]}.
\tag{N.4}
$$


1  We  assume  that  the  nuclei  have  spin  zero  and  are  of  a  single  isotope.  In  general,  one  must  consider 
the  possibility  of  a  depending  on  the  nuclear  state.  This  leads  to  two  types  of  terms  in  the  cross  section: 
a  coherent  term,  which  has  the  form  of  the  cross  section  we  derive  below,  but  with  a  replaced  by  its 
mean  value,  and  an  additional  piece,  known  as  the  incoherent  term,  which  has  no  striking  energy  dependence 
and  contributes,  along  with  the  multiphonon  processes,  to  the  diffuse  background. 
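The probability per unit time for a neutron to scatter from p to p' by virtue of its
interaction with the ions is almost always calculated with the “golden rule” of
lowest-order time-dependent perturbation theory:2

$$
\begin{aligned}
P &= \sum_f \frac{2\pi}{\hbar}\,\delta(\mathcal{E}_f - \mathcal{E}_i)\,\bigl|(\Psi_i, V\Psi_f)\bigr|^2 \\
  &= \sum_f \frac{2\pi}{\hbar}\,\delta(E_f - E_i + \hbar\omega)\,\left|\frac{1}{V}\int d\mathbf{r}\, e^{i\mathbf{q}\cdot\mathbf{r}}\,(\Phi_i, V(\mathbf{r})\Phi_f)\right|^2 \\
  &= \frac{(2\pi\hbar)^3}{(M_n V)^2}\, a^2 \sum_f \delta(E_f - E_i + \hbar\omega)\,\left|\sum_{\mathbf{R}} (\Phi_i, e^{i\mathbf{q}\cdot\mathbf{r}(\mathbf{R})}\Phi_f)\right|^2.
\end{aligned}
\tag{N.5}
$$

The transition rate P is related to the measured cross section, $d\sigma/d\Omega\,dE$, by the fact
that the cross section, transition rate, and incident neutron flux,
$j = (p/M_n)|\psi_{\mathbf{p}}|^2 = (1/V)(p/M_n)$, satisfy3

$$
j\,\frac{d\sigma}{d\Omega\,dE}\,d\Omega\,dE
= \frac{p}{M_n V}\,\frac{d\sigma}{d\Omega\,dE}\,d\Omega\,dE
= \frac{P V\, d\mathbf{p}'}{(2\pi\hbar)^3}
= \frac{P V p'^2\, dp'\, d\Omega}{(2\pi\hbar)^3}
= \frac{P V M_n p'\, dE\, d\Omega}{(2\pi\hbar)^3}.
\tag{N.6}
$$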


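For a given initial state i (and all final states f compatible with the energy-conserving
$\delta$-function), Eqs. (N.5) and (N.6) give

$$
\frac{d\sigma}{d\Omega\,dE} = \frac{p'}{p}\,\frac{N a^2}{\hbar}\, S_i(\mathbf{q}, \omega),
\tag{N.7}
$$

where

$$
S_i(\mathbf{q}, \omega) = \frac{1}{N}\sum_f \delta\!\left(\frac{E_f - E_i}{\hbar} + \omega\right)\left|\sum_{\mathbf{R}} (\Phi_i, e^{i\mathbf{q}\cdot\mathbf{r}(\mathbf{R})}\Phi_f)\right|^2.
\tag{N.8}
$$

To evaluate $S_i$ we use the representation

$$
\delta(\omega) = \int_{-\infty}^{\infty} \frac{dt}{2\pi}\, e^{i\omega t},
\tag{N.9}
$$

and note that any operator A obeys the relation
$e^{i(E_f - E_i)t/\hbar}(\Phi_f, A\Phi_i) = (\Phi_f, A(t)\Phi_i)$,
where $A(t) = e^{iHt/\hbar} A\, e^{-iHt/\hbar}$. Furthermore, for any pair of operators A and B,

$$
\sum_f (\Phi_i, A\Phi_f)(\Phi_f, B\Phi_i) = (\Phi_i, AB\,\Phi_i).
\tag{N.10}
$$

Therefore

$$
S_i(\mathbf{q}, \omega) = \frac{1}{N}\int \frac{dt}{2\pi}\, e^{i\omega t} \sum_{\mathbf{R}\mathbf{R}'} e^{-i\mathbf{q}\cdot(\mathbf{R}-\mathbf{R}')}\,
(\Phi_i, \exp[i\mathbf{q}\cdot\mathbf{u}(\mathbf{R}')]\,\exp[-i\mathbf{q}\cdot\mathbf{u}(\mathbf{R}, t)]\,\Phi_i).
\tag{N.11}
$$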


2  See, for example, D. Park, Introduction to the Quantum Theory, McGraw-Hill, New York, 1964,
p. 244. The analysis of neutron scattering data relies heavily on this use of lowest-order perturbation
theory, i.e., on the Born approximation. Higher order perturbation theory produces so-called multiple
scattering corrections.

3  We use the fact that a volume element $d\mathbf{p}'$ contains $V\,d\mathbf{p}'/(2\pi\hbar)^3$ neutron states of a given spin. (The
argument is identical to that given in Chapter 2 for electrons.)
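Generally the crystal will be in thermal equilibrium, and we must therefore average
the cross section for the given i over a Maxwell-Boltzmann distribution of
equilibrium states. This requires us to replace $S_i$ by its thermal average

$$
S(\mathbf{q}, \omega) = \frac{1}{N}\sum_{\mathbf{R}\mathbf{R}'} e^{-i\mathbf{q}\cdot(\mathbf{R}-\mathbf{R}')} \int \frac{dt}{2\pi}\, e^{i\omega t}\,
\langle \exp[i\mathbf{q}\cdot\mathbf{u}(\mathbf{R}')]\,\exp[-i\mathbf{q}\cdot\mathbf{u}(\mathbf{R}, t)] \rangle,
\tag{N.12}
$$

where

$$
\langle A \rangle = \frac{\sum_i e^{-E_i/k_B T}\,(\Phi_i, A\Phi_i)}{\sum_i e^{-E_i/k_B T}}.
\tag{N.13}
$$

Finally,

$$
\frac{d\sigma}{d\Omega\,dE} = \frac{p'}{p}\,\frac{N a^2}{\hbar}\, S(\mathbf{q}, \omega).
\tag{N.14}
$$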


$S(\mathbf{q}, \omega)$ is known as the dynamical structure factor of the crystal, and is entirely
determined  by  the  crystal  itself  without  reference  to  any  properties  of  the  neutrons.4 
Furthermore,  our  result  (N.14)  has  not  even  exploited  the  harmonic  approximation, 
and  is  therefore  quite  general,  applying  (with  the  appropriate  changes  in  notation) 
even  to  the  scattering  of  neutrons  by  liquids.  To  extract  the  peculiar  characteristics 
of  neutron  scattering  by  a  lattice  of  ions,  we  now  make  the  harmonic  approximation. 

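In a harmonic crystal the position of any ion at time t is a linear function of the
positions and momenta of all the ions at time zero. It can be proved,5 however, that
if A and B are operators linear in the u(R) and P(R) of a harmonic crystal, then

$$
\langle e^{A} e^{B} \rangle = e^{\frac{1}{2}\langle A^2 + 2AB + B^2 \rangle}.
\tag{N.15}
$$

This result is directly applicable to (N.12):

$$
\langle \exp[i\mathbf{q}\cdot\mathbf{u}(\mathbf{R}')]\,\exp[-i\mathbf{q}\cdot\mathbf{u}(\mathbf{R}, t)] \rangle =
\exp\Bigl( -\tfrac{1}{2}\langle[\mathbf{q}\cdot\mathbf{u}(\mathbf{R}')]^2\rangle - \tfrac{1}{2}\langle[\mathbf{q}\cdot\mathbf{u}(\mathbf{R}, t)]^2\rangle
+ \langle[\mathbf{q}\cdot\mathbf{u}(\mathbf{R}')][\mathbf{q}\cdot\mathbf{u}(\mathbf{R}, t)]\rangle \Bigr).
\tag{N.16}
$$

This can be further simplified from the observation that the operator products depend
only on the relative positions and times:

$$
\begin{aligned}
\langle[\mathbf{q}\cdot\mathbf{u}(\mathbf{R}')]^2\rangle &= \langle[\mathbf{q}\cdot\mathbf{u}(\mathbf{R}, t)]^2\rangle = \langle[\mathbf{q}\cdot\mathbf{u}(0)]^2\rangle \equiv 2W, \\
\langle[\mathbf{q}\cdot\mathbf{u}(\mathbf{R}')][\mathbf{q}\cdot\mathbf{u}(\mathbf{R}, t)]\rangle &= \langle[\mathbf{q}\cdot\mathbf{u}(0)][\mathbf{q}\cdot\mathbf{u}(\mathbf{R}-\mathbf{R}', t)]\rangle,
\end{aligned}
\tag{N.17}
$$

and therefore:

$$
S(\mathbf{q}, \omega) = e^{-2W}\int \frac{dt}{2\pi}\, e^{i\omega t} \sum_{\mathbf{R}} e^{-i\mathbf{q}\cdot\mathbf{R}}\,
\exp\langle[\mathbf{q}\cdot\mathbf{u}(0)][\mathbf{q}\cdot\mathbf{u}(\mathbf{R}, t)]\rangle.
\tag{N.18}
$$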


Equation  (N.18)  is  an  exact  evaluation  of  S(q,  co),  Eq.  (N.12),  provided  that  the 
crystal  is  harmonic. 


4  It  is  simply  the  Fourier  transform  of  the  density  autocorrelation  function. 

5  N.  D.  Mermin,  J.  Math.  Phys.  7,  1038  (1966)  gives  a  particularly  compact  proof. 
In Chapter 24 we classified neutron scatterings according to the number, m, of
phonons emitted and/or absorbed by the neutron. If one expands the exponential
occurring in the integrand of S,

$$
\exp\langle[\mathbf{q}\cdot\mathbf{u}(0)][\mathbf{q}\cdot\mathbf{u}(\mathbf{R}, t)]\rangle
= \sum_{m=0}^{\infty} \frac{1}{m!}\Bigl(\langle[\mathbf{q}\cdot\mathbf{u}(0)][\mathbf{q}\cdot\mathbf{u}(\mathbf{R}, t)]\rangle\Bigr)^{m},
\tag{N.19}
$$

then it can be shown that the mth term in this expansion gives precisely the
contribution of the m-phonon processes to the total cross section. We limit ourselves here
to showing that the m = 0 and m = 1 terms give the structure we deduced on less
precise grounds for the zero- and one-phonon processes of Chapter 24.


1. Zero-Phonon Contribution (m = 0)  If the exponential on the extreme right of
(N.18) is replaced by unity, then the sum over R can be evaluated with Eq. (L.12),
the time integral reduces to a $\delta$-function as in (N.9), and the no-phonon contribution
to $S(\mathbf{q}, \omega)$ is just

$$
S^{(0)}(\mathbf{q}, \omega) = e^{-2W}\,\delta(\omega)\, N \sum_{\mathbf{K}} \delta_{\mathbf{q},\mathbf{K}}.
\tag{N.20}
$$

The $\delta$-function requires the scattering to be elastic. Integrating over final energies,
we find that:

$$
\frac{d\sigma}{d\Omega} = e^{-2W}\,(Na)^2 \sum_{\mathbf{K}} \delta_{\mathbf{q},\mathbf{K}}.
\tag{N.21}
$$

This is precisely what one expects for Bragg-reflected neutrons: The scattering is
elastic and occurs only for momentum transfers equal to $\hbar$ times a reciprocal lattice
vector. The fact that Bragg scattering is a coherent process is reflected in the cross
section being proportional to $N^2$ times the cross section $a^2$ for a single scatterer,
rather than merely to N times the single ion cross section. Thus the amplitudes combine
additively (rather than the cross sections). The effect of the thermal vibrations of the
ions about their equilibrium positions is entirely contained in the factor $e^{-2W}$, which
is known as the Debye-Waller factor. Since the mean square ionic displacement from
equilibrium $\langle[\mathbf{u}(0)]^2\rangle$ will increase with temperature, we find that the thermal
vibrations of the ions diminish the intensity of the Bragg peaks but do not (as was feared
in the early days of X-ray scattering) eliminate the peaks altogether.6
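To get a feeling for the size of the Debye-Waller factor, one can evaluate 2W in a deliberately crude model. The sketch below assumes an isotropic Einstein oscillator, for which the mean square displacement along any axis is (ħ/2Mω) coth(ħω/2k_BT), so that 2W = q²⟨u_x²⟩. The Einstein model, the function name, and the sample numbers are assumptions made for illustration; they are not taken from the text.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
K_B = 1.380649e-23       # Boltzmann constant, J/K


def debye_waller_factor(q, mass, omega, temperature):
    """e^{-2W} with 2W = q^2 <u_x^2> for an isotropic Einstein oscillator (SI units)."""
    zero_point = HBAR / (2.0 * mass * omega)     # <u_x^2> at T = 0
    if temperature == 0.0:
        thermal = 1.0                            # coth(x) -> 1 as x -> infinity
    else:
        x = HBAR * omega / (2.0 * K_B * temperature)
        thermal = 1.0 / math.tanh(x)             # coth(x)
    two_w = q * q * zero_point * thermal
    return math.exp(-two_w)


# Hypothetical parameters: q = 5 inverse angstroms, M about 63 atomic mass
# units, and an oscillator frequency of 4e13 rad/s.
for temperature in (0.0, 100.0, 300.0, 1000.0):
    print(temperature, debye_waller_factor(5e10, 1.05e-25, 4e13, temperature))
```

Within this model the factor falls as the temperature rises but stays below unity even at T = 0, because of the zero-point motion, and never reaches zero at any finite temperature.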


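2. One-Phonon Contribution (m = 1)  To evaluate the contribution to $d\sigma/d\Omega\,dE$
from the m = 1 term in (N.19) one requires the form of

$$
\langle[\mathbf{q}\cdot\mathbf{u}(0)][\mathbf{q}\cdot\mathbf{u}(\mathbf{R}, t)]\rangle.
\tag{N.22}
$$

This is readily evaluated from Eq. (L.14) and the fact that7

$$
\begin{aligned}
a_{\mathbf{k}s}(t) &= a_{\mathbf{k}s}\, e^{-i\omega_s(\mathbf{k})t}, &
a^\dagger_{\mathbf{k}s}(t) &= a^\dagger_{\mathbf{k}s}\, e^{i\omega_s(\mathbf{k})t}, \\
\langle a^\dagger_{\mathbf{k}'s'} a_{\mathbf{k}s} \rangle &= n_s(\mathbf{k})\,\delta_{\mathbf{k}\mathbf{k}'}\delta_{ss'}, &
\langle a_{\mathbf{k}'s'} a_{\mathbf{k}s} \rangle &= 0, \\
\langle a_{\mathbf{k}s} a^\dagger_{\mathbf{k}'s'} \rangle &= [1 + n_s(\mathbf{k})]\,\delta_{\mathbf{k}\mathbf{k}'}\delta_{ss'}, &
\langle a^\dagger_{\mathbf{k}s} a^\dagger_{\mathbf{k}'s'} \rangle &= 0.
\end{aligned}
\tag{N.23}
$$


6  This is a mark of the long-range order that always persists in a true crystal.

7  Here, as in (23.10), $n_s(\mathbf{q})$ is the Bose-Einstein occupation factor for phonons in mode s with wave
vector q and energy $\hbar\omega_s(\mathbf{q})$.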
We then find that

$$
S^{(1)}(\mathbf{q}, \omega) = e^{-2W} \sum_s \frac{\hbar}{2M\omega_s(\mathbf{q})}\,[\mathbf{q}\cdot\boldsymbol{\epsilon}_s(\mathbf{q})]^2
\Bigl( [1 + n_s(\mathbf{q})]\,\delta[\omega + \omega_s(\mathbf{q})] + n_s(\mathbf{q})\,\delta[\omega - \omega_s(\mathbf{q})] \Bigr).
\tag{N.24}
$$

Substituting this into (N.14) we find that the one-phonon cross section is:

$$
\frac{d\sigma}{d\Omega\,dE} = N e^{-2W}\,\frac{p'}{p}\, a^2 \sum_s \frac{1}{2M\omega_s(\mathbf{q})}\,[\mathbf{q}\cdot\boldsymbol{\epsilon}_s(\mathbf{q})]^2
\Bigl( [1 + n_s(\mathbf{q})]\,\delta[\omega + \omega_s(\mathbf{q})] + n_s(\mathbf{q})\,\delta[\omega - \omega_s(\mathbf{q})] \Bigr).
\tag{N.25}
$$

Note that this does indeed vanish unless the one-phonon conservation laws (24.9) or
(24.10) are satisfied; thus, as a function of energy, $d\sigma/d\Omega\,dE$ is a series of sharp
delta-function peaks at the allowed final neutron energies.

This structure makes it possible to distinguish the one-phonon processes from all
the remaining terms in the multiphonon expansion of S or the cross section, all of
which can be shown to be smooth functions of the final neutron energy. Note that
the intensity in the one-phonon peaks is also modulated by the same Debye-Waller
factor that diminishes the intensity of the Bragg peaks. Note also the factor $[\mathbf{q}\cdot\boldsymbol{\epsilon}_s(\mathbf{q})]^2$,
which enables one to extract information about the phonon polarization vectors.
Finally, the thermal factors $n_s(\mathbf{q})$ and $1 + n_s(\mathbf{q})$ are for processes in which phonons
are absorbed or emitted, respectively. These factors are typical for processes involving
the creation and absorption of Bose-Einstein particles, and indicate (as is reasonable)
that at very low temperatures processes in which phonons are emitted will be the
dominant ones (when they are allowed by the conservation laws).
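The thermal weights in (N.24) are easy to tabulate. The sketch below evaluates the Bose-Einstein factor as a function of the dimensionless ratio x = ħω/k_BT and forms the ratio (1 + n)/n of the emission weight to the absorption weight, which equals e^x. The function names and the use of a dimensionless argument are choices made here for illustration.

```python
import math


def bose_occupation(x):
    """Bose-Einstein factor n = 1/(e^x - 1), with x = hbar*omega/(k_B*T) > 0."""
    return 1.0 / math.expm1(x)


def emission_to_absorption_ratio(x):
    """Ratio (1 + n)/n of the emission weight to the absorption weight."""
    n = bose_occupation(x)
    return (1.0 + n) / n


# Small x is the high temperature regime, large x the low temperature regime.
for x in (0.1, 1.0, 5.0, 20.0):
    print(x, bose_occupation(x), emission_to_absorption_ratio(x))
```

As x grows (low temperature) the absorption weight n vanishes exponentially while the emission weight 1 + n tends to unity, so emission dominates. The sum of the two weights is 1 + 2n = coth(x/2), the combination that appears once the cross section is integrated over energy.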


APPLICATION  TO  X-RAY  SCATTERING 

Aside from the factor $(p'/p)a^2$, peculiar to the dynamics of neutrons, the inelastic
scattering cross section for X rays should have precisely the same form as (N.14).
However, one can generally not resolve the small (compared with X-ray energies)
energy losses or gains occurring in one-phonon processes, and must therefore, in
effect, integrate the cross section over all final energies:

$$
\frac{d\sigma}{d\Omega} \propto \int d\omega\, S(\mathbf{q}, \omega)
= e^{-2W} \sum_{\mathbf{R}} e^{-i\mathbf{q}\cdot\mathbf{R}}\, \exp\langle[\mathbf{q}\cdot\mathbf{u}(0)][\mathbf{q}\cdot\mathbf{u}(\mathbf{R})]\rangle.
\tag{N.26}
$$

This result is simply related to our discussion of X-ray scattering in Chapter 6, in
which we relied on the static lattice model. In that chapter we found that the scattering
for a monatomic Bravais lattice was proportional to a factor:

$$
\Bigl|\sum_{\mathbf{R}} e^{i\mathbf{q}\cdot\mathbf{R}}\Bigr|^2.
\tag{N.27}
$$

Equation (N.26) generalizes this result by allowing for the ions to be displaced from
their equilibrium positions, $\mathbf{R} \to \mathbf{R} + \mathbf{u}(\mathbf{R})$, and taking a thermal equilibrium average
over ionic configurations.
Making the multiphonon expansion in (N.26) will yield the frequency integrals of
the individual terms in the multiphonon expansion we made in the neutron case. The
no-phonon terms continue to give the Bragg peaks, diminished by the Debye-Waller
factor, an aspect of the intensity of the Bragg peaks that was not taken into account
in our discussion of Chapter 6. The one-phonon term yields a scattering cross section
proportional to

$$
\int d\omega\, S^{(1)}(\mathbf{q}, \omega) = e^{-2W} \sum_s \frac{\hbar}{2M\omega_s(\mathbf{q})}\,(\mathbf{q}\cdot\boldsymbol{\epsilon}_s(\mathbf{q}))^2 \coth \tfrac{1}{2}\beta\hbar\omega_s(\mathbf{q}),
\tag{N.28}
$$

where q is the change in X-ray wave vector. Since the change in photon energy is
minute, q is entirely determined by the incident X-ray energy and the direction of
observation. The contribution to (N.28) of the terms arising from the various branches
of the phonon spectrum may be disentangled by doing the experiment at several
values of q differing by reciprocal lattice vectors. However, the major problem is
distinguishing the contribution (N.28) of the one-phonon processes to the total
scattering cross section from the contribution of the multiphonon terms, since the
characteristic structure of the one-phonon terms lies entirely in their singular energy
dependence, which is lost once the integral over $\omega$ has been performed. In practice,
one can do little better than to try to estimate the multiphonon contribution from
the general result (N.26). Alternatively, one can work at temperatures so low, and
momentum transfers q sufficiently small, as to make the expansion (N.19) rapidly
convergent. If crystals were strictly classical this would always be possible, since the
deviations from equilibrium of the ions would vanish as $T \to 0$. Unfortunately,
however, the zero-point ionic vibrations are present even at T = 0, and there is therefore
an intrinsic limit to the rate at which the multiphonon expansion can converge.
Anharmonic Terms and n-Phonon Processes

An anharmonic term of nth degree in the ionic displacements u can be written in
terms of the normal mode lowering and raising operators $a$ and $a^\dagger$ through Eq.
(L.14). Such a term will consist of linear combinations of products containing m
lowering operators, $a_{\mathbf{k}_1 s_1}, \ldots, a_{\mathbf{k}_m s_m}$, and n − m raising operators,
$a^\dagger_{\mathbf{k}_{m+1} s_{m+1}}, \ldots, a^\dagger_{\mathbf{k}_n s_n}$
($0 \le m \le n$). Each such product, when acting on a state characterized by a definite
set of phonon occupation numbers, produces a state in which the $n_{\mathbf{k}s}$ are all
unchanged except for being reduced by one when $\mathbf{k}s = \mathbf{k}_1 s_1, \ldots, \mathbf{k}_m s_m$, and increased
by one when $\mathbf{k}s = \mathbf{k}_{m+1} s_{m+1}, \ldots, \mathbf{k}_n s_n$. Thus an anharmonic term of the nth degree
has nonvanishing matrix elements only between states in which just n phonon
occupation numbers differ.1
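The bookkeeping in this argument can be made concrete with a short sketch. The code below represents a state by a dictionary of occupation numbers, applies a product of lowering and raising operators (ignoring the normalization constants, which do not affect which occupation numbers change), and counts the modes whose occupation differs afterwards. The data layout, the mode labels, and the function names are hypothetical choices made for illustration.

```python
def apply_product(occupations, lowered, raised):
    """Apply a product of lowering operators times raising operators to a state.

    The raising operators stand to the right, so they act first. Returns the new
    occupation numbers, or None if a lowering operator annihilates the state.
    Normalization constants are ignored.
    """
    new = dict(occupations)
    for mode in raised:
        new[mode] = new.get(mode, 0) + 1
    for mode in lowered:
        if new.get(mode, 0) == 0:
            return None
        new[mode] -= 1
    return new


def changed_modes(before, after):
    """Set of modes whose occupation numbers differ between two states."""
    modes = set(before) | set(after)
    return {m for m in modes if before.get(m, 0) != after.get(m, 0)}


# A cubic term (n = 3) with one lowering and two raising operators,
# all acting on different modes labeled by (wave vector, branch).
state = {("k1", 1): 2, ("k2", 1): 0, ("k3", 2): 1}
final = apply_product(state, lowered=[("k1", 1)], raised=[("k2", 1), ("k3", 2)])
print(final, len(changed_modes(state, final)))
```

When all n modes in the product are distinct, exactly n occupation numbers change, each by one, which is the statement made in the text.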


1  The overwhelming majority of terms in the expansion of any anharmonic term will have all the
$\mathbf{k}_i s_i$ different, unless the crystal is microscopically small. Terms in which the occupation number of a
given normal mode changes by two or more make a negligible contribution in the limit of a large crystal.
Appendix P

Evaluation of the Landé g-Factor

We take the scalar product of both sides of (31.34) with $\langle JLSJ_z'|\mathbf{J}|JLSJ_z\rangle$ and sum
over $J_z'$, for fixed J. Since the matrix elements of J vanish between states with differing
values of J, we may as well sum over all states in the (2L + 1)(2S + 1) dimensional
manifold with given L and S. Having noted this, however, we can use the completeness
relation

$$
\sum_{J' J_z'} |J'LSJ_z'\rangle\langle J'LSJ_z'| = 1,
\tag{P.1}
$$

to replace the sums of products of matrix elements by matrix elements of the operator
products:

$$
\langle JLSJ_z|(\mathbf{L} + g_0\mathbf{S})\cdot\mathbf{J}|JLSJ_z\rangle = g(JLS)\,\langle JLSJ_z|\mathbf{J}^2|JLSJ_z\rangle.
\tag{P.2}
$$

Now we simply observe that the operator identities

$$
\begin{aligned}
\mathbf{S}^2 &= (\mathbf{J} - \mathbf{L})^2 = \mathbf{J}^2 + \mathbf{L}^2 - 2\mathbf{L}\cdot\mathbf{J}, \\
\mathbf{L}^2 &= (\mathbf{J} - \mathbf{S})^2 = \mathbf{J}^2 + \mathbf{S}^2 - 2\mathbf{S}\cdot\mathbf{J},
\end{aligned}
$$

combined with

$$
\langle JLSJ_z| \begin{Bmatrix} \mathbf{J}^2 \\ \mathbf{L}^2 \\ \mathbf{S}^2 \end{Bmatrix} |JLSJ_z\rangle
= \begin{Bmatrix} J(J+1) \\ L(L+1) \\ S(S+1) \end{Bmatrix},
\tag{P.3}
$$

permit one to evaluate (P.2) as

$$
\begin{aligned}
g(JLS)\,J(J+1) &= \langle JLSJ_z|(\mathbf{L}\cdot\mathbf{J})|JLSJ_z\rangle + g_0\langle JLSJ_z|(\mathbf{S}\cdot\mathbf{J})|JLSJ_z\rangle \\
&= \tfrac{1}{2}\,[J(J+1) + L(L+1) - S(S+1)] \\
&\quad + \frac{g_0}{2}\,[J(J+1) + S(S+1) - L(L+1)],
\end{aligned}
\tag{P.4}
$$

which is equivalent to the result (31.37) quoted in the text.
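Equation (P.4) is simple enough to evaluate exactly. The sketch below solves it for g(JLS) using rational arithmetic; the function name and the default choice g0 = 2 (rather than the slightly larger measured electron g-factor) are assumptions made for illustration.

```python
from fractions import Fraction


def lande_g(J, L, S, g0=Fraction(2)):
    """Solve Eq. (P.4) for g(JLS); J must be nonzero."""
    J, L, S, g0 = (Fraction(x) for x in (J, L, S, g0))
    if J == 0:
        raise ValueError("g(JLS) is undefined for J = 0")
    jj, ll, ss = J * (J + 1), L * (L + 1), S * (S + 1)
    numerator = Fraction(1, 2) * (jj + ll - ss) + (g0 / 2) * (jj + ss - ll)
    return numerator / jj


# The two levels of a single p electron (L = 1, S = 1/2).
print(lande_g(Fraction(3, 2), 1, Fraction(1, 2)))   # J = 3/2
print(lande_g(Fraction(1, 2), 1, Fraction(1, 2)))   # J = 1/2
```

Two limiting cases give a quick check: for S = 0 (so J = L) the result is 1, the pure orbital value, and for L = 0 (so J = S) it is g0.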
a_0, see Bohr radius
Abrupt  junction,  591 
Acceptor  impurities,  577 
See  also  Impurities  in  semiconductors;  p-n 
junction;  Semiconductors 
Acoustic  modes,  435 
vs.  optical  modes,  437 
and  polyatomic  bases,  442 
See  also  Lattice  vibrations;  Phonons 
Adiabatic  approximation,  425-426 
analytic  justification  of,  529n 
in  metals,  513 

Adiabatic  demagnetization,  659-661, 669 
Alkali  earths,  298 
Alkali  halides,  380-385 
band  structure,  382 
bulk  moduli,  table  of,  408 
charge  distribution,  381 
cohesive  energy,  402-407 
table  of,  406 

color  centers,  622-626, 640 
conductivity,  621 
Debye  temperatures,  table  of,  459 
defects,  620 

diamagnetic  susceptibilities,  table  of,  649 
dielectric constants, table of, 553
Grüneisen parameters, table of, 494
band structure and Fermi surface, 284-288
bulk  moduli,  table  of,  39 
cohesion,  410-412, 413-414 
conduction  electron  densities,  table  of,  5 
free electron parameters, table of, 38
Hall  coefficients,  15,  288 
ionization  potentials,  391 
optical properties, 18, 293-296
Pauli  paramagnetism,  664 
r_s, table of, 5
specific  heats,  table  of,  48 
thermal  conductivities,  table  of,  21 
transparency  in  ultraviolet,  18 
Alloys,  309-310 
dilute  magnetic,  685-687 


Amorphous  solids,  62 
anomalous  thermal  properties,  504n 
X-ray  diffraction,  96n 
Anderson  model,  687 

Angular  momentum,  notational  convention,  647n 
Anharmonic  terms,  422,  487-509 
cubic  terms  and  conservation  laws,  490,  509 
and Grüneisen parameter, 508, 509
and  n-phonon  processes,  796 
and  one-phonon  widths,  476-478 
quartic  vs.  cubic  terms,  489-490,  498 
and  reststrahl  resonance,  552 
and scattering of phonons, 497-499
and  second  sound,  506-508 
and  soft  modes,  556 

and  specific  heat  at  high  temperature,  428-429, 
455 

and  thermal  conductivity,  495-505 
and  thermal  expansion,  490-492, 495 
and  thermodynamic  properties,  490-492 
and  umklapp  processes,  501-505 
Anion,  380 

Anisotropy  energy,  720-721 
Annealing,  621 

Annihilation  operator  (phonon),  781 
Anomalous  skin  effect,  277-278 
Antibonding  states,  678n 
Antiferromagnetism,  672,  694-696 
energy  of  ground  state,  704,  722 
Heisenberg model, 704
linear chain (Bethe solution), 704
mean  field  theory,  718n,  723-724 
susceptibility, 699, 702, 718n
table,  697 

Antimorph  of  color  center,  625 
Anti-Stokes  component,  481 
APW, see Augmented plane-wave method
Atomic  form  factor,  107-108 
Atomic  Hamiltonian,  176,  180n 
Atomic  levels  broadened  into  bands,  182-183 
Atomic  orbitals,  linear  combination  of  (LCAO), 
179 

Atomic  wave  functions  and  interatomic  distance 
in sodium, 177

of  neon,  compared  with  sodium,  375 
Augmented  plane-wave  method  (APW),  200-202 
Augmented plane-wave method (continued)
compared  with  Green’s  function  (KKR) 
method,  205 

Avogadro’s number (N_A), 4, 49
Band  gap,  142 

at  Bragg  planes,  161-162,  172 
in  insulators,  562 
in  one  dimension,  148 
in  semiconductors,  562 
measurement  of,  566-568 
table  of,  566 

temperature  dependence  of,  566-567 
Band  index,  134n,  140 
for  nearly  free  electrons,  153 
Band  overlap,  146, 223 
Band  repulsion,  155 
Band  structure,  141 
of  alkali  halides,  382 

and  distinction  between  insulators  and  metals, 
223 

and  ferromagnetism,  684n 
methods for calculating, 192-211
augmented  plane-wave  (APW)  method,  200- 
202 

cellular  method,  195-200 
Green’s  function  (KKR)  method,  202-206 
orthogonalized plane-wave (OPW) method,
206-208 

pseudopotential  method,  208-210 
See  also  Nearly  free  electron  approximation; 
Tight-binding  method 
in  one  dimension,  146-149 
of  selected  metals,  284-311 
of  selected  semiconductors,  568-570 
See  also  Bloch  electrons;  Density  of  levels; 
Fermi  surface 

Band  width,  d  compared  with  s,  307 
free  electron,  in  Hartree-Fock  approximation, 
335 

in  one  dimension,  148-149 
in  tight-binding  method,  182-183 
Bands,  vs.  atomic  levels,  182-183 
“bending” of, 594
filled,  221-224,  250 
partially  filled,  223 

Bardeen,  Cooper,  Schrieffer  theory,  739-747 
and  Cooper  pairs,  740-743 
critical  field,  745 
critical  temperature,  744 
effective  interaction,  743 
energy  gap,  744 
excited  states,  742-743 
ground  state,  741 
and  Meissner  effect,  746-747 
and persistent currents, 750-751


Bardeen, Cooper, Schrieffer theory (continued)
specific  heat  discontinuity,  745 
specific  heat  (electronic)  at  low  temperature,  746 
See  also  Cooper  pairs;  Superconductivity 
Bare  ions,  513 

Base-centered  orthorhombic  Bravais  lattice,  117 
Basis,  75 

bcc,  see  Body-centered  cubic  Bravais  lattice 
BCS,  see  Bardeen,  Cooper,  Schrieffer  theory 
Biased  junction,  598 
Binding  energy,  see  Cohesive  energy 
Birefringence,  776 
Blackbody  radiation,  466-467 
Bloch  electrons,  density  of  levels,  143-145 
dynamics  of,  214-241 
effective  mass  (dynamic),  227-229 
effective  mass  theorem,  765-766 
vs.  free  electrons,  214 
holes,  225-229 

no  collisions  with  static  lattice,  215,  418 
in  one  dimension,  146-149,  159-161 
orbits  in  a  magnetic  field,  229-233 
period,  232-233 
quantization,  271-272 
s- and p-like, 159
in  static  electric  field,  224 
at  surfaces,  367-369,  369-371 
velocity,  141,  765 
wave  vector,  136,  139 

See  also  Band  gap;  Band  structure;  Density  of 
levels;  Effective  mass;  Fermi  surface; 
Nearly  free  electrons ;  Semiclassical  model ; 
Tight-binding  method 
Bloch T^{3/2} law, 708
Bloch T^5 law, 526
Bloch’s  theorem,  133 
first proof, 134
second  proof,  137 
and  surface  levels,  367-369 
Bloch  wall,  719-721 

Body-centered  cubic  Bravais  lattice,  67-68 
conventional  cell,  75 
coordination  number,  71 
elements  with,  70 
first  Brillouin  zone,  89 
higher  Brillouin  zones,  164 
lattice  sums  of  inverse  powers,  400 
packing  fraction,  83 
primitive  vectors,  68 
reciprocal  of,  88 

relation  to  centered  tetragonal  Bravais  lattice, 
117 

as  simple  cubic  with  structure  factor,  105 
Wigner-Seitz  cell,  74 

Body-centered  orthorhombic  Bravais  lattice,  118 
Bohm-Staver  relation,  514 
Bohr magneton, 646
effective  number,  656-658 
Bohr radius (a_0), 4
accurate  numerical  value,  757 
of  impurity  level  in  semiconductor,  579 
Bohr-van  Leeuwen  theorem,  269,  646 
Boltzmann’s constant (k_B), 23
accurate  numerical  value,  757 
Boltzmann  distribution,  see  Maxwell-Boltzmann 
distribution 

Boltzmann  equation,  319-328 
and  conservation  laws,  327 
derivation  of  relaxation-time  approximation 
for  isotropic  elastic  impurity  scattering, 
324-326 

solution to in relaxation-time approximation,
320 

variational  principle,  327-328 
See  also  Relaxation-time  approximation 
Bonding,  378-379,  389 
Bond  polarizability,  552 
Bonding  states,  678n 
Born-Mayer  potential,  407 
Born-Oppenheimer  method,  see  Adiabatic  ap¬ 
proximation 

Born-von  Karman  boundary  condition,  see 
Boundary  conditions 
Bose-Einstein  condensation,  37n 
Bose-Einstein  distribution,  454n 
Bose  gas,  ideal,  454 

Bound  charge  vs.  free  charge,  534n,  777 
Boundary  conditions,  32 
Born-von  Karman  (periodic)  for  Bloch  elec¬ 
trons,  135 

for  free  electrons,  33 
for linear chain, 431
for spin waves, 705n
for  tight-binding  method,  178n 
for  free  electron  gas,  32 
at  surfaces,  368 
Bragg  condition,  97 

Bragg  formulation  of  X-ray  diffraction,  96-97 
equivalence to formulation of von Laue, 99-100
Bragg peaks, 96
and  Debye-Waller  factor,  795 
and  magnetic  reflections,  696-697 
as  zero-phonon  scattering,  473 
Bragg  planes,  99 

and  nearly  free  electrons,  156-159 
Bragg  reflection,  100 
order  of,  97 
Bravais  lattice,  64-75 

of  average  positions  in  dynamic  system,  422 
definitions  of,  64-65,  70n 
and  finite  crystals,  66 
inversion  symmetry  of,  128 


Bravais  lattice  (continued) 
nontranslational  symmetries  of,  112-129 
number  of,  120 
origin  of  name,  115 
symmetry  group  of,  112,  114,  116 
symmetry  operations,  113,  120-123 
allowed  rotation  angles,  121,  129 
types  enumerated,  115-119 

base-centered  orthorhombic,  117 
body-centered  cubic,  67-68 
body-centered  orthorhombic,  118 
centered  monoclinic,  118 
centered  tetragonal,  115 
face-centered  cubic,  68-69 
face-centered  orthorhombic,  118 
rhombohedral  (trigonal),  119 
simple  cubic,  65 
simple  hexagonal,  77,  119 
simple  monoclinic,  118 
simple  orthorhombic,  117 
simple  tetragonal,  115 
triclinic,  118 
use  of  term,  70 

See  also  Crystal  Systems;  Reciprocal  lattice 
Breakdown,  electric,  220 
conditions  for  absence  in  semiclassical  model, 
219-220,  773-775 

in  inhomogeneous  semiconductors,  592 
and  Wannier  functions,  188 
Breakthrough,  magnetic,  220 
conditions  for  absence  in  semiclassical  model, 
219-220,  773-775 
in  hexagonal  divalent  metals,  299 
and  spin-orbit  coupling,  220 
and  Wannier  functions,  188 
Brillouin  function,  655 
Brillouin  scattering,  419,  481-486 
classical  picture  of,  482-486 
Stokes  and  anti-Stokes  components,  481 
Brillouin  zone,  first,  89 

for  face-centered  and  body-centered  cubic 
lattices,  89 

and  reduced  zone  scheme,  142-143 
Brillouin  zones,  higher,  162-166,  171-173 
for  face-centered  and  body-centered  cubic 
lattices,  164 
as  primitive  cells,  163 
for  square  lattice,  163,  171 
Bulk  modulus,  39 
of  alkali  halides,  407-408 
table,  408 

free  electron,  39-40 
of  noble  gas  solids,  401-402 
table,  401 

of  selected  metals,  39 
Burgers  vector,  634-635 
c/a  ratio,  hexagonal  close-packed  elements,  table 
for,  77 

in  hexagonal  reciprocal  lattice,  88-89 
ideal  value,  78 

calories/mole  to  eV/molecule  conversion,  396n 
Carrier  densities  in  semiconductors,  572-577, 
581-584,  586-587 

in  equilibrium  p-n  junction,  graph,  597 
extrinsic,  575-577 
intrinsic,  575 
minority,  595n,  598, 606n 
position  dependence,  609 
in  nonequilibrium  p-n  junction,  graph,  597 
thermal  generation,  603 
Carrier  types,  218-219 
Cart,  before  horse,  92 
Casimir  limit,  504 
Cation,  380 

c-axis,  hexagonal  systems,  77 
tetragonal  systems,  115 
Cell,  see  Conventional  unit  cell;  Primitive  cell 
Cellular  method,  195-200 
difficulties  with,  199 

Cellular  wave  function  vs.  atomic,  197-198 
Centered  monoclinic  Bravais  lattice,  118 
Centered  tetragonal  Bravais  lattice,  115 

relation  to  face-centered  and  body-centered 
cubic,  117 

Cesium  chloride  structure,  80 
in  alkali  halides,  380 
compounds  with,  table  of,  81 
Chambers’  path  integral,  247n 
Charge  density  in  alkali  halides,  381 
near  surface  of  crystal,  358 
Chemical  potential,  41 

electrochemical  potential,  257n,  593-594 
of  electron  gas  at  zero  temperature,  43 
vs.  Fermi  energy,  43,  46,  142n,  573n 
of  intrinsic  semiconductor,  575-576 
local,  245 

in  AC  electric  field,  260 
quasichemical  potential,  613 
temperature  dependence  in  metals,  46-47 
thermodynamic  treatment  of,  759 
Chemical  shift,  665n 
Classical  limit,  50-51 
Clausius-Mossotti  relation,  542 
spelling,  542n 

Close  packing  of  spheres,  77-79 
and  face-centered  cubic  structure,  79 
and  hexagonal  close-packed  structure,  78 
and  other  structures,  79 
packing  fraction,  82-83 
Coercive  force,  721,  722 
Coherence  length  in  superconductors,  738n 
and  size  of  Cooper  pair,  742 


Cohesion  in  covalent  crystals,  409-410 
in  free  electron  metals,  410-412,  413 
in  noble  gas  solids,  398-402 
Cohesive  energy,  396-414 
of  ionic  crystals,  402-407 
table,  406 

of  noble  gas  solids,  401 
table,  401 

Collisions,  of  Bloch  electrons,  215 
in  Drude  model,  6 

effect  on  nonequilibrium  distribution  function, 
317-319 

electron-phonon,  523-528 

with  magnetic  impurities,  687-689 

with  nonmagnetic  impurities  (elastic),  320-322 

between  phonons,  499 

See  also  Anharmonic  terms;  Second  sound 
probability  of,  316-317 
sources  of,  216,  315-316 
and  thermal  equilibrium,  6,  245 
See  also  Relaxation  time;  Relaxation-time 
approximation;  Scattering 
Collision  term,  320 
Collision  time,  see  Relaxation  time 
Color  centers,  622-626 
F-center,  624 
FA-center,  626 
H-center,  625 
M-center,  624 
R-center,  624 
VK-center,  625 

width  of  optical  absorption  line,  625 
Compensated  materials,  236 
Hall  coefficient,  237,  240 
magnetoresistance,  236,  240 
Compressibility,  isothermal  vs.  adiabatic,  492 
See  also  Bulk  modulus 
Compton  scattering,  480 
Conduction  band  in  metals,  223 
in  semiconductors,  562 
Conduction  electrons,  3 
See  also  Bloch  electrons;  Electron  gas;  Free 
electron  approximation;  Independent  elec¬ 
tron  approximation 

Conductivity,  AC  electrical,  vs.  dielectric  con¬ 
stant,  776-778 
in  Drude  model,  16,  59 
at  finite  wavelengths,  260-261 
and  interband  transitions,  253n 
nonlocal,  17 

quantum  mechanical,  252 
and  reflectivity,  778 
in  semiclassical  model,  252-253 
Conductivity,  DC  electrical,  anisotropy,  59,  250 
Bloch  T^5  law,  526 
conversion  of  units,  9 
Conductivity,  DC  electrical  (continued) 
of  disordered  alloys,  310 
Drude  formula  for,  7 

and  energy  gap  of  intrinsic  semiconductors,  568 
and  Fermi  surface  area,  259-260 
and  holes,  251 

and  impurities  in  semiconductors,  564-565 
of  ionic  crystals,  621-622 
and  lattice  vibrations,  418,  523-528 
in  perfect  crystals,  216 

perfect,  distinguished  from  superconductivity, 
731,  738 

resistance  minimum,  687-689 
in  semiclassical  model,  251 
of  semiconductors,  563 
temperature  dependence,  59, 418,  526,  563 
in  uniform  magnetic  field,  259,  261-262 
Conductivity  tensor,  240 
in  cubic  systems,  250 
in  hexagonal  systems,  259 
Conductivity,  thermal,  of  insulators,  495-505 
Casimir  limit,  504 
high  temperature,  501 
infinite  in  harmonic  approximation,  497 
infinite  without  umklapp  processes,  503 
and  lattice  vibrations,  419 
low  temperature,  501-505 
vs.  metals,  20, 497n 
and  umklapp  processes,  503 
Conductivity,  thermal,  of  metals,  20-24 
Drude  formula  for,  22 
vs.  insulators,  20, 497n 
in  semiclassical  model,  253-256 
in  Sommerfeld  model,  52 
in  superconductors,  730-731 
table,  21 

See  also  Wiedemann-Franz  law 
Conservation  laws  and  relaxation-time  approxi¬ 
mation,  327 

Contact  (Fermi,  hyperfine)  interaction,  665n 
Contact  potentials,  360,  369 

Kelvin’s  method  for  measuring,  361-362 
See  also  Work  function 
Continuity,  literary,  626n 
Conventional  unit  cell,  73 
for  face-centered  and  body-centered  cubic 
Bravais  lattices,  75 
Cooper  pairs,  740-743 
vs.  bosons,  741 
crude  calculation,  755 
size  of,  741n,  742 
Coordination  number,  71 
Core-core  repulsion,  379,  397 
Born-Mayer  potential,  407 
in  ionic  crystals,  405 
in  Lennard-Jones  potential,  398 


Core-core  repulsion  (continued) 
and  phonons  in  metals,  529 
and  polarizability  of  ionic  crystals,  545 
Core  electrons,  3,  374 
wave  function,  194 
compared  with  valence,  195 
Correlation  energy,  336 

See  also  Electron-electron  interactions 
Correlation  length,  714 
Correspondence  principle,  271 
Coulomb  potential,  and  cohesive  energy  of  ionic 
crystals,  403-407 

and  constraint  in  defect  concentrations,  619 
Fourier  transform  of,  334,  351-352 
the  “glue”  of  solids,  378 
and  ionic  plasma,  513 
and  lattice  sums,  403-405 
Madelung  constants,  405 
and  optical  modes  in  ionic  crystals,  547 
screened,  342 
and  surface  effects,  354 
See  also  Long  range  interactions;  Screening 
Covalent  bond,  379,  389,  552 
bond  polarizability,  552 
in  metals,  391 

Covalent  crystals,  376,  387-389 
compared  with  ionic,  388 
dielectric  constant,  553 
electronic  charge  distribution,  377 
polarizability  of,  552-553 
semiconducting,  564-566 
Covalent  radii,  387n 
Creation  operator  (phonon),  781 
Critical  current,  730 
in  cylindrical  wire,  755 
Silsbee  effect,  730 

Critical  field  (Hc),  728-729,  732-734 
lower  (Hc1),  732-734 
relation  to  latent  heat,  754 
relation  to  specific  heat  discontinuity,  754 
in  type  I  superconductors,  732 
in  type  II  superconductors,  732-734 
upper  (Hc2),  732-734 

See  also  Meissner  effect;  Superconductivity 
Critical  point,  699n,  712-714 
exponents,  699-701 
mean  field  theory,  723 
two-dimensional  Ising  model,  713 
mean  field  theory,  715-718,  723 
scaling  equation  of  state,  713-714 
scaling  form  of  correlation  function,  714n 
See  also  Mean  field  theory 
Critical  temperature,  ferroelectric,  556-557 
table,  557 

See  also  Ferroelectricity 
Critical  temperature,  magnetic,  694,  699 
Critical  temperature,  magnetic  (continued) 
antiferromagnets,  table  for,  697 
ferrimagnets,  table  for,  698 
ferromagnetic  and  strength  of  dipolar  inter¬ 
action,  674n 

ferromagnets,  table  for,  697 
mean  field  vs.  exact,  717 
See  also  Antiferromagnetism;  Ferrimagnetism; 
Ferromagnetism;  Magnetic  ordering 
Critical  temperature,  superconducting,  728 
BCS  prediction,  744 
in  magnetic  field,  728,  732 
table,  729 

See  also  Superconductivity 
Crossed  fields,  electron  motion  in,  233-239 
Crystal  field  splitting,  657-658 
Crystallinity,  criterion  for,  64 
Crystallographic  point  groups,  114, 119-125 
cubic,  121 

international  nomenclature,  121-124 

noncubic,  122 

number  of,  119-120 

operations,  120-123 

relation  to  crystal  systems,  119-120 

Schoenflies  nomenclature,  121-123 

See  also  Crystal  systems 

Crystal  momentum,  classical  view  of  conserva¬ 
tion,  482-486 

conservation  of,  in  electron-phonon  scattering, 
524,  788 

in  neutron  scattering,  471-473,  788-789 
in  phonon-phonon  scattering,  499,  787 
of  electrons,  139 
general  theory  of,  784-789 
and  interband  transitions,  294 
vs.  momentum,  139,  219,  472,  784-785 
operator,  787 

and  optical  properties  of  semiconductors,  567 
and  umklapp  processes,  502 
and  vector  potentials,  771 
See  also  Umklapp  processes 
Crystal  structure,  75 
cesium  chloride  structure,  80-81 
table,  81 

determination  by  neutron  scattering,  473 
determination  by  X-ray  scattering,  96-110 
diamond  structure,  76 
table,  76 

hexagonal  close-packed  structure,  76-79 
table,  77 

perovskite  structure,  557 
sodium  chloride  structure,  80 
table,  80 

white  tin  structure,  127 
wurtzite  structure,  387n 


Crystal  structure  (continued) 
zincblende  structure,  76,  81 
table,  81 

Crystal  systems,  114-120 
as  Bravais  lattice  point  groups,  114,  116, 120 
cubic,  115 
hexagonal,  119 
hierarchy  of,  120 
monoclinic,  118 
number  of,  119-120 
orthorhombic,  117 
tetragonal,  115 
triclinic,  118 
trigonal,  119 

Crystals,  growth  of,  636-637 
strength  of,  630,  635-636 
Cubic  Bravais  lattices,  see  Body-centered  cubic; 
Face-centered  cubic;  Simple  cubic  Bravais 
lattices 

Cubic  crystal  system,  115 
point  group,  nomenclature,  124-125 
table,  121 

relation  to  trigonal  system,  119 
See  also  Body-centered  cubic;  Face-centered 
cubic;  Simple  cubic  Bravais  lattices 
Curie’s  law,  655-656 
corrections  to,  709-712 
and  effective  Bohr  magneton  number,  656-657 
rare  earth  ions,  table  of,  657 
transition  metal  ions,  table  of,  658 
free  ions  vs.  solids,  656-659 
from  high  temperature  expansion,  669,  711 
See  also  Susceptibility 

Curie  temperature,  see  Critical  temperature, 
ferroelectric;  Critical  temperature,  magnetic 
Curie-Weiss  law,  712n,  718 
Current  density,  electrical,  7 
thermal,  254 
Cyclotron  frequency,  14 
numerical  formulas  for,  15,  758 
in  semiconductors,  571 
Cyclotron  mass,  233,  239 
in  metals,  278 
in  semiconductors,  571 
See  also  Effective  mass 
Cyclotron  resonance  in  metals,  278-279 
in  semiconductors,  278n,  570-571 

d-bands,  181 

in  noble  metals,  288-289 
in  transition  metals,  306-307 
de  Boer  parameter,  412-413 
table,  412 

de  Broglie  wavelength,  34 
numerical  relation  to  energy  of  electrons,  365 
Debye  frequency  (ωD),  458 
compared  with  Fermi  energy,  529 
Debye-Hückel  theory,  342n 
of  depletion  layer,  611 

Debye  model  of  phonon  spectrum,  457-461, 
465-466 

density  of  levels,  465-466 
vs.  Einstein  model,  462-463 
Grüneisen  parameter,  493 
specific  heat  interpolation  formula,  459-461 
graph,  460 
table,  461 

See  also  Specific  heat  (lattice) 
Debye-Scherrer  method,  102-104 
Ewald  construction  for,  104 
Debye  T^3  law,  457 
See  also  Specific  heat  (lattice) 

Debye  temperature  (ΘD) 
of  alkali  halides,  459 
dependence  on  temperature,  459-460 
of  selected  elements,  461 
Debye  unit,  559 
Debye-Waller  factor,  486,  793 
Debye  wave  vector  (kD),  458 
relation  to  Fermi  wave  vector,  458n 
Defects  in  crystals,  616-641 
annealing,  621 
and  charge  neutrality,  619 
and  color  centers,  622-626 
and  conductivity  of  ionic  crystals,  621-622 
dislocations,  628-638 
Frenkel  type,  620 
grain  boundaries,  638 
magnetic  impurities,  687-689 
and  magnetization  process,  722 
mixed  Frenkel  and  Schottky  type,  639 
point,  line,  and  surface,  616 
Schottky  type,  616 
as  source  of  scattering,  216,  315 
stacking  faults,  637 
surface  defects,  637-638 
thermodynamics  of  line  and  surface  defects, 
620-621 

thermodynamics  of  point  defects  (Frenkel, 
Schottky,  or  mixed),  616-620,  638 
See  also  Color  centers;  Dislocations 
Deformation  twin,  637 
Degenerate  semiconductors,  573 
See  also  Semiconductors 
de  Haas-van  Alphen  effect,  264-275 
in  alkali  metals,  285-287 
and  density  of  levels,  272-274 
and  electron  spin,  274-275 
and  field  uniformity,  281 
measurement  of,  265-266 


de  Haas-van  Alphen  effect  (continued) 
minimum  specimen  size,  271n 
in  noble  metals,  289-291 
quantization  of  orbit  area,  271-272 
and  scattering,  274 
thermal  broadening,  274 
in  transition  metals,  307 
Delta,  Kronecker,  87 
Demagnetization  factor,  722 
Density  of  levels  (electron),  Bloch  electrons, 
143-145,  149 
and  electron  spin,  143n 
free  electron,  44-45 

and  Pauli  paramagnetism,  661-662,  663n 
phonon  correction  to,  520 
in  rare  earth  metals,  308-309 
in  semiconductors,  574,  586 
graph  for  impure  case,  579 
graph  for  pure  case,  576 
singularities  in  strong  magnetic  field,  272-274 
and  specific  heat,  47 
in  transition  metals,  307 
in  two  band  nearly  free  electron  model,  170 
in  two  dimensions,  53 
van  Hove  singularities,  145,  150 
Density  of  levels  (k-space),  35,  136 
Density  of  levels  (phonon),  464-466 
in  Debye  model,  465 
in  Einstein  model,  466 

Density  of  normal  modes,  see  Density  of  levels 
(phonon) 

Density  of  states,  see  Density  of  levels 
Depletion  layer  (space  charge  region),  591 
See  also  p-n  junction 
Depolarization  factor,  540n 
Detailed  balancing,  322 
Diamagnetism,  atomic  (Larmor),  648-652 

compared  with  conduction  electron  magne¬ 
tism  in  metals,  668 

compared  with  Curie  law  paramagnetism, 
668 

in  ionic  crystals,  649 
in  noble  gas  solids,  649 
in  doped  semiconductors,  666 
Landau,  664-665 

perfect  (Meissner  effect),  727,  731,  739 
See  also  de  Haas-van  Alphen  effect;  Suscep¬ 
tibility 

Diamond  structure,  76 
bond  angle,  82 
elements,  table  of,  76 
and  inversions,  128 
packing  fraction,  83 
structure  factor,  106 

Dielectric  constant,  alkali  halides,  table  for,  553 
Dielectric  constant  (continued) 
Clausius-Mossotti  relation,  542 
and  conductivity,  776-778 
of  covalent  crystals,  553 
of  electron  gas,  338-344 
Drude,  17 

Lindhard,  343-344,  352 
Thomas-Fermi,  341-342 
of  ferroelectrics,  556 
“high  frequency,”  546n 
and  index  of  refraction,  534 
in  ionic  crystals,  548-550 
Lyddane-Sachs-Teller  relation,  548-549 
of  a  metal,  17,  515-518 
and  optical  modes,  548 
quantum  mechanical  form,  779 
and  reflectivity,  778 
of  semiconductors,  578-579 
See  also  Polarizability 

Diffraction,  see  Electron  diffraction;  Neutron 
scattering;  X-ray  diffraction 
Diffusion  constant,  602 
Diffusion  current,  601 
Diffusion  lengths,  604-605 
Diffusion  region  (p-n  junction),  606-610 
Dipolar  interaction,  see  Magnetic  dipolar  inter¬ 
action 

Dipole  moment  (electric),  see  Ferroelectricity; 

Polarizability;  Pyroelectricity 
Dipole  moment  (magnetic),  see  Magnetic  mo¬ 
ment 

Direct  exchange,  681-682 
Direct  lattice,  86 
Direct  optical  transition,  567 
Direct  term  in  Hartree-Fock  equation,  333 
Directions,  conventions  for  specifying,  92-93 
Dislocations,  616, 628-638 
Burgers  vector,  634-635 
densities  of,  632 
edge,  632 
elastic  energy,  640 
general  dislocation,  633-635 
and  grain  boundaries,  638 
and  growth  of  crystals,  636-637 
observation  of,  637 
screw,  632 
and  slip,  632-633 
and  stacking  faults,  637 
and  strength  of  crystals,  635-636 
and  twinning,  637 
and  work  hardening,  636 
See  also  Defects  in  crystals 
Disordered  alloys,  309-310,  584n 
Dispersion  curve,  432 
for  linear  chain  with  basis,  434-435 


Dispersion  curve  (continued) 
for  monatomic  linear  chain  with  nearest-neighbor 
interactions,  432-433 
with  mth  nearest-neighbor  interactions,  448 
for  three-dimensional  lattice  with  basis,  443 
for  three-dimensional  monatomic  Bravais  lat¬ 
tice,  441 

See  also  Lattice  vibrations;  Phonons 
Dispersion  relation  (Kramers-Kronig),  778 
Displacement,  electric,  534-535,  541 
Displacement  (translation)  of  Bravais  lattice,  70 
Distribution  function,  Bose-Einstein,  454 
equilibrium  electronic,  40-43 
Fermi-Dirac,  30-31, 40-42 
Maxwell-Boltzmann,  30-31 
nonequilibrium,  electronic,  244 
calculation  of  in  relaxation-time  approximation, 
246-250 
linearization  of,  249 
phonon,  454 

rate  of  change  due  to  collisions,  317-319 
in  relaxation-time  approximation,  318-319 
See  also  Boltzmann  equation;  Fermi-Dirac 
distribution;  Relaxation-time  approxima¬ 
tion 

Divalent  metals,  298-299 
Domains,  718-722 
and  anisotropy  energy,  720-721 
Bloch  wall,  719-720 
and  dipolar  energy,  718-720 
Donor  impurities,  577 

See  also  Impurities  in  Semiconductors;  p-n 
junction;  Semiconductors 
Doping  of  semiconductors,  590-591 
doping  profile,  591 

Doppler  shift  of  scattered  waves,  484 
Double  layer,  358 
Dressed  ions,  513 

and  dielectric  constant  of  metals,  515-518 
Drift  current,  601 
Drift  term,  320 

Drift  velocity  in  crossed  fields,  233 
Drude  model,  2-27 
basic  assumptions,  4-6 

shortcomings  arising  from  classical  statistics, 
23, 25,  30,  58 

Dulong  and  Petit  law,  426-429 
failure  of,  429 

quantum  corrections  at  high  temperature,  455, 
467-468 

and  Rayleigh-Jeans  catastrophe,  466 
See  also  Specific  heat  (lattice) 

Dynamical  matrix,  439 
for  face-centered  cubic  crystal,  449 
symmetries  of,  438 
Dynamical  structure  factor,  792 
in  harmonic  approximation,  792-794 

Edge  dislocation,  632 
See  also  Dislocations 

Effective  Bohr  magneton  number,  656-658 
Effective  mass,  cyclotron  (metals),  233-239 
measurement  of,  278-279 
cyclotron  (semiconductors),  239,  568 
and  DC  conductivity,  251 
in  Hartree-Fock  approximation,  337,  352 
of  holes,  227 

phonon  contribution  to,  520,  530 
with  screened  exchange,  344 
of  semimetals,  305n 
specific  heat,  239 
table,  48 

tensor,  228-229,  568n,  766 
theorem,  765-766 

Einstein  model  of  phonon  spectrum,  462-463 
vs.  Debye  model,  462-463 
density  of  levels,  466 
formula  for  specific  heat,  463 
graph,  463 

Einstein  relation,  602 
derived  from  kinetic  theory,  612 
in  nondegenerate  case,  612 
Einstein  temperature  (ΘE),  463 
Elasticity,  theory  of,  443-447 
elastic  constants,  444-447 
and  anharmonic  terms,  489 
compliance  constant,  446 
number  of  independent,  for  seven  crystal 
systems,  445 

stiffness  constant  (modulus),  446 
tabulated  for  cubic  crystals,  447 
related  to  theory  of  lattice  vibrations,  443-447 
See  also  Bulk  modulus 

Elastic  scattering  and  Wiedemann-Franz  law, 
322-323 

Electric  breakdown,  see  Breakdown,  electric 
Electric  displacement,  534-535,  541 
Electric  field,  local,  539-542 
Electrochemical  potential,  257n 
in  semiconductors,  593-594 
Electron  affinity,  380n 
Electron,  Bloch,  see  Bloch  electrons 
Electron  charge,  sign  convention,  3n 
Electron,  core,  see  Core  electrons 
Electron  density,  4 

and  classification  of  solids,  374-379 
in  covalent  crystals,  377 
in  ionic  crystals,  381 
table  (conduction)  for  selected  metals,  5 
Electron  diffraction,  364-366 


Electron-electron  interaction,  330-352 
in  Drude  model,  4-5 
and  exclusion  principle,  346-348 
and  Fermi  liquid  theory,  345-351 
and  ground  state  energy  of  free  electron  gas,  336 
Hartree  approximation,  743 
Hartree-Fock  approximation,  334-337 
and  Hubbard  model,  685 
lifetime,  347-348 
and  magnetic  structure,  672-691 
and  Pauli  paramagnetism,  669-670 
and  periodic  potential,  132, 192,  330-331 
via  phonons,  518-519 
scattering,  5,  316,  346-348 
and  superconductivity,  739-740 
and  tight-binding  method,  186 
See  also  Fermi  liquid  theory;  Hartree-Fock 
equation;  Independent  electron  approxi¬ 
mation;  Screening 

Electron,  free,  see  Free  electron  approximation 
Electron  gas,  2-6,  316n 

See  also  Free  electron  approximation 
Electron  g-factor,  646 

Electron,  independent,  see  Independent  electron 
approximation 

Electron-ion  interaction  (static),  in  Drude  model, 
4 

and  negative  Fermi  energies,  357n 
See  also  Periodic  potential 
Electron,  nearly  free,  see  Nearly  free  electron 
approximation 

Electron-neutron  interaction  (magnetic),  470 
Electron-phonon  interaction,  519-528 
coupling  constant,  523 
and  electron  effective  mass,  520 
and  magnetic  susceptibility,  663n 
and  one  electron  energy,  519-521 
and  resistivity,  523-528 
and  superconductivity,  739-740 
Electron  spin,  see  Spin 

Electron,  tight-binding,  see  Tight-binding  ap¬ 
proximation 

Electron,  valence,  see  Valence  electrons 
Electron  volt  (eV),  37n 
accurate  conversion  factors,  757 
eV/molecule  to  calories/mole  conversion,  396n 
Electrostatic  approximation,  547n,  548n,  551n 
Electrostatic  field  of  array  of  dipoles,  558 
of  uniformly  polarized  sphere,  557 
Energy  band,  141 

See  also  Band  gap;  Band  structure;  Band  width; 
Density  of  levels 

Energy,  binding,  see  Cohesive  energy 
Energy  gap  (normal  materials),  see  Band  gap 
Energy  gap  (superconducting  materials),  727 
Energy  gap  (superconductivity)  (continued) 
and  acoustic  attenuation,  736 
BCS,  relation  to  Tc,  744 
and  electromagnetic  energy  absorption,  735 
and  magnetic  impurities,  727n 
measured  values,  745 
and  tunneling,  735 
Entropy,  current  of,  253 
of  free  electron  gas,  54 
generation  of,  253n 
of  ice,  392 

lack  of  in  persistent  currents,  730 
of  spin  system,  659-661 
Equation  of  state  of  insulators,  490-494 
and  volume  dependence  of  normal  mode 
frequencies,  490-492 
Equipartition  of  energy,  9 
Estheticism,  403n 
Ettingshausen  effect,  258n 
Ewald  construction,  101 
for  Laue  method,  102 
for  powder  method,  103 
for  rotating  crystal  method,  102 
Ewald  sphere,  101 
Exchange  energy,  334-337 
and  cohesion  in  metals,  411 
Exchange  interaction,  679 
coupling  constants,  681 

direct,  indirect,  itinerant,  and  super,  681-682 
Exchange  term  in  Hartree-Fock  equation,  333 
approximated  by  one  electron  potential,  337 
screening  of,  344,  518 
and  conduction  electron  magnetism,  684 
Excitons,  626-628 
Frenkel,  626 
Mott-Wannier,  628 

Exclamation  marks,  61,  185,  219,  224  (twice!), 
291,  305,  403,  631,  808 
Exclusion  principle,  see  Pauli  exclusion  principle 
Expansion,  thermal,  see  Thermal  expansion 
Extended  zone  scheme,  159-160 
Extremal  orbits,  in  cyclotron  resonance,  279 
and  de  Haas-van  Alphen  effect,  268-269,  273 
and  density  of  levels,  273 
and  Gantmakher  effect,  280 
and  magnetoacoustic  effect,  277 
Extrinsic  semiconductors,  564 
carrier  densities,  575-577 
See  also  Semiconductors 

Face-centered  cubic  Bravais  lattice,  68-69 
and  close-packed  spheres,  79 
conventional  cell,  75 
coordination  number,  71 


Face-centered  cubic  Bravais  lattice  (continued) 
elements  with,  70 
first  Brillouin  zone,  89 

nomenclature  for  symmetry  points,  183 
free  electron  Fermi  surface,  165-166 
free  electron  levels  in,  161 
vs.  hexagonal  close-packed  structure,  78 
higher  Brillouin  zones,  164 
lattice  sums  of  inverse  powers,  400 
packing  fraction,  83 
primitive  reciprocal  lattice  vectors,  88 
primitive  vectors,  69 
reciprocal  of,  88 

relation  to  centered  tetragonal,  117 
tight-binding  p-bands,  189-190 
tight-binding  s-bands,  181-184 
Wigner-Seitz  cell,  74 

Face-centered  orthorhombic  Bravais  lattice,  118 
FA-center,  626 
f-bands,  308-309 

fcc,  see  Face-centered  cubic  Bravais  lattice 
F-center  (Farbzentrum),  624 
simple  model,  639-640 
Fermi-Dirac  distribution,  30-31,  40-42 
classical  limit,  54 

compared  with  Maxwell-Boltzmann  distribu¬ 
tion,  30-31 
derived,  40-42 
graph,  31 

in  thermionic  emission,  363 
in  velocity  space,  30,  50 
Fermi  energy,  Bloch  electrons,  142 
vs.  chemical  potential,  43,  46,  573n 
compared  with  Debye  frequency,  529 
free  electron,  36 
table,  38 

negative  values,  357n 
of  semiconductor,  142n,  573n 
See  also  Chemical  potential 
Fermi  function,  43 
See  also  Fermi-Dirac  distribution 
Fermi  gas,  see  Free  electron  approximation 
Fermi  (contact,  hyperfine)  interaction,  665n 
Fermi  level  of  semiconductor,  573n 
See  also  Chemical  potential 
Fermi  liquid  theory,  345-351 
electron-electron  scattering,  346-348 
f-function,  350 
quasiparticles,  348-350 
summary,  351 
Fermi  momentum,  36 
Fermi  sphere,  36 
and  alkali  metals,  285-288 
in  bcc  monovalent  metal,  285 
Fermi  sphere  (continued) 
in  fcc  monovalent  metal,  289,  310 
in  fcc  tetravalent  metal,  165 
in  fcc  trivalent  metal,  301,  311 
Fermi  surface,  141-143 
of  alkali  metals,  286 
of  aluminum,  301 
of  beryllium,  300 
at  Bragg  planes,  162 
extended  zone  scheme,  159-160 
extremal,  cross-sectional  areas,  268 
free  electron,  see  Fermi  sphere 
importance  of,  264 
of  lead,  303-304 
measurement  of,  264-281 
nearly  free  electron  construction,  162-166,  169, 
172-173 

of  noble  metals,  290-292 
reduced  zone  scheme,  142 
repeated  zone  scheme,  142 
Fermi  temperature,  38-39 
table,  38 

Fermi  velocity,  36 
table  of  free  electron  values,  38 
Fermi  wave  vector,  36 
and  free  electron  density,  36 
relation  to  Debye  wave  vector,  458n 
table,  38 

in  terms  of  rs,  36 
Ferrimagnetism,  695-696 
critical  (Curie)  temperature,  699 
table,  698 

distinguished  from  ferromagnetism,  695-696, 
712n 

mean  field  theory,  723-724 
susceptibility,  712n 
table,  698 

See  also  Magnetic  ordering 
Ferroelectricity,  556-557 
table,  557 

Ferromagnetism,  672,  694-696 

anisotropy  energy,  721 
in  band  theories,  684n 
conventions  on  use  of  term,  672n,  695 
coercive  force,  721-722 
critical  exponents,  699 
critical  temperature,  672n,  699 
table,  697 

and  dipolar  interactions,  674,  718-722 
distinguished  from  ferrimagnetism,  695-696, 
712n 

domains,  718-722 
easy  and  hard  axes,  721 
in  free  electron  gas,  682-684 


Ferromagnetism  (continued) 

Heisenberg  model,  701-709 
hysteresis,  722 

low  temperature  spontaneous  magnetization, 
707-709 

magnetization  process,  721-722 
mean  field  theory,  715-718 
spin  waves,  704-709 
susceptibility,  699,  711-712,  718 
table,  697 

See  also  Magnetic  interactions;  Magnetic 
ordering;  Spontaneous  magnetization 
f-function,  350 
Fick’s  law,  602 

Field  ion  microscope,  366-367 
Field  theoretic  methods,  332,  349,  727 
First  Brillouin  zone,  see  Brillouin  zone 
Floquet’s  theorem,  133n 

Fluctuating  dipole  (van  der  Waals)  force,  389-390 
in  ionic  crystals,  402 

and  Lennard-Jones  potential,  398 

origin  of,  392-393 

Fluxoid,  749 

Flux  quantization,  734n,  749 
Force  constants,  426 
Form  factor,  atomic,  107-108 
Fourier's  law,  20 

Fourier  transform,  of  Coulomb  interaction,  334, 
351-352 

inversion  formulas,  762-764 
notational  convention,  339n 
of  screened  Coulomb  interaction,  342,  351-352 
Frankenheim,  M.  L.,  and  enumeration  of 
Bravais  lattices,  115,  119 
Free  charge  vs.  bound  charge,  534n,  777 
Free  electron  approximation,  4, 60-61 
and  alkali  metals,  285-288 
basic  formulas  (noninteracting)  summarized, 
757-758 

bulk  modulus,  39-40 
chemical  potential,  47 
and  cohesion  in  metals,  410-412, 413 
correlation  energy,  336 
density  of  conduction  electrons,  4,  59 
table,  5 

in  terms  of  kF,  36 
diamagnetism,  664-665 
dielectric  constant,  17,  338-344 
Drude  model,  2-27 

energy  density,  ground  state,  interacting,  336 
noninteracting,  38 
thermal,  47 

equation  of  state,  39,  54 
exchange  energy,  336 
Free  electron  approximation  (continued) 
failure  of,  58-60 
Fermi  liquid  theory,  345-351 
ground  state  properties,  32-40 
Hartree-Fock  approximation,  334-337 
high  density  limit,  336 

vs.  independent  electron  approximation,  4, 
60-61 

and  interactions  among  electrons,  330-352 
levels  in  fcc  lattice,  161 

level  quantization  in  magnetic  field,  270-271, 
281 

paramagnetism,  661-664 
nondegenerate  case,  669 
and  screening,  337-344 
Sommerfeld  model,  30-55 
specific  heat,  47-49 
table,  48 

spin  density  wave,  684 
thermal  properties,  40-49 
in  two  dimensions,  53 
Wigner  crystal,  684 
Free  energy,  Gibbs,  754 
Free  energy,  Helmholtz,  40,  490 
magnetic,  645 
Frenkel  defect,  620 
See  also  Defects  in  crystals 
Frenkel  exciton,  626 
Friedel  oscillations,  343 

Gantmakher  effects,  280-281 
Gap,  see  Band  gap;  Energy  gap  (superconducting 
materials) 

Gapless  superconductivity,  727n 
Gas,  classical,  compared  with  electrons  in 
nondegenerate  semiconductors,  585 
compared  with  phonon  gas,  500n,  503n, 
506-507 

sound  propagation  in,  506-507 
Gas,  electron,  see  Electron  gas;  Free  electron 
approximation 

Galvanomagnetic  effects,  see  Hall  effect;  Magne¬ 
toresistance 

Generation  current,  598 
formula  for,  605,  611
See  also  p-n  junction 
Generation  of  carriers,  603 
See  also  Semiconductors 

Geometrical  structure  factor,  see  Structure  factor 
g-factor,  electron,  274,  646 
Lande,  654,  797 

Gibbs  free  energy  (superconductor),  754 
Ginzburg-Landau  theory,  747-749
current,  748 

and  flux  quantization,  749 


Ginzburg-Landau  theory  (continued)
and  London  equation,  748-749 
and  vortices,  748n 

Glasses,  specific  heats  and  thermal  conductivity, 
504n 

Glide  plane,  114n,  126 
Graded  junction,  591 
Grain  boundary,  638 
Graphite  structure,  304 
Green’s  function  method  (KKR),  202-206 
compared  with  augmented  plane-wave  (APW) 
method,  205 

Green’s  function  (field  theoretic)  techniques,  332 
and  Fermi  liquid  theory,  349 
and  phonons  in  metals,  518n
and  screened  exchange,  344n 
and  superconductivity,  727 
Green's  theorem  (for  periodic  functions),  772 
Ground  state  energy  of  free  electron  gas,  336 
Group,  see  Point  group;  Space  group 
Growth  of  crystals,  636-637 
Gruneisen  law,  see  Bloch  T^5  law
Gruneisen  parameter,  492-494,  508-509 
alkali  halides,  table  for,  494 
in  Debye  model,  493 
See  also  Thermal  expansion 

Hall  angle,  14 

in  materials  with  open  orbits,  238 
Hall  coefficient,  13
Drude  form  for,  14 

high  field,  from  semiclassical  model,  235 
quantum  oscillations  in  high  field,  265 
sign  of,  13-14 
table,  15 

in  two  band  model,  240 
Hall  effect,  11-13
in  aluminum,  15,  300-302 
in  compensated  materials,  240 
and  density  of  carriers,  14,  235-236 
in  Drude  model,  11-15 
high  field,  from  semiclassical  model,  234-236 
problems  with  free  electron  theory  of,  58 
in  semiconductors,  563 
and  sign  of  carriers,  13-14,  235-236
in  two  band  model,  240 
Hall  field,  12 
Hall  voltage,  12 

Harmonic  approximation,  424-425,  488 
applied  to  lattice  vibrations,  422-450 
distinguished  from  small  oscillations  assump¬ 
tion,  488 

dynamical  structure  factor  in,  792-794 
energy  levels  of  N-ion  crystal,  452 
failure  of,  488-489 
Harmonic  approximation  (continued)
form  in  general,  425 
form  for  pair  potentials,  424 
and  infinite  thermal  conductivity,  497 
quantum  theory  of,  780-783 
and  theory  of  specific  heats,  452-468 
and  volume  dependence  of  normal  mode  fre¬ 
quencies,  491-492 

See  also  Anharmonic  terms;  Lattice  vibrations; 
Phonons 

Harmonic  oscillator  (quantum),  780 
Hartree  equations,  331 
from  variational  principle,  332,  351 
See  also  Periodic  potential 
Hartree-Fock  equations,  344-519 
approximation  by  one  electron  potential,  337 
and  effective  mass,  337,  352 
for  free  electrons,  334-337 
and  free  electron  “band  width,”  335 
and  free  electron  ferromagnetism,  682-684 
one  electron  levels,  334-335 
and  Landau  f-function,  350
and  Pauli  susceptibility  of  free  electrons, 
669-670 

and  specific  heat,  337 
and  spin  density  waves,  684 
H-center,  625

hcp,  see  Hexagonal  close-packed  structure
Heat  capacity,  see  Specific  heat 
Heisenberg  model,  679-681 
anisotropic,  723 

antiferromagnetic  ground  state,  704,  722 
antiferromagnetic  spin  waves,  708 
ferromagnetic  ground  state,  701-703 
ferromagnetic  spin  waves,  704-709 
Hamiltonian,  681 

high  temperature  susceptibility,  709-712 
lack  of  ordering  in  isotropic  one  and  two 
dimensional,  708 

See  also  Magnetic  interactions;  Spin  Hamil¬ 
tonian 

Heitler-London  approximation,  678-679,  689 
formula  for  exchange  splitting  in  hydrogen 
molecule,  679 

in  Hubbard  model  of  hydrogen  molecule, 
689-691 

limitations  of,  678 
Helicons,  26-27 

Helium,  solid,  and  harmonic  approximation,  425 
and  neutron  scattering  (3He),  470 
pathological,  376n 

and  small  oscillations  assumption,  488,  490 
solidification  pressure  (T  =  0),  398n
and  theory  of  quantum  solids,  422
Helmholtz  free  energy,  see  Free  energy,  Helmholtz


Hexagonal  close-packed  structure,  76-79 
c/a  ratio,  ideal,  78 
table,  77 

and  close-packed  spheres,  78
elements  with,  77 

vs.  face-centered  cubic  Bravais  lattice,  78 
and  nearly  free  electrons,  167-168,  299-300 
spin-orbit  coupling  in,  169,  299 
structure  factor,  109 

See  also  Simple  hexagonal  Bravais  lattice 
Hexagonal  crystal  system,  119 
Holes,  225-229 

and  conduction  in  semiconductors,  562 
and  DC  conductivity,  251 
effective  mass,  227 
and  Hall  effect,  235-236 
and  thermopower,  258 

See  also  Semiclassical  model;  Semiconductors 
Hole  orbits,  229 
in  noble  metals,  292 
Hole  surface,  165 
in  aluminum,  301 
in  lead,  303-304 
Honeycomb  network,  66 

and  hexagonal  close-packed  structure,  79 
as  lattice  with  basis,  75 
Horse,  after  cart,  92 
Hubbard  model,  685 
of  hydrogen  molecule,  689-691 
Hund-Mulliken  approximation,  678n 
Hund’s  rules,  650-651 
applied  to  rare  earth  ions,  657 
applied  to  transition  metal  ions,  658 
formula  for,  667 
table,  652 
Hybridization,  181 
Hydrogen  bond,  376n,  379,  391-392
and  ice,  392 

Hydrogen  molecule,  674-681 
in  Hubbard  model,  689-691 
Hyperfine  (contact,  Fermi)  interaction,  665n 
Hysteresis  (magnetic),  722 


Ice,  392 

Ideal  hexagonal  close  packed  structure,  78 
Image  charge  and  work  function,  360n 
Improper  operations,  124 
Impurities,  elastic  scattering  by,  320-322 
and  electrical  conductivity,  216,  315 
and  gapless  superconductivity,  727n 
and  Kondo  effect,  687-689 
magnetic,  685-689 
See  also  Defects  in  crystals 
Impurities  in  semiconductors,  Bohr  radius,  579 
Impurities  in  semiconductors  (continued)
and  carrier  densities,  573,  575-577,  582-584, 
586-587 

donors  and  acceptors,  577 
doping,  590-591 
importance  of,  564 
impurity  band  conduction,  584 
levels,  586-587 
table  of  binding  energies,  580 
population  of  levels  in  thermal  equilibrium, 
581-582 

and  resistivity  (graph),  565 
See  also  Semiconductors 
Independent  electron  approximation,  4,  60,  132, 
192 

failure  of,  in  low  density  electron  gas,  684 
in  magnetic  problems,  673,  676-679 
in  partially  filled  tight-binding  bands,  186- 
187 

in  rare  earth  metals,  308 
in  superconductors,  726 
vs.  free  electron  approximation,  4,  60-61 
justification  for,  345-348 
representation  of  electron-electron  interactions 
in,  330-337 

See  also  Electron-electron  interaction;  Fermi 
liquid  theory;  Hartree-Fock  equations 
Index  of  refraction,  534 
Kramers-Kronig  relations,  778 
See  also  Dielectric  constant 
Indirect  exchange,  681-682 
and  rare  earth  metals,  685n 
Indirect  optical  transitions,  567 
Inelastic  scattering  and  Wiedemann-Franz  law, 
322-323 

Inert  gas,  see  Noble  gas 

Infrared  absorption  and  reststrahl  resonance,  551 
in  superconductors,  735-736 
Inhomogeneous  semiconductors,  see  Semicon¬ 
ductors 

Insulator-metal  (Mott)  transition,  187,  685 
Insulators,  distinguished  from  metals,  60,  223, 
562 

distinguished  from  semiconductors,  562 
Integer,  convention  on  use  of  term,  34 
Interactions,  see  Electron-electron  interaction; 
Electron-ion  interaction;  Electron-phonon 
interaction;  Magnetic  dipolar  interaction; 
Magnetic  interactions;  Phonon-phonon  in¬ 
teraction 

Interband  transitions,  218 
condition  for  absence  in  semiclassical  motion, 
219-220,  773-775 
and  conductivity,  253n 
and  dielectric  constant,  778-779 


Interband  transitions  (continued)
direct,  294n 
indirect,  294n 

and  recombination  in  semiconductors,  603-604 
threshold,  294 

Intermediate  state  in  superconductors,  733n 
International  nomenclature  for  crystallographic 
point  groups,  123-125 
tables,  121,  122 
Interstitials,  616, 618 
See  also  Defects  in  crystals 
Intrinsic  semiconductors,  564 
See  also  Semiconductors 
Inversion  in  a  point,  112,  123
Inversion  symmetry  and  Fourier  coefficients  of 
periodic  potential,  137 
Ion,  convention  on  use  of  term,  416n 
Ion  cores,  374n 

See  also  Core-core  repulsion;  Core  electrons
Ionic  bond,  379,  388

Ionic  conduction,  621-622

Ionic  crystals,  377-379

I-VII  (alkali  halides),  380-385

II-VI,  385-387

III-V  (mixed  ionic-covalent),  387-389
cohesive  energy,  402-407 

color  centers  in,  622-626 
conductivity  of,  621 
compared  with  covalent,  388 
Madelung  energy,  403-405 
optical  modes  in,  547-552 
optical  properties,  549-552 
reflectivity,  551 
reststrahl,  551 

shell  model  of,  425n,  546,  549 
stability  of  crystal  structure,  413 
See  also  Alkali  halides;  Color  centers;  Ionic 
radii 

Ionic  displacement  variables,  423 
Ionic  plasma  frequency  in  metals,  512 
Ionic  radii,  382-387 
in  alkali  halides,  382-385 
table,  385 

in  II-VI  compounds,  385-387 
table,  386 

compared  with  half  nearest-neighbor  distances 
in  metals,  391 

Ion-ion  interactions,  “bare”  and  “dressed,”  513
Ionization  energies  of  alkali  metal  and  hydrogen 
atoms,  391 

of  copper  compared  with  potassium,  289n 
Ising  model,  712-713 
and  order-disorder  transition,  309n 
Isotope  effect,  739 
Itinerant  exchange,  681-682 
Jahn-Teller  theorem,  659
Jones  zone,  167 
Josephson  effects,  751-753 
Joule  heating,  26,  253n 
Junction,  590-591 
abrupt,  590 
biased,  592 
sign  convention,  597 
graded,  591 
p-n,  590

See  also  p-n  junction 

kB  (Boltzmann’s  constant),  23 
accurate  numerical  value,  757 
KKR,  see  Korringa,  Kohn,  Rostoker  method 
kF,  35 

See  also  Fermi  wave  vector 
Kelvin’s  method,  361-362 
Kilocalories  per  mole  to  eV/molecule  conversion, 
396n 

Kinetic  theory  of  gases,  applied  to  metals,  2-27 
applied  to  phonons,  499-501,  506-508 
Knight  shift,  666 
Kohler’s  rule,  262 
Kohn  anomalies,  515 
Kondo  theory,  687-689 

Korringa,  Kohn,  and  Rostoker  method  (KKR), 
202-206 

compared  with  augmented  plane-wave  (APW) 
method,  205 

Kramers  degeneracy,  659 
Kramers-Kronig  relations,  778 
Kronecker  delta,  87 
Kronig-Penney  model,  149
k-space,  34 

density  of  levels,  35,  136

Landau  diamagnetism,  664-665 
Landau-Ginzburg  theory,  see  Ginzburg-Landau 
theory 

Landau  levels  for  free  electrons,  270-271 
Landau  theory  of  a  Fermi  liquid,  see  Fermi  liquid 
theory 

Landau  tube,  273 
Lande  g-factor,  654
evaluation  of,  797 
Langevin  function,  559 
Langevin  susceptibility,  649n 
See  also  Diamagnetism 

LA  (longitudinal  acoustic)  phonons,  see  Acoustic 
modes 

Large  zone,  167 

Larmor  diamagnetism,  648-652 
See  also  Diamagnetism 


Lasers  and  measurement  of  phonon  spectrum, 
481 

Lattice,  61-62,  76n 
Lattice,  Bravais,  see  Bravais  lattice 
Lattice  conductivity,  see  Thermal  conductivity 
Lattice  constant,  73 
table,  cesium  chloride  structure,  81 
diamond  structure,  76 
fcc  monatomic  Bravais  lattice,  70
hexagonal  close-packed  structure,  77 
orthorhombic  crystals,  128 
rhombohedral  (trigonal)  crystals,  127 
sodium  chloride  structure,  80 
tetragonal  crystals,  127 
Lattice,  direct,  86 

Lattice  dynamics,  see  Lattice  vibrations 
Lattice  planes,  89-93 
convention  for  specifying,  92-93 
correspondence  with  reciprocal  lattice  vectors, 
90-91 

families  of,  90 
Miller  indices,  91-92 
Lattice  potential,  see  Periodic  potential 
Lattice,  reciprocal,  see  Reciprocal  lattice
Lattice  sums  of  inverse  powers,  table,  400 
for  Coulomb  potentials,  403-405 
table,  405 

Lattice  specific  heat,  see  Specific  heat  (lattice) 
Lattice  vibrations,  acoustic  branch,  435, 437, 442 
adiabatic  approximation,  425-426 
and  anharmonic  effects,  422, 487-509 
classical  theory  of,  422-450 
crystal  momentum,  482-486,  784-789 
Debye  model  of  spectrum,  457-461,  465-466 
density  of  normal  modes,  464-466 
determination  of  spectrum  by  light  scattering, 
481-486 

by  neutron  scattering,  470-480,  793-794 
by  X-ray  scattering,  794-795 
and  dielectric  constant  of  metals,  515-518 
dynamical  matrix,  439 
Einstein  model  of  spectrum,  462-463,  466 
and  elasticity  theory,  443-447 
and  electrical  resistivity,  315,  418,  523-528 
harmonic  approximation,  422-425 
interaction  of  radiation  with,  470-486 
in  ionic  crystals,  534,  547-549 
ionic  displacements  and  momenta  and  normal 
mode  raising  and  lowering  operators, 
781-783 

Kohn  anomalies,  515 

and  long  range  interactions,  433n,  440n,  448
in  metals,  512-531
Bohm-Staver  relation,  514 
detailed  dispersion  relation,  529 
Lattice  vibrations  (continued)
of  one-dimensional  lattice  with  basis,  433-437,
448

of  one-dimensional  monatomic  Bravais  lattice,
430-433,  448 

optical  branch,  435,  437,  442 
polarization,  440-442, 449,  781,  783 
quantum  theory  of,  452-453,  780-783 
and  scattering  of  electrons,  216,  315,  524-526
and  specific  heat,  426-429, 453-464 
summary  of  major  physical  consequences, 
417-420 

and  superconductivity,  739-740 
and  thermal  conductivity,  495-505 
thermal  energy  density,  454 
and  thermal  expansion,  490-492, 495 
and  thermopower,  258 

of  three-dimensional  lattice  with  basis,  442-443 
of  three-dimensional  monatomic  Bravais  lat¬ 
tice,  437-442,  449-450 
van  Hove  singularities,  465-468 
volume  dependence  of  frequencies,  490-492 
wave  packets  of,  496 
and  X-ray  diffraction,  96n,  794-795 
See  also  Anharmonic  terms;  Crystal  momen¬ 
tum;  Debye  model;  Harmonic  approxi¬ 
mation;  Phonons;  Polarization 
Lattice  with  a  basis,  75 
reciprocal  lattice  for,  86n 
Laue  condition,  99 

Laue  formulation  of  X-ray  diffraction,  97-99 
equivalence  to  formulation  of  Bragg,  99-100 
and  reciprocal  lattice,  97-99 
Laue  method,  101-102 
Ewald  construction  for,  102 
Law  of  mass  action,  574 

LCAO  (linear  combination  of  atomic  orbitals), 
see  Tight-binding  method 
LEED,  see  Low  energy  electron  diffraction 
Lennard-Jones  6-12  potential,  398 
graph,  399 

parameters  for  noble  gases,  398 
Level  vs.  state,  32n 

Light,  scattering  of,  see  Brillouin  scattering; 
Raman  scattering 

Lindhard  dielectric  constant,  343-344,  352 
compared  with  Thomas  Fermi,  340-343 
See  also  Dielectric  constant ;  Screening 
Line  defects,  616 

See  also  Defects  in  crystals;  Dislocations 
Linear  chain,  431-433 
with  basis,  433-437 
diatomic,  448 

and  Gruneisen  parameter,  508 

with  mth  nearest-neighbor  interactions,  448 



Liouville’s  theorem,  222,  771 
and  derivation  of  Boltzmann  equation,  319n 
Liquids,  theory  of,  vs.  theory  of  solids,  62 
Liquid  3He  and  triplet  pairing,  741 
Local  field  theory,  539-542 
Locality  assumption,  16-17,  277,  776 
Localized  moments,  685-689 
criterion  for,  686 

and  resistance  minimum,  687-689 
London  equation,  737-739 
applied  to  slab,  754 

and  Ginzburg-Landau  equation,  748-749
London  penetration  depth,  739 
Longitudinal,  see  Polarization 
Long  range  interactions,  and  lattice  sums,  403- 
405 

and  lattice  vibrations,  433n,  440n,  448 
in  ionic  crystals,  547-548 
in  metals,  513 
and  work  function,  354 
See  also  Coulomb  potential 
LO  (longitudinal  optical)  phonons,  see  Optical 
modes 

Lorentz  force,  12 
Lorentz-Lorenz  relation,  542n
Lorentz  relation,  541
Lorentz  theory  of  metals,  63,  585n
Lorenz  number,  20 
table,  21 

theoretical  value,  52 
See  also  Wiedemann-Franz  law
Low  angle  grain  boundary,  638 
Low  energy  electron  diffraction,  364-366 
Lower  critical  field,  732-733 
Lyddane-Sachs-Teller  relation,  548-549 
in  ferroelectrics,  556
generalization  of,  559 
and  soft  modes,  556 

Macroscopic  electric  field,  535-536
of  uniformly  polarized  sphere,  541,  557-558 
Macroscopic  electrostatic  Maxwell  equations, 
534-539 

Madelung  constant,  405 
table,  405 

Madelung  energy,  403-405 
Magnetic  alloys  (dilute),  685-687 
Magnetic  anisotropy  and  domain  formation,
720-721

and  easy  and  hard  axes,  721n 
energy,  720 

and  spin  Hamiltonian,  680 
and  spin  waves,  708n,  723
Magnetic  breakthrough,  see  Breakthrough, 
magnetic 
Magnetic  cooling,  659-661,  669
Magnetic  dipolar  interactions,  673-674 
and  demagnetization  factors,  722 
and  domain  formation,  718-722 
vs.  exchange  (Coulomb)  interactions,  673-674, 
680 

Magnetic  flux  quantum  (hc/2e),  734n,  749
Magnetic  impurities  in  normal  metals,  685-689 
in  superconductors,  727n 
Magnetic  interactions,  672-691 
and  adiabatic  demagnetization,  660 
and  band  theory,  684n 
convention  on  use  of  term,  672n 
electrostatic  origin  of,  672-673, 674-675 
in  free  electron  gas,  682-684 
and  Hubbard  model,  685 
and  Hund’s  rules,  650 
and  local  moments,  685-689 
and  Pauli  principle,  674 
and  resistance  minimum,  687-689 
and  spin-orbit  coupling,  674 
in  two  electron  system,  674-681, 689-691 
See  also  Heisenberg  model;  Magnetic  dipolar 
interactions;  Magnetic  ordering;  Spin 
Hamiltonian 

Magnetic  moment,  classical  definition,  667 
effective,  654 

effective  Bohr  magneton  number,  656 
electronic,  646 
of  iron  group  ions,  658 
localized,  685-687 
nuclear,  665 
of  rare  earth  ions,  657 
See  also  Susceptibility 
Magnetic  ordering,  672,  694-724 
critical  point,  712-714 
critical  temperature,  694 
Heisenberg  model,  701-711 
Ising  model,  712-713 

lack  of  in  one-  and  two-dimensional  isotropic 
Heisenberg  model,  708 
and  mean  field  theory,  715-718,  723-724 
and  neutron  scattering,  696-699 
and  nuclear  magnetic  resonance,  697-699 
types  of,  672,  694-696

See  also  Antiferromagnetism;  Critical  point; 
Ferrimagnetism;  Ferromagnetism;  Hei¬ 
senberg  model;  Magnetic  interactions; 
Mean  field  theory;  Susceptibility 
Magnetic  properties  of  superconductors,  see  Cri¬ 
tical  field;  Meissner  effect;  Superconductivity 
Magnetic  reflections  of  neutrons,  696 
Magnetic  susceptibility,  see  Susceptibility 
Magnetization  density,  644-645 
Magnetization  process,  721-722 


Magnetoacoustic  effect,  275-277 
Magnetoresistance,  12 
in  compensated  materials,  236,  240 
Drude  theory  of,  11-15 
longitudinal,  12n
of  noble  metals,  58,  291-293 
and  open  orbits,  236-237 
problems  with  free  electron  theory  of,  58 
saturation,  236 

semiclassical  theory  of,  234-239 
transverse,  12n 
in  two  band  model,  240 
Magnetostriction,  265 
Magneton,  Bohr,  646 
Magnons,  see  Spin  waves 
Majority  carriers,  598 
Many  body  problem,  332 
Martensitic  transformation  of  lithium,  284 
sodium,  82,  284 
Mass  action,  law  of,  574 
Mass,  effective,  see  Effective  mass 
Matthiessen’s  rule,  323-324,  328 
failure  of,  323 

Maxwell-Boltzmann  distribution,  30-32 
compared  with  Fermi-Dirac  distribution,  30- 
31 

graph,  31 

and  nondegenerate  semiconductors,  585 
Maxwell’s  equations  in  free  space,  17 
macroscopic  electrostatic,  534-539
M-center,  624
Mean  field  theory,  715-718 
comparison  of  critical  temperature  with  exact, 
717 

near  critical  point,  723-724 
critique  of,  715 
Curie-Weiss  law,  718

graphical  solution  for  spontaneous  magnetiza¬ 
tion,  716 

low  temperature  spontaneous  magnetization, 
717 

susceptibility,  718,  724 

validity  of  for  superconducting  transition,  744n, 
745n 

Mean  free  path  in  metals,  9 
expressed  in  terms  of  resistivity,  52 
See  also  Relaxation  time 
Mean  free  time,  see  Relaxation  time 
Mechanical  equivalent  of  heat,  428n 
Meissner  (Meissner-Ochsenfeld)  effect,  731 
and  BCS  theory,  746-747 
and  London  equation,  739 
See  also  Critical  field;  Superconductivity 
Melting,  418 
Metallic  bond,  379, 388 
Metal-insulator  (Mott)  transition,  187,  685
Metals,  distinguished  from  insulators,  60, 223, 562 
Miller  indices,  91-92 
in  cubic  crystals,  91 
Minority  carriers,  595n,  598 
currents,  609 

position  dependence  of  densities,  609 
See  also  p-n  junction;  Semiconductors 
Mirror  plane,  121 

Mixed  state  in  superconductors,  733 
compared  with  intermediate  state,  733n 
Mobility,  563n,  601-602 
Molecular  crystals,  376-378 
bonding,  398-402 
convention  on  use  of  term,  376n 
intermolecular  interaction,  389-390,  398-399
See  also  Noble  gas  solids 
Molecular  field  theory,  see  Mean  field  theory 
Mollwo  relation,  640 

Momentum  conservation,  contrasted  to  crystal 
momentum  conservation,  784-785 
See  also  Crystal  momentum 
Monatomic  Bravais  lattice,  75 
Monoclinic  crystal  system,  118 
Mott  transition,  187 
in  Hubbard  model,  685 
Mott-Wannier  exciton,  628
Muffin  tin  potential,  199 
Multiphonon  background,  476 
Multiphonon  processes,  and  anharmonic  terms, 
796 

Multiplet,  652
Multiplicity,  652 
Multipole  expansion,  355 

Na,  581
nc,  572
Nc,  573
nd,  581
Nd,  581
ni,  575

Nearest  neighbor,  71

Nearly  free  electron  approximation,  152-173 
analogy  in  theory  of  lattice  vibrations,  449n 
band  index,  153 

contrasted  with  conventional  perturbation 
theory,  153n 

eigenvalue  problem  near  Bragg  planes,  156 
energies  far  from  Bragg  planes,  155
energies  near  one  Bragg  plane,  156-159, 
169-172 

geometrical  structure  factor,  166-168 
illustrated  with  selected  metals,  284-305 
and  interband  optical  threshold  in  alkali 
metals,  294-295 
in  aluminum,  302-303 


Nearly  free  electron  approximation  ( continued ) 
levels  in  face-centered  cubic  lattice,  161 
levels  in  extended,  reduced,  and  repeated  zone
schemes,  160 

in  one  dimension,  159-161 
relation  to  orthogonalized  plane-wave  method, 
208 

relation  to  pseudopotential  method,  208-210 
similarity  to  theory  of  lattice  vibrations,  449n 
s-like  and  p-like  bands,  159 
and  spin-orbit  coupling,  169 
theory  of  surface  levels,  369-371 
Nearly  free  electron  (simple)  metals,  152,  306 
Neel  temperature,  699 
table,  697 
Nernst  effect,  258n
Net,  65n

Neutron,  cold  (thermal),  478 
energy  momentum  relation,  470-471 
compared  with  photon,  471 
groups,  477 

interaction  with  electrons,  470n,  696 
mass,  470 

Neutron  scattering,  420,  470-480,  790-795 
coherent,  470n,  790n 

conservation  laws  in  one-phonon  processes, 
478-480 

and  crystal  momentum,  471-472,  788-789 
and  Debye-Waller  factor,  486, 793 
and  determination  of  crystal  structure,  473,  793 
general  theory,  790-795 
and  magnetic  ordering,  470n,  696-697 
magnetic  reflections,  696 
one-phonon  scattering,  473-474,  793-794 
and  phonon  lifetimes,  476-478 
and  phonon  polarizations,  478,  794 
scattering  length,  790 
and  spin  waves,  709 
two-phonon  scattering,  474-476 
viewed  as  a  wave  phenomenon,  482-486 
zero-phonon  scattering,  473,  793 
Noble  gas  solids,  389-390,  398-402
bulk  moduli,  cohesive  energies,  and  nearest- 
neighbor  distances,  table  of,  401 
de  Boer  parameter,  412-413 
diamagnetic  susceptibilities,  table  of,  649 
ionic  zero  point  motion,  401,  402 
Lennard-Jones  parameters,  398
polarizabilities,  table  of,  544 
specific  heats,  428 

See  also  Helium,  solid;  Molecular  crystals 
Noble  metals,  band  structure  and  Fermi  surface, 
288-293 

bulk  moduli,  table  of,  39 

conduction  electron  densities,  table  of,  5 

Hall  coefficients,  15
Noble  metals  (continued)
hole  orbits  in,  292 
lattice  constants,  table  of,  70 
magnetoresistance,  58,  291-293
optical  properties,  297 
specific  heats,  table  of,  48 
Nondegenerate  semiconductors,  573 
See  also  Semiconductors 
Nonlocal  effects,  see  Locality  assumption 
Nonsymmorphic  space  groups,  126 
Normal  Fermi  system,  349 
Normal  modes  of  harmonic  crystal,  429 
See  also  Harmonic  approximation;  Lattice 
vibrations;  Phonons 
Normal  process  (N-process),  501 
and  thermal  equilibrium,  503 
vs.  umklapp  process,  502 
See  also  Umklapp  process 
Nuclear  magnetic  resonance,  665-666 
and  antiferromagnetism,  697,  699
and  Pauli  paramagnetism,  666 
Nuclear  size,  470n 
n-type  semiconductor,  577 
See  also  Semiconductors 

Ohm  to  electrostatic  units  conversion,  9n 
Ohm’s  law,  6-7 

One  electron  potential,  132,  192,  330-331 
effect  of  surface  on,  354-357 
representation  of  exchange  as,  337 
See  also  Hartree  equations;  Hartree-Fock 
equations;  Periodic  potential 
Onsager  relations,  262 
Open  orbits,  see  Orbits 

Optical  measurement  of  phonon  spectra,  481-486 
Optical  modes,  435,  442 
vs.  acoustic  modes,  437 
Debye  and  Einstein  models  of,  462 
in  ionic  crystals,  547-552
and  Raman  scattering,  481
See  also  Lattice  vibrations;  Phonons 
Optical  properties,  293-294,  776-779 
Brillouin  scattering,  481-482 
and  distinction  between  dielectric  constant 
and  conductivity,  776-778 
of  ionic  crystals,  549-552 
of  metals,  16-19,  293-294 
alkali  metals,  294-296 
aluminum,  302-303 

and  independent  electron  approximation, 
345n 

noble  metals,  295-297 
Raman  scattering,  481-482
of  semiconductors,  566-568
OPW,  see  Orthogonalized  plane-wave  method
Orbital  angular  momentum,  quenching  of,  657 


Orbitals,  Linear  combination  of  atomic,  see 
Tight-binding  method 

Orbits  of  Bloch  electrons  in  a  magnetic  field, 
229-233 
hole,  229 
open,  231 

and  magnetoresistance,  236-239 
in  noble  metals,  291-293 
period,  231-233 
quantization,  271-272 

relation  between  k-space  and  r-space  orbits,  230
See  also  Extremal  orbits;  Semiclassical  model 
Order  of  Bragg  reflection,  97 
relation  to  Laue  condition,  100 
Order-disorder  transition,  309n
Order  parameter  (superconductivity),  747
analogy  to  ferromagnetism,  748n
See  also  Ginzburg-Landau  theory 
Orthogonalized  plane-wave  method,  206-208 
applied  to  selected  metals,  284-305 
and  nearly  free  electron  approximation,  208 
and  pseudopotential,  208 
Orthorhombic  crystal  system,  117 
examples  among  the  elements,  128 
Oscillator  raising  and  lowering  operators,  780, 
781 

Overlap  integrals,  180n 
and  bandwidth,  184-185 
Overscreening,  519 

pa,  581
pv,  572
Pv,  573

Packing  fraction,  83 
Pade  approximants,  712n

Pairing,  see  Bardeen,  Cooper,  Schrieffer  theory;
Cooper  pairs
Paramagnetism,  653-659 
compared  with  diamagnetism  in  metals,  668 
compared  with  Larmor  diamagnetism,  668 
Curie’s  law,  655-659 
in  doped  semiconductors,  666 
Pauli,  661-664 

effect  of  electron-electron  interactions  on,
669-670 

effect  of  electron-phonon  interactions  on, 
663n 

susceptibilities,  table  of,  664 
thermal  corrections  to,  669 
van  Vleck,  653 

See  also  Curie’s  law;  Hund’s  rules;  Suscepti¬ 
bility 

Pauli  exclusion  principle,  32,  332n 
and  classical  electron  dynamics,  51
and  Cooper  pairs,  755 
and  electron-electron  scattering,  346-348 
Pauli  exclusion  principle  (continued)
and  electron  scattering  rate,  314 
and  free  electron  ground  state,  35 
and  impenetrability  of  ions,  379 
and  inertness  of  filled  bands,  222n 
irrelevance  of  for  impurity  scattering,  322 
and  magnetic  interactions,  674-675 
Pauli  paramagnetism,  see  Paramagnetism 
Pauli  spin  matrices,  algebraic  properties,  667 
p-band,  181 

tight-binding  in  simple  and  face-centered  cubic 
lattices,  189-190 
Peltier  coefficient,  258 
Peltier  effect,  258 

lack  of  in  superconductors,  256,  730,  751n
Penetration  depth,  739 

See  also  London  equation;  Superconductivity 
Periodic  boundary  condition,  see  Boundary
conditions

Periodic  functions,  Green’s  theorem  for,  772
plane-wave  expansions  of,  762-764 
Periodic  potential,  132-133,  192-193,  330-331 
See  also  Hartree  equations;  Hartree-Fock 
equations;  Bloch’s  theorem 
Perovskite  structure,  557 
Persistent  currents,  727,  728-730 
decay  of,  751n
theory  of,  750-751 
See  also  Superconductivity 
Perturbation  theory,  and  electron  levels  in  mag¬ 
netic  field,  647 
“k  •  p”,  765-766 

and  nearly  free  electron  approximation,  153n, 
156n 

Phase  shift,  148 
Phase  space,  221 
Phase  transitions,  order  of,  556 
See  also  Critical  point;  Critical  temperature 
Phonon  drag,  527-528 
Phonons,  acoustic,  435, 437, 442 
annihilation  and  creation  operators  (Bravais 
lattice),  781 

lattice  with  a  basis,  782-783 
and  band  gap  in  semiconductors,  566 
collisions  among,  499 

compared  with  classical  gas,  500n,  503n, 
506-507 

compared  with  photons,  453,  466-467 

creation,  destruction,  and  scattering  of,  497-499 

crystal  momentum,  472,  784-789 

Debye  model  of  spectrum,  457-461,  465-466

density  of  levels,  464-466 

and  density  of  vacancies,  618,  638 

and  dielectric  constant  of  metals,  515-518 

Einstein  model  of  spectrum,  462-463,  466 


Phonons  (continued)
and  electrical  resistivity,  523-528
and  electron  effective  mass,  519-521,  530 
energies  compared  with  X-ray,  480 
interactions  with  electrons,  519-528 
and  interband  transitions,  294 
in  ionic  crystals,  534,  547-549 
ionic  displacements  and  momenta  in  terms  of 
annihilation  and  creation  operators  (Bra¬
vais  lattice),  781
lattice  with  a  basis,  782-783
Kohn  anomalies,  515
lifetimes,  477

and  linewidths  of  color  centers,  625 
mean  number  in  thermal  equilibrium,  454
measurement  of  spectrum  by  light  scattering, 
481-486 

by  neutron  scattering,  470-480,  793-794 
by  X-ray  scattering,  794-795 
in  metals,  512-531
normal  processes,  501-502 
occupation  numbers,  471n 
optical,  435,  437,  442

and  optical  properties  of  semiconductors, 
567-568 

and  Pauli  susceptibility,  663n 
polarization,  440-442,  449,  781,  783 
and  neutron  scattering,  478n
processes,  0-phonon,  473,  793

1-phonon,  473-474,  793-794

2-phonon,  474-476
n-phonon,  796 

as  quantized  normal  modes,  453 

quantum  mechanics  of,  780-783 

second  sound,  506-508 

and  specific  heats,  453-464 

summary  of  major  physical  effects,  417-420

and  superconductivity,  739-740 

and  thermal  conductivity,  495-505 

thermal  energy  density,  454 

and  thermal  expansion,  490-492,  495 

and  thermopower,  258 

umklapp  processes,  501-505

van  Hove  singularities,  465-468 

wave  packets,  496 

See  also  Anharmonic  terms;  Crystal  momen¬ 
tum;  Debye  model;  Electron-phonon 
interaction;  Harmonic  approximation; 
Lattice  vibrations;  Polarization;  Umklapp 
process 

Photoconductivity,  563 

Photoelectric  effect,  and  independent  electron 
approximation,  345n 
and  measurement  of  work  function,  362 
and  surfaces  of  solids,  354 
Photons,  compared  with  phonons,  453,  466-467
energy-momentum  relation,  470-471 
compared  with  neutrons,  471 
Piezoelectricity,  555n 
Planck  radiation  law,  467 
Plane  waves,  34
lattice  sum  of,  767-768 
sum  over  first  Brillouin  zone,  767 
Plasma  frequency,  18 
ionic,  512 

numerical  formulas  for,  758 
and  optical  properties  of  metals,  18-19,  293 
Plasma  oscillation  (plasmon),  19 
observation  of,  20 
surface,  27

Plastic  deformation,  630 

p-like  nearly  free  electron  Bloch  function,  159 
p-n  junction,  590-613 
“bending  of  bands”  in,  594 
biased,  sign  convention  for,  597 
carrier  densities,  equilibrium,  592-597
graph,  597 

nonequilibrium,  606-611 
charge  density,  equilibrium,  597 
nonequilibrium,  608 

depletion  layer  (space  charge  region),  590-597, 
606-610,  611-612
currents  in,  606-608,  613 
in  equilibrium,  592-597
size  of,  equilibrium,  596 
nonequilibrium,  598 
diffusion  current,  601 
diffusion  region,  606-610 
fields  in,  613 
drift  current,  601 
fabrication  of,  590-591 
generation  current,  598,  605,  611
homogeneous  region,  606-610 
I-V curve,  600 

minority  carrier  currents,  609 
densities,  609 

potential  drop  across,  equilibrium,  593-596 
graph,  597
nonequilibrium,  598 
graph,  599 

recombination  current,  599 

related  to  generation  current,  600 
rectification  by,  597-600,  605-611 
reverse  bias,  605 
saturation  current,  600,  605 
See  also  Semiconductors 
Pockets  of  electrons  and/or  holes,  166 
Point  defects,  616 
See  also  Defects  in  crystals 
Point  groups,  see  Crystallographic  point  groups 


Poisson  distribution,  25-26 
Polar  crystals,  555n 
Polariton,  551 
Polarizability,  542-552 
of  alkali  metal  ions,  544 
atomic,  542-544 
bond,  552 

of  covalent  crystals,  552-553 
and  dielectric  constant  (Clausius-Mossotti 
relation),  542 

displacement,  543,  544-545
of  halogen  ions,  544 
of  hydrogen  atom,  558 
of  noble  gas  atoms,  544 
shell  model  of,  546,  549 
See  also  Dielectric  constant 
Polarization  density,  535 
Polarization  of  lattice  vibrations,  439 
completeness  relations,  781-783 
in  lattice  with  a  basis,  442 
and  lattice  symmetries,  449 
longitudinal  vs.  transverse,  441-442 
“strictly,”  442

and  neutron  scattering,  478n,  794 
Polarons,  626 
Polar  semiconductors,  565 
Polycrystalline  state,  64n 
in  X-ray  diffraction,  103 
Polyvalent  metals,  298-309 
band  structure,  of  rare  earth  metals,  308-309 
of  simple  metals,  298-305 
of  transition  metals,  305-308 
Potential,  periodic,  see  Periodic  potential 
Powder  method,  102-104 
Ewald  construction  for,  104
Primitive  (unit)  cell,  71-72 
volume  of,  71

in  reciprocal  lattice,  89,  93 
Primitive  vectors,  65 

algorithm  for  construction  of,  83 
Prose,  page  of  unrelieved,  627 
p-type  semiconductor,  577 
See  also  Semiconductors 
Pseudopotential  methods,  208-210 

relation  to  nearly  free  electron  method,  208-210 
relation  to  orthogonalized  plane-wave  method,  208

Pyroelectric  crystals,  538n,  554-555
symmetries  of,  555 

Quadrupole  interactions  in  crystals  of  low  symmetry,
355n

Quantum  effects  in  noble  gas  solids,  see  Zero 
point  energy 

Quantum  of  flux,  734n,  749 
Quasichemical  potential,  613
Quasiparticles  (quasielectrons),  348-350 
Quantization  of  orbits,  271-272
Quantum  solids,  417n,  422n 
Quenching  of  orbital  angular  momentum,  657 

r_s,  4

numerical  formula  for,  757 
table,  5 

Radius,  covalent,  387n 
Radius,  ionic,  see  Ionic  radii 
Raman  scattering,  419, 481-486 
anti-Stokes  and  Stokes  components,  481 
classical  view  of,  482-486 
Range  of  atomic  wave  functions,  177 
Rare  earth  ions,  effective  magneton  numbers,  657 
metals,  band  structure,  308-309 
and  indirect  exchange,  685n 
Rare  gas,  see  Noble  gas 
Rayleigh-Jeans  catastrophe,  466 
R-center,  624 
Reciprocal  lattice,  86-94 
of  body-centered  cubic  Bravais  lattice,  88 
construction  of  basis  vectors,  86 
of  face-centered  cubic  Bravais  lattice,  88 
and  lattices  with  bases,  86n 
and  Miller  indices,  91-92
proof  that  it  is  a  Bravais  lattice,  86 
reciprocal  of,  87 
rods  (surface),  365 
of  simple  cubic  Bravais  lattice,  88 
of  simple  hexagonal  Bravais  lattice,  88, 94 
of  rhombohedral  (trigonal)  Bravais  lattice,  94 
volume  of  primitive  cell,  89,  93 
See  also  Bravais  lattice;  Brillouin  zones;  Lattice 
planes 

Recombination  current,  599 
relation  to  generation  current,  600 
See  also  p-n  junction 
Recombination  time,  603 
Rectification  by  p-n  junction,  597-600,  605-611 
Reduced  zone  scheme,  142,  159-160 
Reflection,  Bragg,  100 
order  of,  97 

Reflection  coefficient  (barrier  penetration),  147 
Reflection  in  a  plane,  112n,  121 
Reflection,  specular,  96n 

Reflectivity  and  complex  dielectric  constant,  778 
and  real  dielectric  constant,  551 
Relaxation  time,  6 
and  collision  probability,  317 
for  electron-electron  scattering,  347-348 
for  electron-phonon  scattering,  525 
energy  dependent,  53,  249 
expressed  in  terms  of  resistivity,  9 
table,  10 


Relaxation  time  ( continued ) 
for  isotropic,  elastic,  electron-impurity  scattering,
326

for  phonons,  499-501 
normal  and  umklapp,  507 
and  Poisson  distribution,  25-26
generalized,  247-248 
position  dependent,  249n 
in  semiconductors,  collision,  602 
recombination,  603 

Relaxation-time  approximation,  6,  244-246 
compared  with  more  general  formulation,  318 
and  conservation  laws,  327 
critique  of,  314-328

to  general  nonequilibrium  electronic  distribution
function,  246-250
and  local  charge  conservation,  260 
and  Matthiessen’s  rule,  323-324 
validity  of  for  isotropic,  elastic  impurity  scattering,
324-326

and  Wiedemann-Franz  law,  322-323 
See  also  Boltzmann  equation;  Collisions; 
Scattering 

Renormalization  group,  699n 
Repeated  zone  scheme,  142,  160 
Residual  ray  (Reststrahl),  551
Residual  resistivity,  687 
Resistance,  electrical,  6-7 
See  also  Conductivity 
Resistance  minimum,  687-689 
Resistivity,  DC  electrical,  7-8 
table,  8 
tensor,  7n 

in  magnetic  field,  240 
See  also  Conductivity 

Resonance,  nuclear  magnetic,  see  Nuclear  magnetic
resonance
Reststrahl  (residual  ray),  551
Rhombohedral  (trigonal)  Bravais  lattice,  119
examples  among  the  elements,  127 
reciprocal  of,  94 

relation  to  cubic  Bravais  lattices,  119n,  128 
Richardson-Dushman  equation,  363-364 
Rigid  ion  approximation,  545 
failure  of,  546,  549 
Righi-Leduc  effect,  258n 
Rods,  365 

Rotating  crystal  method,  102
Ewald  construction  for,  103 
Rotation  axis,  120-121 
allowed  angles  in  Bravais  lattice,  121,  129 
Rotations  and  elastic  energy,  444 
Rotation-inversion  axis,  121 
Rotation-reflection  axis,  121 
Ruderman-Kittel  oscillations,  343 
Russell-Saunders  coupling,  650
Rydberg  (Ry),  37
accurate  numerical  value,  757 

Saturation  current,  in  p-n  junction,  600,  605 
in  thermionic  emission,  363 
Saturation  magnetization,  in  ferromagnet,  705 
in  paramagnet,  655 
s-band,  181 

tight-binding  in  face-centered  cubic  lattice, 
181-184 

sc,  see  Simple  cubic  Bravais  lattice 
Scattering,  elastic  isotropic  impurity,  320-322 
electron-electron,  5-6, 346-348 
electron-phonon,  523-528 
forward  (small  angle)  and  temperature  dependence
of  resistivity,  526
inelastic  and  failure  of  Wiedemann-Franz  law, 
322-323 

by  magnetic  impurities,  687-688 
and  persistent  currents,  750-751 
phonon-phonon,  497-499 
sources  of  electronic,  315-316
See  also  Anharmonic  terms;  Brillouin  scattering;
Collisions;  Neutron  scattering;  Raman
scattering;  Relaxation  time;  Relaxation-time
approximation;  X-ray  diffraction
Scattering  length,  790 

Schoenflies  nomenclature  for  crystallographic
point  groups,  121-123 
tables,  121-122 
Schottky  defect,  616 
See  also  Defects  in  crystals 
Screening,  337-345,  515-519 
and  electron  gas  ferromagnetism,  684 
of  Hartree-Fock  approximation,  344,  519-521 
of  ion-ion  interaction,  513 
Lindhard,  343-344 
overscreening,  519 
and  superconductivity,  739-740 
in  p-n  junction,  611 
screened  Coulomb  potential,  342 
oscillations  in,  343 
Thomas-Fermi,  340-342 
Screening  current,  734 
Screw  axis,  114n,  126 
Screw  dislocation,  632 
See  also  Dislocations 
s-d  mixing,  181 
Second  sound,  506-508 
Seebeck  effect,  24,  256
lack  of  in  superconductors,  256,  730,  751n
Self-consistent  field  (Hartree)  approximation,  see 
Periodic  potential 
Semiclassical  model,  214-241 
AC  electrical  conductivity,  252-253 
and  carrier  types,  218-219 


Semiclassical  model  (continued) 

DC  electrical  conductivity,  250-251
in  uniform  magnetic  field,  259,  261-262 
equations  of  motion,  216 
and  excitons,  628 
and  filled  bands,  221-223 
vs.  free  electron  model,  214 
and  Hall  effect,  234-236 
Hamiltonian  formulation,  771 
and  holes,  225-229 

and  impurity  levels  in  semiconductors,  577-580
and  inhomogeneous  semiconductors,  591-592 
possibility  of  failure,  592 
limits  of  validity,  219-221,  252,  773-775,  779 
Liouville’s  theorem,  222,  771 
and  magnetoresistance,  234-239,  240,  241 
and  motion  in  crossed  fields,  233 
and  orbit  quantization,  269,  271-272
phase  space,  221 
in  static  electric  field,  224,  241 
and  thermal  conductivity,  253-256 
and  thermoelectric  effects,  257-258 
transport  theory,  244-262 
in  uniform  magnetic  field,  229-233 
See  also  Bloch  electrons;  Orbits 
Semiconductors,  562-613 
band  structure,  examples  of,  568-572 
germanium,  569-570 
indium  antimonide,  570 
silicon,  569 

carrier  densities,  572-577,  581-584,  586-587
See  also  Carrier  densities  in  semiconductors 
chemical  potential,  573,  575-577 
vs.  Fermi  energy,  142n,  573n 
conduction  bands,  562 
conductivity,  563-565 
cyclotron  resonance,  570-572 
degenerate,  573 
density  of  levels,  562,  574,  586 
with  impurities,  579 
diamagnetism  (doped),  666 
diffusion  constant,  602 
diffusion  length,  604-605 
doping  of,  590-591 
electrochemical  potential,  593-594 
energy  gap  (band  gap),  562-564 
measurement  of,  566-568 
table,  566 

temperature  dependence  of,  566-567 
examples,  564-566 
extrinsic,  564 
carrier  densities,  575-577 
Fermi  level,  573n
generation  of  carriers,  603 
impurities,  see  Impurities  in  semiconductors 
vs.  insulators,  562 
Semiconductors  ( continued )
intrinsic,  564 
carrier  densities,  572-575 
law  of  mass  action,  574 
majority  carriers,  598 
minority  carriers,  595n,  598 
mobility,  563n,  601-602
nondegenerate,  573 
transport  in,  585 
n-type,  577 

optical  properties  of,  566-568 
paramagnetism  (doped),  666 
photoconductivity,  563 
p-n  junction,  590 
See  also  p-n  junction 
polar,  565 
p-type,  577

recombination  of  carriers,  603 
recombination  time,  603 
resistance,  562-564 
graph,  565 

semiclassical  model,  591-592
vs.  semimetals,  304n
thermopower,  563 
valence  bands,  562 
Semimetals,  304-305 
effective  mass,  305n 
vs.  semiconductors,  304n 
specific  heat,  307n 
Shear  strain,  630 
Shear  stress,  631

Shell  model  (ionic  crystals),  425n,  546,  549 
Shown,  it  can  be,  167, 277, 280,  320, 322, 324,  333, 
343,  442,  503,  664,  683,  685,  707,  708,  734n, 
741,  782,  792,  793,  794 
Shubnikov-de  Haas  effect,  265 
Silsbee  effect,  730 
Simple  cubic  Bravais  lattice,  65 
coordination  number,  71
example  among  elements,  69 
lattice  sum  of  inverse  powers,  400 
packing  fraction,  83
reciprocal  of,  88 

Simple  hexagonal  Bravais  lattice,  77 
reciprocal  of,  88-89 
relation  to  orthorhombic,  119n 
relation  to  rhombohedral  (trigonal),  125n
See  also  Hexagonal  close-packed  structure
Simple  metals  (nearly  free  electron  metals),  152, 
306 

Simple  monoclinic  Bravais  lattice,  118 
Simple  orthorhombic  Bravais  lattice,  117 
Simple  tetragonal  Bravais  lattice,  115
Singlet  states,  675 
Singlet-triplet  splitting,  676-679 


Size  effects,  280-281 
Skin  depth,  277,  281 

Slater  approximation  to  exchange  term,  337 
Slater  determinant,  333 
s-like  nearly  free  electron  Bloch  function,  159
Slip,  631 

Small  oscillations  approximation,  422,  488 
Sodium  chloride  structure,  80 
in  alkali  halides,  390 
compounds  with,  table  of,  80 
structure  factor,  109-110 
Soft  modes,  456n 
in  ferroelectrics,  556 
Soft  X  ray  emission,  see  X  ray 
Sommerfeld  expansion,  45-46,  53 
derivation,  760-761 
Sommerfeld  theory  of  metals,  30-55 
See  also  Free  electron  approximation 
Sound,  Bohm-Staver  relation  for  velocity  in 
metals,  514 

in  classical  gas,  506-507
and  continuum  mechanics,  514n 
as  long  wavelength  limit  of  lattice  vibrations, 
440 

in  metals,  513-515 

attenuation  of,  and  Fermi  surface,  275-277 
and  superconductivity,  736 
second,  506-508 
See  also  Elasticity,  theory  of 
Space  charge  effects  in  thermionic  emission,  363 
Space  charge  region  (depletion  layer),  591 
See  also  p-n  junction 
Space  groups,  112 
equivalence  of,  115n 
number  of,  119-120 

related  to  point  groups  and  Bravais  lattices, 
125-126 

symmorphic  vs.  nonsymmorphic,  126
Specific  heat  (electronic),  compared  with  lattice, 
463-464,  530 

constant  volume  vs.  constant  pressure,  49n 
cubic  term,  54,  59,  530 
and  density  of  levels,  47 
failure  of  classical  theory,  22-23 
free  electron  theory,  43,  46-49 
table,  48 

linear  term,  47,  48,  59 

phonon  correction  to,  520n 
in  superconductors,  734-735 
low  temperature,  746 
relation  to  critical  field,  754 
at  transition,  745 

measured  values  of  discontinuity,  747 
Specific  heat  (lattice),  417,  453-464 
compared  with  electronic,  463-464,  530 
Specific  heat  (lattice)  ( continued )
compared  with  magnetic,  669 
constant  volume  vs.  constant  pressure,  427n, 
492 

in  d-dimensions,  468
Debye  model,  457-461 
interpolation  formula,  459 
graph,  460 
table,  461 

Einstein  model,  462-463 
compared  with  Debye  model,  462-463 
graph,  463 

failure  of  classical  theory,  426-429 
high  temperature  form,  454-455, 467-468 
general  form  in  harmonic  approximation,  454 
low  temperature  form,  452, 455-457 
for  nonlinear  (small  k)  dispersion,  468 
See  also  Dulong  and  Petit  law 
Specific  heat,  magnetic,  compared  with  lattice,  669 
singularity  at  critical  point,  699 
Spectroscopic  code,  652 
Specular  reflection,  96n 

Spheres,  close  packing  of,  see  Close  packing  of 
spheres 

Spin  correlation  function,  706 
high  temperature  form,  710-711 
scaling  form  near  critical  temperature,  714n
Spin  density  wave,  684 
Spin  of  electron,  35 
and  density  of  levels,  143n
Spin  entropy,  659-661 
Spin  Hamiltonian,  679-681 
vs.  dipolar  interaction  energy,  680 
isotropy  of,  680 

See  also  Heisenberg  model;  Magnetic  interactions

Spin-orbit  coupling,  in  divalent  hexagonal  metals, 
299 

in  hexagonal  close-packed  metals,  169,  299
and  magnetic  anisotropy,  680,  720 
and  magnetic  breakthrough,  220 
and  magnetic  interactions,  674 
in  nearly  free  electron  calculations,  169 
in  tight-binding  method,  186 
Spin  waves,  in  anisotropic  Heisenberg  model, 
708n,  723 

in  antiferromagnets,  708 
in  ferromagnets,  704-709 
and  low  temperature  spontaneous  magnetization,
707-709
observation  of,  709 
superposition  of,  706-707 
See  also  Heisenberg  model 
Spontaneous  magnetization,  672,  694 
general  mean  field  form,  716 


Spontaneous  magnetization  (continued ) 
low  temperature  form,  Heisenberg  model, 
707-709 

mean  field  theory,  716 
singularity  at  critical  point,  699 
See  also  Ferrimagnetism;  Ferromagnetism 
Stacking  faults,  637-638 
State  vs.  level,  32n 
Static  lattice  model,  61,  416-420 
Statistical  mechanics,  fundamental  rule  of,  426n 
Stefan-Boltzmann  law,  467 
Stoichiometric  alloys,  309 
Stokes  component,  481 
Strain  components,  445 
Strain  tensor,  444 
Strength  of  crystals,  perfect,  630 
real,  635-636 
See  also  Dislocations 

Strictly  longitudinal  or  transverse  modes,  442 
See  also  Polarization  of  lattice  vibrations 
Structure  factor,  104-107 
for  bcc  Bravais  lattice,  considered  as  simple 
cubic  with  a  basis,  105 
for  diamond  structure,  106 
in  divalent  hexagonal  metals,  299 
dynamical,  792 

for  hexagonal  close-packed  structure,  109 
in  nearly  free  electron  approximation,  166-168 
for  sodium  chloride  structure,  109-110 
for  zincblende  structure,  109 
Sublattices,  magnetic,  695 
See  also  Antiferromagnetism;  Ferrimagnetism 
Superconductivity,  726-755 
and  acoustic  attenuation,  736 
coherence  length,  738n,  742
critical  current,  730 
in  cylindrical  wire,  755 
critical  temperature,  728 
table,  729 

density  of  superconducting  electrons,  737 
destruction  of  by  applied  field,  728 
See  also  Critical  field 
elements,  table  of,  726 
flux  quantization,  734n,  749 
and  infrared  absorption,  735-736 
gapless,  727n 
ground  state,  741 
hard  superconductors,  734 
intermediate  state,  733n 
isotope  effect,  739 
Josephson  effects,  751-753 
latent  heat  in  magnetic  field,  754 
magnetization  curves,  733 
Meissner  (Meissner-Ochsenfeld)  effect,  731,  739,
746-747 
Superconductivity  (continued)
and  microwave  absorption,  735 
mixed  state,  733 
vs.  intermediate  state,  733n 
order  parameter,  747 
vs.  perfect  conductivity,  731, 738 
perfect  diamagnetism,  727 
phase  diagram,  732 
specific  heat,  discontinuity,  745,  747 
low  temperature  electronic,  746 
relation  to  critical  field,  754 
strong  coupling,  743,  744,  745 
thermal  conductivity,  730-731
thermodynamics  of,  753-754 
thermoelectric  properties,  256,  730,  751n
and  triplet  pairing,  741 
type  I  vs.  type  II,  732-733 
vortex  lines,  733-734

See  also  Bardeen,  Cooper,  Schrieffer  theory;
Cooper  pairs;  Critical  field;  Energy  gap;
Ginzburg-Landau  theory;  London  equation;
Persistent  currents;  Tunneling
Superexchange,  681-682 

Surface  corrections  to  bulk  one-electron  potential, 
354-357 

Surface  defects,  617,  637-638 
See  also  Defects  in  crystals 
Surface  dipole  layer,  358 
Surface  levels,  367-371 
and  Bloch’s  theorem,  368 
nearly  free  electron  theory,  369-371 
Surface  plasmons,  27 
Susceptibility,  magnetic,  645 
alkali  halide  ions,  table  for,  649 
of  antiferromagnet,  699,  702 
atomic,  645-650,  653-656
high  temperature  (Heisenberg  model),  709-712 
and  calculation  of  critical  temperature,  712 
of  ions  with  partially  filled  shells,  653-659 
Landau  diamagnetic,  664-665 
Larmor  diamagnetic,  648-650 
mean  field  (Curie-Weiss  law),  718 
molar,  649 

noble  gas  atoms,  table  for,  649 
Pauli  paramagnetic,  see  Paramagnetism 
singularity  at  Tc,  699 
See  also  Critical  point;  Curie’s  law 
Symmetry  operations  of  Bravais  lattice,  113-114 
Symmorphic  space  groups,  126 
Szigeti  charge,  549 

TA  (transverse  acoustic)  phonon,  see  Acoustic 
modes 

Tetragonal  crystal  system,  115 
examples  among  the  elements,  127 


Tetravalent  metals,  302-304
Thermal  conductivity,  see  Conductivity,  thermal 
Thermal  equilibrium,  and  collisions,  6,  245 
local,  6,  244-245 
Thermal  expansion,  490-495 
and  band  gaps  of  semiconductors,  566 
coefficient  of,  492 
alkali  halides,  table  of,  494 
metals,  table  of,  496 
and  Grüneisen  parameter,  493
of  metals,  495 

temperature  dependence,  493 
and  volume  dependence  of  normal  mode 
frequencies,  490-492 
See  also  Grüneisen  parameter
Thermionic  emission,  362-364 
Thermoelectric  effects,  24-25,  26,  52,  253-259, 
261 

measurement  of,  256-257 
in  superconductors,  256,  730-731,  751n
Thermoelectric  field,  24-25 
Thermomagnetic  effects,  258n,  260 
Thermopower  (thermoelectric  power),  24,  257 
failure  of  classical  theory,  25 
free  electron  theory  of,  52 
and  lattice  vibrations,  258 
and  Peltier  effect,  258 
in  semiclassical  model,  256-258 
in  semiconductors,  563,  585 
and  sign  of  charge  carriers,  258 
of  superconductors,  256,  730-731,  751n 
and  Thomson  effect,  261 
Third  law  of  thermodynamics,  490n 
and  adiabatic  demagnetization,  660 
Thomas-Fermi  method,  340-342 
dielectric  constant,  341 
vs.  Lindhard  method,  340,  343 
nonlinear,  341 
wave  vector,  342 

See  also  Dielectric  constant;  Screening 
Thomson  effect,  26,  258,  261 
Tight-binding  method,  176-190 
“failure”  of,  187,  308,  677n 
in  lattices  with  bases,  185-186 
and  Mott  transition,  187 
vs.  nearly  free  electron  method,  179n 
p-bands,  189-190 
s-bands,  181-184 

similarity  to  theory  of  lattice  vibrations,  437n 
spin-orbit  coupling  in,  186 
and  transition  metals,  306 
Tilt  boundary,  638 
Tin,  white  vs.  grey,  302n,  564
TO  (transverse  optical)  phonons,  see  Optical 
modes 
Transition  metals,  305-308
ions,  magnetic  moment  of,  658 
specific  heats,  59 

Translations  of  Bravais  lattice,  70 
Translation  operator,  134 
in  terms  of  momentum  operator,  785 
Transmission  coefficient  (barrier  penetration),  147 
Transverse,  see  Polarization 
Triclinic  Bravais  lattice,  118 
Triclinic  crystal  system,  118 
Trigonal  Bravais  lattice,  see  Rhombohedral 
Bravais  lattice 
Trigonal  crystal  system,  119 
examples  among  the  elements,  127 
relation  to  cubic,  119
relation  to  hexagonal,  125n
Triplet  pairing  in  liquid  3He,  741n
Triplet-singlet  splitting,  676-679 
Triplet  states,  675
Trivalent  metals,  299-302 
Tube,  Landau,  273 

Tunneling  interpretation  of  Bloch  electron  motion,
185

Tunneling,  between  superconductors,  Josephson 
effects,  751-753 
normal,  751 

from  normal  metal  to  superconductor,  735 
and  energy  gap,  735 
Twinning,  632n,  637 
Twist  boundary,  638 
Two  fluid  model,  737 

Ultrasonic  attenuation,  and  Fermi  surface,  277 
in  superconductors,  736 
Umklapp  process  (U-process),  501-502 
and  choice  of  primitive  cell,  502 
and  crystal  momentum  conservation,  502 
and  electrical  resistivity,  526-528 
“freezing  out”  of,  502 
vs.  normal  process,  502 
and  phonon  drag,  527-528 
and  thermal  conductivity,  503-505 
Unit  cell,  see  Conventional  unit  cell;  Primitive 
cell 

Upper  critical  field,  732-733 

Vacancies,  616 
See  also  Defects  in  crystals 
Valence  bands,  in  metals,  193 
wave  function,  194 
compared  with  core,  195 
in  semiconductors,  562 
Valence  crystals,  see  Covalent  crystals 
Valence  electrons,  3 
mystery,  59-60 


Valence  electrons  ( continued ) 
solution  of  mystery,  223 

Van  der  Waals  force,  see  Fluctuating  dipole  force 
Van  Hove  singularities,  in  electronic  density  of 
levels,  145,  150 

in  nearly  free  electron  model,  171 
in  phonon  density  of  levels,  465, 468 
Van  Leeuwen  theorem,  269,  646
Van  Vleck  paramagnetism,  653 
Variational  principle,  for  Boltzmann  equation,
327-328

for  Schrodinger  equation  in  periodic  potential, 
769-770 

Vector  potential,  and  electron  magnetism,  646 
in  semiclassical  equations  of  motion,  771 
Velocity  of  Bloch  electrons,  141,  765 
in  tight-binding  model,  185 
V_K-center,  625

Voltmeter,  what  it  measures,  256-257 
Von  Laue  condition,  99 

Von  Laue  formulation  of  X-ray  diffraction,  97-99 
equivalence  to  formulation  of  Bragg,  99-100 
and  reciprocal  lattice,  97-99 
Von  Laue  method,  101-102 
Ewald  construction  for,  102 
Voronoy  polyhedron,  73n 
Vortex  lines  in  superconductors,  733-734 
and  Ginzburg-Landau  theory,  748n 
and  magnetic  flux  quantum,  734n 

Wannier  functions,  187-189 
orthonormality  relations,  190 
uses  of,  188 

Wave  packets,  electron,  215-217 
phonon,  496 

Wave  vectors,  density  of,  35,  136 
Whiskers,  637 
White  tin  structure,  127 
Wiedemann-Franz  law,  20-21 
classical  theory,  23 
accidental  success  of,  23,  52 
failure  for  inelastic  scattering,  323 
in  free  electron  theory,  52 
in  semiclassical  model,  255-256 
See  also  Conductivity,  thermal,  of  metals; 
Lorenz  number 
Wigner  crystal,  684 
Wigner-Eckart  theorem,  654,  668
Wigner-Seitz  cell,  73-75 
algorithm  for  construction  of,  74 
for  body-centered  cubic  Bravais  lattice,  74,  82 
for  face-centered  cubic  Bravais  lattice,  74,  82 
proof  of  primitive  nature,  74 
in  reciprocal  space,  see  Brillouin  zone,  first 
in  two  dimensions,  82 
Work  function,  354-360
and  contact  potentials,  360 
for  inequivalent  surfaces,  359
table,  364 

and  thermionic  emission,  362-364 
Work  hardening,  636 
Wronskian,  148 
Wurtzite  structure,  387n

X  rays,  characteristic  energies,  96 
compared  with  phonon  energies,  480 
X-ray  diffraction,  96-110,  794-795
atomic  form  factor,  107-108 
Bragg  condition,  97 
Bragg  formulation,  96-97 
Bragg  peaks,  96 
Bragg  reflection,  100 
Compton  background,  480 
convention  for  angle  of  incidence,  96n 
Debye-Waller  factor,  486,  792
diffuse  background,  96n 
effect  of  lattice  vibrations  on,  420,  794-795 
equivalence  of  Bragg  and  Laue  formulations,
99-100

Ewald  construction,  101 
Laue  formulation,  97-99 
Laue  method,  101 

nearly  elastic  nature  of  scattering,  98n 


X-ray  diffraction  (continued) 
from  particle  point  of  view,  473 
and  phonon  spectrum,  480,  794-795 
powder  method,  102 
rotating  crystal  method,  102 
structure  factor,  104-107 

X  ray,  soft  emission,  and  independent  electron 
approximation,  345n 
and  measurement  of  bandwidth,  335 

Yukawa  potential,  342 

Zener  breakdown,  see  Breakdown,  electric 

Zero  point  motion  of  ions,  416 
contribution  to  thermal  energy  density,  454 
and  de  Boer  parameter,  412-413
and  inadequacy  of  classical  theory  of  lattice 
vibrations,  422n 

manifestations  of  in  lighter  noble  gas  solids,
401-402

Zincblende  structure,  76,  81 
compounds  with,  tables  of,  81 
and  II-VI  ionic  crystals,  386
and  III-V  mixed  ionic  covalent  crystals,  387,  389
relation  to  wurtzite  structure,  387n
structure  factor,  109 

Zone,  see  Brillouin  zone;  Extended  zone  scheme; 
Jones  zone;  Large  zone;  Reduced  zone 
scheme;  Repeated  zone  scheme 


